DeepSeek-OCR
Visit ToolDeepSeek-OCR is an open-source OCR tool that focuses on contexts optical compression. It allows exploration of visual-text compression boundaries and is supported in upstream vLLM.
At a glance
Trending
DeepSeek-OCR is an open-source OCR tool that focuses on contexts optical compression. It allows exploration of visual-text compression boundaries and is supported in upstream vLLM.
Trending
About
DeepSeek-OCR is an open-source tool developed by DeepSeek-AI, designed for advanced Optical Character Recognition (OCR) with a focus on contexts optical compression. It enables users to explore the boundaries of visual-text compression, offering various resolution modes including native (Tiny, Small, Base, Large) and dynamic (Gundam). The tool is officially supported in upstream vLLM, providing efficient inference capabilities for both image and PDF processing. It also supports inference via Transformers, allowing for flexible integration into existing workflows. DeepSeek-OCR can handle diverse prompts, from converting documents to markdown and free OCR to parsing figures and general image descriptions, making it a versatile solution for developers and data scientists working with visual data extraction.
Capabilities
Pricing & Plans
Open Source
Free
FAQs
Trending