back to home

Best Open Source ocr Libraries

A curated list of the most popular GitHub repositories tagged with ocr. Select any project to visualize its architecture and dive into the codebase using RepoMind's AI engine.

#1tesseract-ocr/tesseract

Tesseract Open Source OCR Engine (main repository)

72,484C++
Analyze Code

#2PaddlePaddle/PaddleOCR

Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.

70,987Python
Analyze Code

#3opendatalab/MinerU

Transforms complex documents like PDFs into LLM-ready markdown/JSON for your Agentic workflows.

54,581Python
Analyze Code

#4hiroi-sora/Umi-OCR

OCR software, free and offline. 开源、免费的离线OCR软件。支持截屏/批量导入图片,PDF文档识别,排除水印/页眉页脚,扫描/生成二维码。内置多国语言库。

42,159Python
Analyze Code

#5siyuan-note/siyuan

A privacy-first, self-hosted, fully open source personal knowledge management software, written in typescript and golang.

41,378TypeScript
Analyze Code

#6naptha/tesseract.js

Pure Javascript OCR for more than 100 Languages 📖🎉🖥

37,866JavaScript
Analyze Code

#7paperless-ngx/paperless-ngx

A community-supported supercharged document management system: scan, index and archive all your documents

36,804Python
Analyze Code

#8ShareX/ShareX

ShareX is a free and open-source application that enables users to capture or record any area of their screen with a single keystroke. It also supports uploading images, text, and various file types to a wide range of destinations.

35,673C#
Analyze Code

#9ocrmypdf/OCRmyPDF

OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched

32,673Python
Analyze Code

#10JaidedAI/EasyOCR

Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.

28,981Python
Analyze Code

#11Unstructured-IO/unstructured

Convert documents to structured data effortlessly. Unstructured is open-source ETL solution for transforming complex documents into clean, structured formats for language models. Visit our website to learn more about our enterprise grade Platform product for production grade workflows, partitioning, enrichments, chunking and embedding.

14,014HTML
Analyze Code

#12sml2h3/ddddocr

带带弟弟 通用验证码识别OCR pypi版

13,574Python
Analyze Code

#13tisfeng/Easydict

一个简洁优雅的词典翻译 macOS App。开箱即用,支持离线 OCR 识别,支持有道词典,🍎 苹果系统词典,🍎 苹果系统翻译,OpenAI,Gemini,DeepL,Google,Bing,腾讯,百度,阿里,小牛,彩云和火山翻译。A concise and elegant Dictionary and Translator macOS App for looking up words and translating text.

12,278Swift
Analyze Code

#14DayBreak-u/chineseocr_lite

超轻量级中文ocr,支持竖排文字识别, 支持ncnn、mnn、tnn推理 ( dbnet(1.8M) + crnn(2.5M) + anglenet(378KB)) 总模型仅4.7M

12,269C++
Analyze Code

#15getomni-ai/zerox

OCR & Document Extraction using vision models

12,140TypeScript
Analyze Code

#16yusufkaraaslan/Skill_Seekers

Convert documentation websites, GitHub repositories, and PDFs into Claude AI skills with automatic conflict detection

9,680Python
Analyze Code

#17ripperhe/Bob

Bob 是一款 macOS 平台的翻译和 OCR 软件。

9,549
Analyze Code

#18zyddnys/manga-image-translator

Translate manga/image 一键翻译各类图片内文字 https://cotrans.touhou.ai/ (no longer working)

9,415Python
Analyze Code

#19pymupdf/PyMuPDF

PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.

9,097Python
Analyze Code

#20bytedance/Dolphin

The official repo for “Dolphin: Document Image Parsing via Heterogeneous Anchor Prompting”, ACL, 2025.

8,827Python
Analyze Code

#21adithya-s-k/omniparse

Ingest, parse, and optimize any data format ➡️ from documents to multimedia ➡️ for enhanced compatibility with GenAI frameworks

6,799Python
Analyze Code

#22clovaai/donut

Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022

6,788Python
Analyze Code