back to home
Best Open Source tesseract Libraries
A curated list of the most popular GitHub repositories tagged with tesseract. Select any project to visualize its architecture and dive into the codebase using RepoMind's AI engine.
#1tesseract-ocr/tesseract
Tesseract Open Source OCR Engine (main repository)
72,484C++
Analyze Code
#2naptha/tesseract.js
Pure Javascript OCR for more than 100 Languages 📖🎉🖥
37,866JavaScript
Analyze Code
#3ocrmypdf/OCRmyPDF
OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched
32,673Python
Analyze Code
#4pymupdf/PyMuPDF
PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.
9,097Python
Analyze Code