back to home
Best Open Source document parser Libraries
A curated list of the most popular GitHub repositories tagged with document parser. Select any project to visualize its architecture and dive into the codebase using RepoMind's AI engine.
#1infiniflow/ragflow
RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to create a superior context layer for LLMs
73,497Python
Analyze Code
#2docling-project/docling
Get your documents ready for gen AI
53,757Python
Analyze Code
#3Unstructured-IO/unstructured
Convert documents to structured data effortlessly. Unstructured is open-source ETL solution for transforming complex documents into clean, structured formats for language models. Visit our website to learn more about our enterprise grade Platform product for production grade workflows, partitioning, enrichments, chunking and embedding.
14,014HTML
Analyze Code