back to home
Best Open Source pdf to json Libraries
A curated list of the most popular GitHub repositories tagged with pdf to json. Select any project to visualize its architecture and dive into the codebase using RepoMind's AI engine.
#1docling-project/docling
Get your documents ready for gen AI
53,757Python
Analyze Code
#2Unstructured-IO/unstructured
Convert documents to structured data effortlessly. Unstructured is open-source ETL solution for transforming complex documents into clean, structured formats for language models. Visit our website to learn more about our enterprise grade Platform product for production grade workflows, partitioning, enrichments, chunking and embedding.
14,014HTML
Analyze Code