back to home
Best Open Source pdf parsing Libraries
A curated list of the most popular GitHub repositories tagged with pdf parsing. Select any project to visualize its architecture and dive into the codebase using RepoMind's AI engine.
#1py-pdf/pypdf
A pure-python PDF library capable of splitting, merging, cropping, and transforming the pages of PDF files
9,822Python
Analyze Code
#2jsvine/pdfplumber
Plumb a PDF for detailed information about each char, rectangle, line, et cetera — and easily extract text and tables.
9,741Python
Analyze Code