Best Open Source data Libraries
A curated list of the most popular GitHub repositories tagged with data. Select any project to visualize its architecture and dive into the codebase using RepoMind's AI engine.
#1Asabeneh/30-Days-Of-Python
The 30 Days of Python programming challenge is a step-by-step guide to learn the Python programming language in 30 days. This challenge may take more than 100 days. Follow your own pace. These videos may help too: https://www.youtube.com/channel/UC7PNRuno1rzYPb1xLa4yktw
#2TanStack/query
🤖 Powerful asynchronous state management, server-state utilities and data fetching for the web. TS/JS, React Query, Solid Query, Svelte Query and Vue Query.
#3run-llama/llama_index
LlamaIndex is the leading document agent and OCR platform
#4metabase/metabase
The easy-to-use open source Business Intelligence and Embedded Analytics tool that lets everyone work with data :bar_chart:
#5DataExpert-io/data-engineer-handbook
This is a repo with links to everything you'd ever want to learn about data engineering
#6SheetJS/sheetjs
📗 SheetJS Spreadsheet Data Toolkit -- New home https://git.sheetjs.com/SheetJS/sheetjs
#7vercel/swr
React Hooks for Data Fetching
#8sinaptik-ai/pandas-ai
Chat with your database or your datalake (SQL, CSV, parquet). PandasAI makes data analysis conversational using LLMs and RAG.
#9PrefectHQ/prefect
Prefect is a workflow orchestration framework for building resilient data pipelines in Python.
#10airbytehq/airbyte
The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.
#11faker-js/faker
Generate massive amounts of fake data in the browser and node.js
#12oxnr/awesome-bigdata
A curated list of awesome big data frameworks, ressources and other awesomeness.
#13bchavez/Bogus
:card_index: A simple fake data generator for C#, F#, and VB.NET. Based on and ported from the famed faker.js.
#14D4Vinci/Scrapling
🕷️ An adaptive Web Scraping framework that handles everything from a single request to a full-scale crawl!
#15rawgraphs/rawgraphs-app
A web interface to create custom vector-based visualizations on top of RAWGraphs core
#16mage-ai/mage-ai
🧙 Build, run, and manage data pipelines for integrating and transforming data.
#17flyteorg/flyte
Scalable and flexible workflow orchestration platform that seamlessly unifies data, ML and analytics stacks.