Best Open Source Data Pipeline Libraries
A curated list of the most popular GitHub repositories tagged "data pipeline". Select any project to visualize its architecture and dive into the codebase using RepoMind's AI engine.
#1 airbytehq/airbyte
The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.
20,743 · Python
#2 apache/shardingsphere
Empowering Data Intelligence with Distributed SQL for Sharding, Scalability, and Security Across All Databases.
20,677 · Java
#3 debezium/debezium
Change data capture for a variety of databases. Please log issues at https://github.com/debezium/dbz/issues.
12,427 · Java