Best Open Source ETL Libraries
A curated list of the most popular GitHub repositories tagged with etl. Select any project to visualize its architecture and dive into the codebase using RepoMind's AI engine.
#1 pathwaycom/pathway
Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG.
59,654 · Python
#2 apache/airflow
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows.
44,349 · Python
#3 airbytehq/airbyte
The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.
20,743 · Python
#4 dagster-io/dagster
An orchestration platform for the development, production, and observation of data assets.
14,983 · Python
#5 mage-ai/mage-ai
🧙 Build, run, and manage data pipelines for integrating and transforming data.
8,651 · Python