Best Open Source bigdata Libraries
A curated list of the most popular GitHub repositories tagged with bigdata. Select any project to visualize its architecture and dive into the codebase using RepoMind's AI engine.
#1DataExpert-io/data-engineer-handbook
This is a repo with links to everything you'd ever want to learn about data engineering
#2taosdata/TDengine
High-performance, scalable time-series database designed for Industrial IoT (IIoT) scenarios
#3rustfs/rustfs
🚀2.3x faster than MinIO for 4KB object payloads. RustFS is an open-source, S3-compatible high-performance object storage system supporting migration and coexistence with other S3-compatible platforms such as MinIO and Ceph.
#4apache/shardingsphere
Empowering Data Intelligence with Distributed SQL for Sharding, Scalability, and Security Across All Databases.
#5oxnr/awesome-bigdata
A curated list of awesome big data frameworks, ressources and other awesomeness.
#6juicedata/juicefs
JuiceFS is a distributed POSIX file system built on top of Redis and S3.
#7databendlabs/databend
Data Agent Ready Warehouse : One for Analytics, Search, AI, Python Sandbox. — rebuilt from scratch. Unified architecture on your S3.