Best Open Source distributed Libraries
A curated list of the most popular GitHub repositories tagged with distributed. Select any project to visualize its architecture and dive into the codebase using RepoMind's AI engine.
#1tensorflow/tensorflow
An Open Source Machine Learning Framework for Everyone
#2ClickHouse/ClickHouse
ClickHouse® is a real-time analytics database management system
#3mudler/LocalAI
:robot: The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement, running on consumer-grade hardware. No GPU required. Runs gguf, transformers, diffusers and many more. Features: Generate Text, MCP, Audio, Video, Images, Voice Cloning, Distributed, P2P and decentralized inference
#4milvus-io/milvus
Milvus is a high-performance, cloud-native vector database built for scalable vector ANN search
#5ray-project/ray
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
#6nextcloud/server
☁️ Nextcloud server, a safe home for all your data
#7surrealdb/surrealdb
A scalable, distributed, collaborative, document-graph database, for the realtime web
#8xuxueli/xxl-job
A distributed task scheduling framework.(分布式任务调度平台XXL-JOB)
#9ageron/handson-ml
⛔️ DEPRECATED – See https://github.com/ageron/handson-ml3 instead.
#10taosdata/TDengine
High-performance, scalable time-series database designed for Industrial IoT (IIoT) scenarios
#11redisson/redisson
Redisson - Valkey & Redis Java client. Real-Time Data Platform. Sync/Async/RxJava/Reactive API. Over 50 Valkey and Redis based Java objects and services: Set, Multimap, SortedSet, Map, List, Queue, Deque, Semaphore, Lock, AtomicLong, Map Reduce, Bloom filter, Spring, Tomcat, Scheduler, JCache API, Hibernate, RPC, local cache..
#12phoenixframework/phoenix
Peace of mind from prototype to production
#13dgraph-io/dgraph
high-performance graph database for real-time use cases
#14dianping/cat
CAT 作为服务端项目基础组件,提供了 Java, C/C++, Node.js, Python, Go 等多语言客户端,已经在美团点评的基础架构中间件框架(MVC框架,RPC框架,数据库框架,缓存框架等,消息队列,配置系统等)深度集成,为美团点评各业务线提供系统丰富的性能指标、健康状况、实时告警等。
#15teambit/bit
AI-powered development workspaces with reusable components, architectural clarity and zero overhead.
#16microsoft/LightGBM
A fast, distributed, high performance gradient boosting (GBT, GBDT, GBRT, GBM or MART) framework based on decision tree algorithms, used for ranking, classification and many other machine learning tasks.
#17microsoft/CNTK
Microsoft Cognitive Toolkit (CNTK), an open source deep-learning toolkit
#18microsoft/nni
An open source AutoML toolkit for automate machine learning lifecycle, including feature engineering, neural architecture search, model compression and hyper-parameter tuning.
#19diaspora/diaspora
A privacy-aware, distributed, open source social network.
#20optuna/optuna
A hyperparameter optimization framework
#21Oneflow-Inc/oneflow
OneFlow is a deep learning framework designed to be user-friendly, scalable and efficient.
#22orbitdb/orbitdb
Peer-to-Peer Databases for the Decentralized Web
#23apache/storm
Apache Storm
#24hatchet-dev/hatchet
🪓 Run Background Tasks at Scale
#25hazelcast/hazelcast
Hazelcast is a unified real-time data platform combining stream processing with a fast data store, allowing customers to act instantly on data-in-motion for real-time insights.