Best Open Source Distributed Libraries

A curated list of the most popular GitHub repositories tagged "distributed". Select any project to visualize its architecture and dive into the codebase using RepoMind's AI engine.

#1 tensorflow/tensorflow

An Open Source Machine Learning Framework for Everyone

193,874 · C++

#2 ClickHouse/ClickHouse

ClickHouse® is a real-time analytics database management system

45,985 · C++

#3 mudler/LocalAI

The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement, running on consumer-grade hardware. No GPU required. Runs gguf, transformers, diffusers and many more. Features: Generate Text, MCP, Audio, Video, Images, Voice Cloning, Distributed, P2P and decentralized inference

42,939 · Go

#4 milvus-io/milvus

Milvus is a high-performance, cloud-native vector database built for scalable vector ANN search

42,914 · Go

#5 ray-project/ray

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

41,416 · Python

#6 nextcloud/server

☁️ Nextcloud server, a safe home for all your data

34,143 · PHP

#7 surrealdb/surrealdb

A scalable, distributed, collaborative, document-graph database, for the realtime web

31,270 · Rust

#8 xuxueli/xxl-job

A distributed task scheduling framework. (XXL-JOB, a distributed task scheduling platform)

29,918 · Java

#9 ageron/handson-ml

⛔️ DEPRECATED – See https://github.com/ageron/handson-ml3 instead.

25,876 · Jupyter Notebook

#10 taosdata/TDengine

High-performance, scalable time-series database designed for Industrial IoT (IIoT) scenarios

24,734 · C

#11 redisson/redisson

Redisson - Valkey & Redis Java client. Real-Time Data Platform. Sync/Async/RxJava/Reactive API. Over 50 Valkey and Redis based Java objects and services: Set, Multimap, SortedSet, Map, List, Queue, Deque, Semaphore, Lock, AtomicLong, Map Reduce, Bloom filter, Spring, Tomcat, Scheduler, JCache API, Hibernate, RPC, local cache..

24,254 · Java

#12 phoenixframework/phoenix

Peace of mind from prototype to production

22,889 · Elixir

#13 dgraph-io/dgraph

High-performance graph database for real-time use cases

21,618 · Go

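#14 dianping/cat

CAT is a foundational server-side component that provides clients in multiple languages, including Java, C/C++, Node.js, Python, and Go. It is deeply integrated into Meituan-Dianping's infrastructure middleware (MVC framework, RPC framework, database framework, caching framework, message queue, configuration system, and others), and it provides Meituan-Dianping's business lines with rich performance metrics, health status, real-time alerting, and more.

18,977 · Java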

#15 teambit/bit

AI-powered development workspaces with reusable components, architectural clarity and zero overhead.

18,356 · TypeScript

#16 microsoft/LightGBM

A fast, distributed, high performance gradient boosting (GBT, GBDT, GBRT, GBM or MART) framework based on decision tree algorithms, used for ranking, classification and many other machine learning tasks.

18,095 · C++

#17 microsoft/CNTK

Microsoft Cognitive Toolkit (CNTK), an open source deep-learning toolkit

17,612 · C++

#18 microsoft/nni

An open source AutoML toolkit to automate the machine learning lifecycle, including feature engineering, neural architecture search, model compression and hyper-parameter tuning.

14,341 · Python

#19 diaspora/diaspora

A privacy-aware, distributed, open source social network.

13,874 · Ruby

#20 optuna/optuna

A hyperparameter optimization framework

13,551 · Python

#21 Oneflow-Inc/oneflow

OneFlow is a deep learning framework designed to be user-friendly, scalable and efficient.

9,394 · C++

#22 orbitdb/orbitdb

Peer-to-Peer Databases for the Decentralized Web

8,737 · JavaScript

#23 apache/storm

Apache Storm

6,672 · Java

#24 hatchet-dev/hatchet

🪓 Run Background Tasks at Scale

6,633 · Go

#25 hazelcast/hazelcast

Hazelcast is a unified real-time data platform combining stream processing with a fast data store, allowing customers to act instantly on data-in-motion for real-time insights.

6,596 · Java