Best Open Source distributed systems Libraries
A curated list of the most popular GitHub repositories tagged with distributed systems. Select any project to visualize its architecture and dive into the codebase using RepoMind's AI engine.
#1Snailclimb/JavaGuide
Java 面试 & 后端通用面试指南,覆盖计算机基础、数据库、分布式、高并发与系统设计。准备后端技术面试,首选 JavaGuide!
#2doocs/advanced-java
😮 Core Interview Questions & Answers For Experienced Java(Backend) Developers | 互联网 Java 工程师进阶知识完全扫盲:涵盖高并发、分布式、高可用、微服务、海量数据处理等领域知识
#3redis/redis
For developers, who are building real-time data-driven applications, Redis is the preferred, fastest, and most feature-rich cache, data structure server, and document and vector query engine.
#4binhnguyennus/awesome-scalability
The Patterns of Scalable, Reliable, and Performant Large-Scale Systems
#5etcd-io/etcd
Distributed reliable key-value store for the most critical data of a distributed system
#6karanpratapsingh/system-design
Learn how to design systems at scale and prepare for system design interviews
#7apache/dubbo
The java implementation of Apache Dubbo. An RPC and microservice framework.
#8spacedriveapp/spacedrive
Spacedrive is an open source cross-platform file explorer, powered by a virtual distributed filesystem written in Rust.
#9ashishps1/awesome-system-design-resources
Learn System Design concepts and prepare for interviews using free resources.
#10anoma/anoma
Reference implementation of Anoma
#11conductor-oss/conductor
Conductor is an event driven agentic orchestration platform providing durable and highly resilient execution engine for applications and AI Agents
#12seaweedfs/seaweedfs
SeaweedFS is a fast distributed storage system for blobs, objects, files, and data lake, for billions of files! Blob store has O(1) disk seek, cloud tiering. Filer supports Cloud Drive, xDC replication, Kubernetes, POSIX FUSE mount, S3 API, S3 Gateway, Hadoop, WebDAV, encryption, Erasure Coding. Enterprise version is at seaweedfs.com.
#13dmlc/xgboost
Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Dask, Flink and DataFlow
#14nsqio/nsq
A realtime distributed messaging platform
#15nats-io/nats-server
High-Performance server for NATS.io, the cloud and edge native messaging system.
#16temporalio/temporal
Temporal service
#17rqlite/rqlite
The lightweight, fault-tolerant database built on SQLite. Designed to keep your data highly available with minimal effort.
#18ty4z2008/Qix
Machine Learning、Deep Learning、PostgreSQL、Distributed System、Node.Js、Golang
#19git-bug/git-bug
Distributed, offline-first bug tracker embedded in git
#20cadence-workflow/cadence
Cadence is a distributed, scalable, durable, and highly available orchestration engine to execute asynchronous long-running business logic in a scalable and resilient way.
#21twitter/finagle
A fault tolerant, protocol-agnostic RPC system
#22SeleniumHQ/docker-selenium
Provides a simple way to run Selenium Grid with Chrome, Firefox, and Edge using Container Platform, making it easier to perform browser automation at scale
#23supabase/realtime
Broadcast, Presence, and Postgres Changes via WebSockets
#24paradigmxyz/reth
Modular, contributor-friendly and blazing-fast implementation of the Ethereum protocol, in Rust
#25taubyte/tau
Fullstack Workspace for Humans & Machines
#26restatedev/restate
Restate is the platform for building resilient applications that tolerate all infrastructure faults w/o the need for a PhD.
#27golemcloud/golem
Golem Cloud is the agent-native platform for building AI agents and distributed applications that never lose state, never duplicate work, and never require you to build infrastructure.
#28lambdaclass/ethrex
Minimalist, stable, modular, fast, and ZK native implementation of the Ethereum protocol in Rust. L1 and L2 execution client.
#29Rain-kl/OpenFlare
A lightweight, self-hosted control plane for OpenResty that manages reverse proxy rules, configuration releases, node synchronization, TLS certificates, and observability.