
Best Open Source Distributed Systems Libraries

A curated list of the most popular GitHub repositories tagged with distributed systems. Select any project to visualize its architecture and dive into the codebase using RepoMind's AI engine.

#1 Snailclimb/JavaGuide

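A Java and general backend interview guide covering computer science fundamentals, databases, distributed systems, high concurrency, and system design. If you are preparing for a backend technical interview, JavaGuide is the first choice!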

154,301 · Java

#2 doocs/advanced-java

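😮 Core Interview Questions & Answers for Experienced Java (Backend) Developers | A complete primer on advanced knowledge for internet-industry Java engineers, covering high concurrency, distributed systems, high availability, microservices, massive data processing, and related areas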

78,904 · Java

#3 redis/redis

For developers who are building real-time data-driven applications, Redis is the preferred, fastest, and most feature-rich cache, data structure server, and document and vector query engine.

73,459 · C

#4 binhnguyennus/awesome-scalability

The Patterns of Scalable, Reliable, and Performant Large-Scale Systems

69,462

#5 etcd-io/etcd

Distributed reliable key-value store for the most critical data of a distributed system

51,680 · Go

#6 karanpratapsingh/system-design

Learn how to design systems at scale and prepare for system design interviews

42,177

#7 apache/dubbo

The Java implementation of Apache Dubbo. An RPC and microservice framework.

41,708 · Java

#8 spacedriveapp/spacedrive

Spacedrive is an open source cross-platform file explorer, powered by a virtual distributed filesystem written in Rust.

37,384 · Rust

#9 ashishps1/awesome-system-design-resources

Learn System Design concepts and prepare for interviews using free resources.

35,208 · Java

#10 anoma/anoma

Reference implementation of Anoma

34,000 · Elixir

#11 conductor-oss/conductor

Conductor is an event-driven agentic orchestration platform providing a durable and highly resilient execution engine for applications and AI agents.

31,520 · Java

#12 seaweedfs/seaweedfs

SeaweedFS is a fast distributed storage system for blobs, objects, files, and data lake, for billions of files! Blob store has O(1) disk seek, cloud tiering. Filer supports Cloud Drive, xDC replication, Kubernetes, POSIX FUSE mount, S3 API, S3 Gateway, Hadoop, WebDAV, encryption, Erasure Coding. Enterprise version is at seaweedfs.com.

30,965 · Go

#13 dmlc/xgboost

Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on a single machine, Hadoop, Spark, Dask, Flink and DataFlow.

28,138 · C++

#14 nsqio/nsq

A realtime distributed messaging platform

25,881 · Go

#15 nats-io/nats-server

High-Performance server for NATS.io, the cloud and edge native messaging system.

19,360 · Go

#16 temporalio/temporal

Temporal service

18,942 · Go

#17 rqlite/rqlite

The lightweight, fault-tolerant database built on SQLite. Designed to keep your data highly available with minimal effort.

17,352 · Go

#18 ty4z2008/Qix

Machine Learning, Deep Learning, PostgreSQL, Distributed System, Node.js, Golang

15,080

#19 juicedata/juicefs

JuiceFS is a distributed POSIX file system built on top of Redis and S3.

13,463 · Go

#20 git-bug/git-bug

Distributed, offline-first bug tracker embedded in git

9,718 · Go

#21 cadence-workflow/cadence

Cadence is a distributed, scalable, durable, and highly available orchestration engine to execute asynchronous long-running business logic in a scalable and resilient way.

9,220 · Go

#22 twitter/finagle

A fault tolerant, protocol-agnostic RPC system

8,871 · Scala

#23 SeleniumHQ/docker-selenium

Provides a simple way to run Selenium Grid with Chrome, Firefox, and Edge using Container Platform, making it easier to perform browser automation at scale

8,613 · Shell

#24 hazelcast/hazelcast

Hazelcast is a unified real-time data platform combining stream processing with a fast data store, allowing customers to act instantly on data-in-motion for real-time insights.

6,584 · Java

#25 Eventual-Inc/Daft

High-performance data engine for AI and multimodal workloads. Process images, audio, video, and structured data at any scale

5,429 · Rust

#26 taubyte/tau

Fullstack Workspace for Humans & Machines

4,987 · Go

#27 matrixorigin/matrixone

AI-native HTAP database with Git-for-Data and built-in vector search, serving as the data and memory backbone for intelligent agents and applications.

1,840 · Go

#28 scalar-labs/scalardb

Universal HTAP Engine

640 · Java