back to home

Best Open Source mlops Libraries

A curated list of the most popular GitHub repositories tagged with mlops. Select any project to visualize its architecture and dive into the codebase using RepoMind's AI engine.

#1GokuMohandas/Made-With-ML

Learn how to develop, deploy and iterate on production-grade ML applications.

46,810Jupyter Notebook
Explore Repo

#2apache/airflow

Apache Airflow - A platform to programmatically author, schedule, and monitor workflows

44,671Python
Explore Repo

#3qdrant/qdrant

Qdrant - High-performance, massive-scale Vector Database and Vector Search Engine for the next generation of AI. Also available in the cloud https://cloud.qdrant.io/

29,618Rust
Explore Repo

#4HumanSignal/label-studio

Label Studio is a multi-type data labeling and annotation tool with standardized output format

26,745TypeScript
Explore Repo

#5mlflow/mlflow

The open source AI engineering platform for agents, LLMs, and ML models. MLflow enables teams of all sizes to debug, evaluate, monitor, and optimize production-quality AI applications while controlling costs and managing access to models and data.

24,816Python
Explore Repo

#6Avaiga/taipy

Turns Data and AI algorithms into production-ready web applications in no time.

19,113Python
Explore Repo

#7NirDiamant/agents-towards-production

This repository delivers end-to-end, code-first tutorials covering every layer of production-grade GenAI agents, guiding you from spark to scale with proven patterns and reusable blueprints for real-world launches.

18,269Jupyter Notebook
Explore Repo

#8stas00/ml-engineering

Machine Learning Engineering Open Book

17,414Python
Explore Repo

#9argoproj/argo-workflows

Workflow Engine for Kubernetes

16,531Go
Explore Repo

#10weaviate/weaviate

Weaviate is an open-source vector database that stores both objects and vectors, allowing for the combination of vector search with structured filtering with the fault tolerance and scalability of a cloud-native database​.

15,817Go
Explore Repo

#11microsoft/agent-lightning

The absolute trainer to light up AI agents.

15,473Python
Explore Repo

#12dagster-io/dagster

An orchestration platform for the development, production, and observation of data assets.

15,112Python
Explore Repo

#13Netflix/metaflow

Build, Manage and Deploy AI/ML Systems

9,951Python
Explore Repo

#14activeloopai/deeplake

the GPU-native, sandboxed Postgres for AI agents

9,037C++
Explore Repo

#15argilla-io/argilla

Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets

4,895Python
Explore Repo

#16tencentmusic/cube-studio

cube studio开源云原生一站式机器学习/深度学习/大模型AI平台,mlops算法链路全流程,算力租赁平台,notebook在线开发,拖拉拽任务流pipeline编排,多机多卡分布式训练,超参搜索,推理服务VGPU虚拟化,边缘计算,标注平台自动化标注,deepseek等大模型sft微调/奖励模型/强化学习训练,vllm/ollama/mindie大模型多机推理,私有知识库,AI模型市场,支持国产cpu/gpu/npu 昇腾生态,支持RDMA,支持pytorch/tf/mxnet/deepspeed/paddle/colossalai/horovod/ray/volcano等分布式

4,887Python
Explore Repo

#17PacktPublishing/LLM-Engineers-Handbook

The LLM's practical guide: From the fundamentals to deploying advanced LLM and RAG apps to AWS using LLMOps best practices

4,834Python
Explore Repo

#18alvinreal/awesome-opensource-ai

Curated list of the best truly open-source AI projects, models, tools, and infrastructure.

3,004Python
Explore Repo

#19pixeltable/pixeltable

Data Infrastructure providing a declarative, incremental approach for multimodal AI workloads.

1,548Python
Explore Repo

#20hongbo-miao/hongbomiao.com

A personal research and development (R&D) lab that facilitates the sharing of knowledge.

294Python
Explore Repo

#21caraml-dev/merlin

Kubernetes-friendly ML model management, deployment, and serving.

183Go
Explore Repo

#22flyteorg/flyte-sdk

Type-safe, distributed orchestration of agents, ML pipelines, and real-time inference — in pure Python with async/await.

111Python
Explore Repo