Best Open Source Observability Libraries
A curated list of the most popular GitHub repositories tagged with observability. Select any project to visualize its architecture and dive into the codebase using RepoMind's AI engine.
#1 netdata/netdata
The fastest path to AI-powered full stack observability, even for lean teams.
#2 langfuse/langfuse
Open source LLM engineering platform: LLM Observability, metrics, evals, prompt management, playground, datasets. Integrates with OpenTelemetry, Langchain, OpenAI SDK, LiteLLM, and more. YC W23
#3 SigNoz/signoz
SigNoz is an open-source observability platform native to OpenTelemetry with logs, traces and metrics in a single application. An open-source alternative to DataDog, NewRelic, etc. Open source Application Performance Monitoring (APM) & Observability tool
#4 mlflow/mlflow
The open source AI engineering platform for agents, LLMs, and ML models. MLflow enables teams of all sizes to debug, evaluate, monitor, and optimize production-quality AI applications while controlling costs and managing access to models and data.
#5 apache/skywalking
APM, Application Performance Monitoring System
#6 cilium/cilium
eBPF-based Networking, Security, and Observability
#7 elastic/kibana
Your window into all of your data
#8 mikeroyal/Self-Hosting-Guide
Self-Hosting Guide. Learn all about locally hosting (on premises & private web servers) and managing software applications by yourself or your organization. Including Cloud, LLMs, WireGuard, Automation, Home Assistant, and Networking.
#9 openobserve/openobserve
OpenObserve is an open-source observability platform for logs, metrics, traces, and frontend monitoring. A cost-effective alternative to Datadog, Splunk, and Elasticsearch with 140x lower storage costs and single binary deployment.
#10 openzipkin/zipkin
Zipkin is a distributed tracing system
#11 kubesphere/kubesphere
The container platform tailored for Kubernetes multi-cloud, datacenter, and edge management
#12 VictoriaMetrics/VictoriaMetrics
VictoriaMetrics: fast, cost-effective monitoring solution and time series database
#13 upgundecha/howtheysre
A curated collection of publicly available resources on how technology and tech-savvy organizations around the world practice Site Reliability Engineering (SRE)
#14 hyperdxio/hyperdx
Resolve production issues, fast. An open source observability platform unifying session replays, logs, metrics, traces and errors powered by ClickHouse and OpenTelemetry.
#15 highlight/highlight
highlight.io: The open source, full-stack monitoring platform. Error monitoring, session replay, logging, distributed tracing, and more.
#16 openstatusHQ/openstatus
Status page with uptime monitoring & API monitoring as code
#17 grafana/mimir
Grafana Mimir provides horizontally scalable, highly available, multi-tenant, long-term storage for Prometheus.
#18 micrometer-metrics/micrometer
An application observability facade for the most popular observability tools. Think SLF4J, but for observability.
#19 parca-dev/parca
Continuous profiling for analysis of CPU and memory usage, down to the line number and throughout time. Saving infrastructure cost, improving performance, and increasing reliability.
#20 rajnandan1/kener
Stunning status pages, batteries included!
#21 Agenta-AI/agenta
The open-source LLMOps platform: prompt playground, prompt management, LLM evaluation, and LLM observability all in one place.
#22 odigos-io/odigos
Distributed tracing without code changes. Instantly monitor any application using OpenTelemetry and eBPF
#23 DataDog/datadog-agent
Main repository for Datadog Agent
#24 percona/pmm
Percona Monitoring and Management: an open source database monitoring, observability and management tool
#25 santifer/cv-santiago
Interactive CV with AI chat integration. Built with React 19, TypeScript, Claude API. Chat with my AI avatar about my experience.
#26 eunomia-bpf/agentsight
Zero-instrumentation LLM and AI agent (e.g. Claude Code, openclaw, gemini-cli) observability in eBPF
#27 Linuxfabrik/monitoring-plugins
230+ monitoring plugins for Icinga, Nagios & friends. Python 3.9+, all platforms. Smart defaults, auto-discovery, consistent cross-platform metrics, minimal dependencies.
#28 jfrog/boost
Less is more. Make your agents smarter and faster. It's not just about saving time; it's about the feeling of not wasting it.
#29 netobserv/netobserv-operator
A Kubernetes operator for network observability
#30 Siddhant-K-code/agent-trace
Observability for AI agents. See what your agent did, why it cost that much, and what to fix.
#31 last9/last9-mcp-server
Last9 MCP Server