Best Open Source analytics Libraries
A curated list of the most popular GitHub repositories tagged with analytics. Select any project to visualize its architecture and dive into the codebase using RepoMind's AI engine.
#1grafana/grafana
The open and composable observability and data visualization platform. Visualize metrics, logs, and traces from multiple sources like Prometheus, Loki, Elasticsearch, InfluxDB, Postgres and many more.
#2apache/superset
Apache Superset is a Data Visualization and Data Exploration Platform
#3metabase/metabase
The easy-to-use open source Business Intelligence and Embedded Analytics tool that lets everyone work with data :bar_chart:
#4ClickHouse/ClickHouse
ClickHouse® is a real-time analytics database management system
#5mindsdb/mindsdb
Query Engine for AI Analytics: Build self-reasoning agents across all your live data
#6duckdb/duckdb
DuckDB is an analytical in-process SQL database management system
#7umami-software/umami
Umami is a modern, privacy-focused analytics platform. An open-source alternative to Google Analytics, Mixpanel and Amplitude.
#8PostHog/posthog
🦔 PostHog is an all-in-one developer platform for building successful products. We offer product analytics, web analytics, session replay, error tracking, feature flags, experimentation, surveys, data warehouse, a CDP, and an AI product assistant to help debug your code, ship features faster, and keep all your usage and customer data in one stack.
#9academic/awesome-datascience
:memo: An awesome Data Science repository to learn and apply for real world problems.
#10langfuse/langfuse
🪢 Open source LLM engineering platform: LLM Observability, metrics, evals, prompt management, playground, datasets. Integrates with OpenTelemetry, Langchain, OpenAI SDK, LiteLLM, and more. 🍊YC W23
#11getredash/redash
Make Your Company Data Driven. Connect to any data source, easily visualize, dashboard and share your data.
#12plausible/analytics
Open source, privacy-first web analytics. Lightweight, cookie-free Google Analytics alternative. Self-hosted or cloud.
#13cube-js/cube
📊 Cube Core is open-source semantic layer for AI, BI and embedded analytics
#14openobserve/openobserve
OpenObserve is an open-source observability platform for logs, metrics, traces, and frontend monitoring. A cost-effective alternative to Datadog, Splunk, and Elasticsearch with 140x lower storage costs and single binary deployment.
#15ActivityWatch/activitywatch
The best free and open-source automated time tracker. Cross-platform, extensible, privacy-focused.
#16dagster-io/dagster
An orchestration platform for the development, production, and observation of data assets.
#17StarRocks/starrocks
The world's fastest open query engine for sub-second analytics both on and off the data lakehouse. With the flexibility to support nearly any scenario, StarRocks provides best-in-class performance for multi-dimensional analytics, real-time analytics, and ad-hoc queries. A Linux Foundation project.
#18leeoniya/uPlot
📈 A small, fast chart for time series, lines, areas, ohlc & bars
#19aws-amplify/amplify-js
A declarative JavaScript library for application development using cloud services.
#20getlago/lago
Open Source Metering and Usage Based Billing API ⭐️ Consumption tracking, Subscription management, Pricing iterations, Payment orchestration & Revenue analytics
#21hyperdxio/hyperdx
Resolve production issues, fast. An open source observability platform unifying session replays, logs, metrics, traces and errors powered by ClickHouse and OpenTelemetry.
#22delta-io/delta
An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs
#23antonkomarev/github-profile-views-counter
It counts how many times your GitHub profile has been viewed. Free cloud micro-service.
#24crate/crate
CrateDB is a distributed and scalable SQL database for storing and analyzing massive amounts of data in near real-time, even with complex queries. It is PostgreSQL-compatible, and based on Lucene.
#25openmeterio/openmeter
Metering and Billing for AI, API and DevOps. Collect and aggregate millions of usage events in real-time and enable usage-based billing.