Best Open Source statistics Libraries
A curated list of the most popular GitHub repositories tagged with statistics. Select any project to visualize its architecture and dive into the codebase using RepoMind's AI engine.
#1scikit-learn/scikit-learn
scikit-learn: machine learning in Python
#2umami-software/umami
Umami is a modern, privacy-focused analytics platform. An open-source alternative to Google Analytics, Mixpanel and Amplitude.
#3CamDavidsonPilon/Probabilistic-Programming-and-Bayesian-Methods-for-Hackers
aka "Bayesian Methods for Hackers": An introduction to Bayesian methods + probabilistic programming with a computation/understanding-first, mathematics-second point of view. All in pure Python ;)
#4plausible/analytics
Simple, open source, lightweight and privacy-friendly web analytics alternative to Google Analytics.
#5qax-os/excelize
Go language library for reading and writing Microsoft Excel™ (XLAM / XLSM / XLSX / XLTM / XLTX) spreadsheets
#6virgili0/Virgilio
Your new Mentor for Data Science E-Learning.
#7XAMPPRocky/tokei
Count your code, quickly.
#8ydataai/ydata-profiling
1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.
#9johnkerl/miller
Miller is like awk, sed, cut, join, and sort for name-indexed data such as CSV, TSV, and tabular JSON
#10git-quick-stats/git-quick-stats
▁▅▆▃▅ Git quick statistics is a simple and efficient way to access various statistics in git repository.
#11mahmoud/boltons
🔩 Like builtins, but boltons. 250+ constructs, recipes, and snippets which extend (and rely on nothing but) the Python standard library. Nothing like Michael Bolton.
#12IonicaBizau/git-stats
🍀 Local git statistics including GitHub-like contributions calendars.