back to home

Best Open Source pandas Libraries

A curated list of the most popular GitHub repositories tagged with pandas. Select any project to visualize its architecture and dive into the codebase using RepoMind's AI engine.

#1Asabeneh/30-Days-Of-Python

The 30 Days of Python programming challenge is a step-by-step guide to learn the Python programming language in 30 days. This challenge may take more than 100 days. Follow your own pace. These videos may help too: https://www.youtube.com/channel/UC7PNRuno1rzYPb1xLa4yktw

58,405Python
Analyze Code

#2pandas-dev/pandas

Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more

47,933Python
Analyze Code

#3jakevdp/PythonDataScienceHandbook

Python Data Science Handbook: full text in Jupyter Notebooks

46,807Jupyter Notebook
Analyze Code

#4microsoft/Data-Science-For-Beginners

10 Weeks, 20 Lessons, Data Science for All!

33,972Jupyter Notebook
Analyze Code

#5tqdm/tqdm

:zap: A Fast, Extensible Progress Bar for Python and CLI

30,972Python
Analyze Code

#6donnemartin/data-science-ipython-notebooks

Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.

28,880Python
Analyze Code

#7sinaptik-ai/pandas-ai

Chat with your database or your datalake (SQL, CSV, parquet). PandasAI makes data analysis conversational using LLMs and RAG.

23,211Python
Analyze Code

#8ranaroussi/yfinance

Download market data from Yahoo! Finance's API

21,678Python
Analyze Code

#9huggingface/datasets

🤗 The largest hub of ready-to-use datasets for AI models with fast, easy-to-use and efficient data manipulation tools

21,200Python
Analyze Code

#10waditu/tushare

TuShare is a utility for crawling historical data of China stocks

14,458Python
Analyze Code

#11dask/dask

Parallel computing with task scheduling

13,746Python
Analyze Code

#12mwaskom/seaborn

Statistical data visualization in Python

13,739Python
Analyze Code

#13ydataai/ydata-profiling

1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.

13,387Python
Analyze Code

#14tangyudi/Ai-Learn

人工智能学习路线图,整理近200个实战案例与项目,免费提供配套教材,零基础入门,就业实战!包括:Python,数学,机器学习,数据分析,深度学习,计算机视觉,自然语言处理,PyTorch tensorflow machine-learning,deep-learning data-analysis data-mining mathematics data-science artificial-intelligence python tensorflow tensorflow2 caffe keras pytorch algorithm numpy pandas matplotlib seaborn nlp cv等热门领域

12,642
Analyze Code

#15guipsamora/pandas_exercises

Practice your pandas skills!

12,180Jupyter Notebook
Analyze Code

#16rapidsai/cudf

cuDF - GPU DataFrame Library

9,495C++
Analyze Code

#17saulpw/visidata

A terminal spreadsheet multitool for discovering and arranging data

8,836Python
Analyze Code

#18iamseancheney/python_for_data_analysis_2nd_chinese_version

《利用Python进行数据分析·第2版》

8,790
Analyze Code