buckaroo-data / buckaroo
Buckaroo - The data table UI for Notebooks. Quickly explore dataframes, scroll through dataframes, search, sort, view summary stats and histograms. Works with Pandas, Polars, Jupyter, Marimo, VSCode Notebooks
View on GitHubAI Architecture Analysis
This repository is indexed by RepoMind. By analyzing buckaroo-data/buckaroo in our AI interface, you can instantly generate complete architecture diagrams, visualize control flows, and perform automated security audits across the entire codebase.
Our Agentic Context Augmented Generation (Agentic CAG) engine loads full source files into context on-demand, avoiding the fragmentation of traditional RAG systems. Ask questions about the architecture, dependencies, or specific features to see it in action.
Repository Overview (README excerpt)
Crawler viewBuckaroo - The Data Table for Jupyter Buckaroo is a modern data table for Jupyter that expedites the most common exploratory data analysis tasks. The most basic data analysis task - looking at the raw data, is cumbersome with the existing pandas tooling. Buckaroo starts with a modern performant data table that is sortable, has value formatting, and scrolls infinitely. On top of the core table experience, extra features like summary stats, histograms, smart sampling, auto-cleaning, and a low code UI are added. All of the functionality has sensible defaults that can be overridden to customize the experience for your workflow. Try it now with Marimo in your browser Play with Buckaroo without any installation. Full Tour Quick start Then in a Jupyter notebook: When you , it becomes the default display for Pandas and Polars DataFrames. Claude Code MCP Integration Buckaroo can be used as an MCP server in Claude Code, giving Claude the ability to open data files in an interactive table viewer. Install That's it. This downloads Buckaroo from PyPI into an isolated environment and registers the MCP server. No other installation steps are needed. Usage Once installed, ask Claude Code to view any CSV, TSV, Parquet, or JSON file: > show me sales_data.csv Claude will call the tool, which opens the file in Buckaroo's interactive table UI in your browser. Compatibility Buckaroo works in the following notebook environments: • (version >=3.6.0) • (version >=7.0) • Marimo • (with extra install) • Jupyter Lite • • (via MCP) Buckaroo works with the following DataFrame libraries: • **pandas** (version >=1.3.5) • **polars** (optional, ) Features High performance table The core data grid is based on AG-Grid. It loads thousands of cells in under a second, with highly customizable display, formatting and scrolling. Data is loaded lazily into the browser as you scroll, and serialized with parquet. You no longer have to use to poke at portions of your data. Fixed width formatting by default By default numeric columns are formatted to use a fixed width font and commas are added. This allows quick visual confirmation of magnitudes in a column. Histograms Histograms for every column give you a very quick overview of the distribution of values, including uniques and N/A. Summary stats The summary stats view can be toggled by clicking on the below the icon. Summary stats are similar to and extensible. Sorting All visible data is sortable by clicking on a column name; further clicks change sort direction then disable sort for that column. Because extreme values are included with sample rows, you can see outlier values too. Search Search is built into Buckaroo so you can quickly find the rows you are looking for. Lowcode UI Buckaroo has a simple low code UI with Python code gen. This view can be toggled by clicking the checkbox below the (lambda) icon. Autocleaning Select a cleaning method from the status bar. The autocleaning system inspects each column and runs statistics to decide if cleaning should be applied (parsing dates, stripping non-integer characters, parsing implied booleans like "yes"/"no"), then adds those operations to the low code UI. Different cleaning methods can be tried because dirty data isn't deterministic. Access it with . Read more: Autocleaning docs Extensibility at the core Summary stats are built on the Pluggable Analysis Framework that allows individual summary stats to be overridden, and new summary stats to be built in terms of existing ones. Care is taken to prevent errors in summary stats from preventing display of a dataframe. Learn More Interactive Styling Gallery The interactive styling gallery lets you see different styling configurations. You can live edit code and play with different configs. Videos • Buckaroo Full Tour 6m50s - A broad introduction to Buckaroo • The Column's the limit - PyData Boston 2025 43m - How LazyBuckaroo reliably processes laptop-large data • 19 Million row scrolling and stats demo 58s • Buckaroo PyData Boston 2025 49m - Full tour with audience Q&A • Using Buckaroo and pandas to investigate large CSVs 9m • Autocleaning quick demo 2m38s • Writing your own autocleaning functions 10m10s • Extending Buckaroo 12m56s • Styling Buckaroo 8m18s • Understanding JLisp in Buckaroo 12m42s • GeoPandas Support Articles • Using Buckaroo and pandas to investigate large CSVs • Speed up exploratory data analysis with Buckaroo Example Notebooks • Full Tour (Marimo) - Start here. Broad overview of Buckaroo's features. Also available on Google Colab and JupyterLite • Notebook on GitHub • Live Styling Gallery - Examples of all the formatters and styling available • Live Autocleaning - How autocleaning works and how to implement your own • Live Histogram Demo - Explanation of the embedded histograms • Live JLisp overview - The small lisp interpreter powering the lowcode UI • Extending Buckaroo - Adding post processing methods and custom styling • Styling Howto - In depth custom styling guide • Pluggable Analysis Framework - Adding new summary stats • Solara Buckaroo - Using Buckaroo with Solara • GeoPandas with Buckaroo Example apps built on Buckaroo • Buckaroo Compare - Join two dataframes and highlight visual differences • Buckaroo Pandera - Validate a dataframe with Pandera, then visually highlight where it fails Contributing See CONTRIBUTING.md for development setup, build instructions, and release process. We welcome issue reports; be sure to choose the proper issue template so we get the necessary information.