## Repository Overview (README excerpt)
**Building the Virtuous Cycle for AI-driven LLM Systems**

Get Started | Documentation | Blogpost | Slack (#flashinfer-bench)

**FlashInfer-Bench** is a benchmark suite and production workflow designed to build a virtuous cycle of self-improving AI systems. It is part of a broader initiative to build the *virtuous cycle of AI improving AI systems*: enabling AI agents and engineers to collaboratively optimize the very kernels that power large language models.

### Installation

Install FlashInfer-Bench with pip:

Import FlashInfer-Bench:

### Get Started

This guide shows you how to use the FlashInfer-Bench Python module with the FlashInfer-Trace dataset.

### FlashInfer Trace Dataset

We provide an official dataset, **FlashInfer-Trace**, containing kernels and workloads from real-world AI system deployment environments. FlashInfer-Bench can use this dataset to measure and compare the performance of kernels. It follows the FlashInfer Trace Schema.

The official dataset is hosted on HuggingFace: https://huggingface.co/datasets/flashinfer-ai/flashinfer-trace

Clone it with Git LFS pointer files only (large tensor files are downloaded on demand during benchmarking):

### Collaborators

Our collaborators include:
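The install, import, and dataset-clone commands referenced above were not preserved in this excerpt. A plausible sketch follows, assuming the package is published on PyPI as `flashinfer-bench` and imports as `flashinfer_bench` (both names inferred from the project name, not confirmed by this excerpt); the `GIT_LFS_SKIP_SMUDGE=1` clone matches the stated "pointer files only" behavior:

```shell
# Install FlashInfer-Bench (assumed PyPI package name "flashinfer-bench").
pip install flashinfer-bench

# Smoke-test the import (assumed module name "flashinfer_bench").
python -c "import flashinfer_bench"

# Clone the FlashInfer-Trace dataset with LFS pointer files only:
# GIT_LFS_SKIP_SMUDGE=1 tells git-lfs to leave the large tensor files
# as pointers, so they are downloaded on demand during benchmarking.
GIT_LFS_SKIP_SMUDGE=1 git clone https://huggingface.co/datasets/flashinfer-ai/flashinfer-trace
```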