back to home

Best Open Source retrieval augmented generation Libraries

A curated list of the most popular GitHub repositories tagged with retrieval augmented generation. Select any project to visualize its architecture and dive into the codebase using RepoMind's AI engine.

#1infiniflow/ragflow

RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to create a superior context layer for LLMs

73,497Python
Analyze Code

#2pathwaycom/llm-app

Ready-to-run cloud templates for RAG, AI pipelines, and enterprise search with live data. 🐳Docker-friendly.⚡Always in sync with Sharepoint, Google Drive, S3, Kafka, PostgreSQL, real-time data APIs, and more.

56,280Jupyter Notebook
Analyze Code

#3chatchat-space/Langchain-Chatchat

Langchain-Chatchat(原Langchain-ChatGLM)基于 Langchain 与 ChatGLM, Qwen 与 Llama 等语言模型的 RAG 与 Agent 应用 | Langchain-Chatchat (formerly langchain-ChatGLM), local knowledge based LLM (like ChatGLM, Qwen and Llama) RAG and Agent app with langchain

37,306Python
Analyze Code

#4HKUDS/LightRAG

[EMNLP2025] "LightRAG: Simple and Fast Retrieval-Augmented Generation"

28,485Python
Analyze Code

#5stanford-oval/storm

An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.

27,914Python
Analyze Code

#6deepset-ai/haystack

Open-source AI orchestration framework for building context-engineered, production-ready LLM applications. Design modular pipelines and agent workflows with explicit control over retrieval, routing, memory, and generation. Built for scalable agents, RAG, multimodal applications, semantic search, and conversational systems.

24,250MDX
Analyze Code

#7llmware-ai/llmware

Unified framework for building enterprise RAG pipelines with small, specialized models

14,848Python
Analyze Code

#8HKUDS/RAG-Anything

"RAG-Anything: All-in-One RAG Framework"

13,642Python
Analyze Code

#9memvid/memvid

Memory layer for AI Agents. Replace complex RAG pipelines with a serverless, single-file memory layer. Give your agents instant retrieval and long-term memory.

13,175Rust
Analyze Code

#10neuml/txtai

💡 All-in-one AI framework for semantic search, LLM orchestration and language model workflows

12,192Python
Analyze Code

#11yichuan-w/LEANN

[MLsys2026]: RAG on Everything with LEANN. Enjoy 97% storage savings while running a fast, accurate, and 100% private RAG application on your personal device.

9,999Python
Analyze Code

#12simular-ai/Agent-S

Agent S: an open agentic framework that uses computers like a human

9,841Python
Analyze Code