Best Open Source Retrieval-Augmented Generation Libraries
A curated list of the most popular GitHub repositories tagged with retrieval augmented generation. Select any project to visualize its architecture and dive into the codebase using RepoMind's AI engine.
#1 infiniflow/ragflow
RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to create a superior context layer for LLMs
#2 pathwaycom/llm-app
Ready-to-run cloud templates for RAG, AI pipelines, and enterprise search with live data. 🐳 Docker-friendly. ⚡ Always in sync with SharePoint, Google Drive, S3, Kafka, PostgreSQL, real-time data APIs, and more.
#3 chatchat-space/Langchain-Chatchat
Langchain-Chatchat (formerly Langchain-ChatGLM): a local-knowledge-base RAG and Agent application built with Langchain and language models such as ChatGLM, Qwen, and Llama.
#4 HKUDS/LightRAG
[EMNLP2025] "LightRAG: Simple and Fast Retrieval-Augmented Generation"
#5 stanford-oval/storm
An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.
#6 deepset-ai/haystack
Open-source AI orchestration framework for building context-engineered, production-ready LLM applications. Design modular pipelines and agent workflows with explicit control over retrieval, routing, memory, and generation. Built for scalable agents, RAG, multimodal applications, semantic search, and conversational systems.
#7 llmware-ai/llmware
Unified framework for building enterprise RAG pipelines with small, specialized models
#8 HKUDS/RAG-Anything
"RAG-Anything: All-in-One RAG Framework"
#9 memvid/memvid
Memory layer for AI Agents. Replace complex RAG pipelines with a serverless, single-file memory layer. Give your agents instant retrieval and long-term memory.
#10 neuml/txtai
💡 All-in-one AI framework for semantic search, LLM orchestration and language model workflows
#11 yichuan-w/LEANN
[MLSys2026] RAG on Everything with LEANN. Enjoy 97% storage savings while running a fast, accurate, and 100% private RAG application on your personal device.
#12 simular-ai/Agent-S
Agent S: an open agentic framework that uses computers like a human