Best Open Source Semantic Search Libraries

A curated list of the most popular GitHub repositories tagged with semantic search. Select any project to visualize its architecture and dive into the codebase using RepoMind's AI engine.

#1 microsoft/generative-ai-for-beginners

21 Lessons, Get Started Building with Generative AI

106,699 · Jupyter Notebook

#2 meilisearch/meilisearch

A lightning-fast search engine API bringing AI-powered hybrid search to your sites and applications.

56,011 · Rust

#3 khoj-ai/khoj

Your AI second brain. Self-hostable. Get answers from the web or your docs. Build custom agents, schedule automations, do deep research. Turn any online or local LLM into your personal, autonomous AI (gpt, claude, gemini, llama, qwen, mistral). Get started - free.

32,554 · Python

#4 typesense/typesense

Open Source alternative to Algolia + Pinecone and an Easier-to-Use alternative to ElasticSearch ⚡ 🔍 ✨ Fast, typo tolerant, in-memory fuzzy Search Engine for building delightful search experiences

25,255 · C++

#5 deepset-ai/haystack

Open-source AI orchestration framework for building context-engineered, production-ready LLM applications. Design modular pipelines and agent workflows with explicit control over retrieval, routing, memory, and generation. Built for scalable agents, RAG, multimodal applications, semantic search, and conversational systems.

24,250 · MDX

#6 arc53/DocsGPT

Private AI platform for agents, assistants and enterprise search. Built-in Agent Builder, Deep research, Document analysis, Multi-model support, and API connectivity for agents.

17,716 · Python

#7 memvid/memvid

Memory layer for AI Agents. Replace complex RAG pipelines with a serverless, single-file memory layer. Give your agents instant retrieval and long-term memory.

13,175 · Rust

#8 Tencent/WeKnora

LLM-powered framework for deep document understanding, semantic retrieval, and context-aware answers using the RAG paradigm.

13,064 · Go

#9 neuml/txtai

💡 All-in-one AI framework for semantic search, LLM orchestration and language model workflows

12,192 · Python

#10 lancedb/lancedb

Developer-friendly OSS embedded retrieval library for multimodal AI. Search More; Manage Less.

9,029 · HTML