back to home

Best Open Source large language models Libraries

A curated list of the most popular GitHub repositories tagged with large language models. Select any project to visualize its architecture and dive into the codebase using RepoMind's AI engine.

#1langflow-ai/langflow

Langflow is a powerful tool for building and deploying AI-powered agents and workflows.

144,945Python
Analyze Code

#2rasbt/LLMs-from-scratch

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

85,633Jupyter Notebook
Analyze Code

#3mlabonne/llm-course

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

75,434
Analyze Code

#4binary-husky/gpt_academic

为GPT/GLM等LLM大语言模型提供实用化交互接口,特别优化论文阅读/润色/写作体验,模块化设计,支持自定义快捷按钮&函数插件,支持Python和C++等项目剖析&自译解功能,PDF/LaTex论文翻译&总结功能,支持并行问询多种LLM模型,支持chatglm3等本地模型。接入通义千问, deepseekcoder, 讯飞星火, 文心一言, llama2, rwkv, claude2, moss等。

70,117Python
Analyze Code

#5hiyouga/LlamaFactory

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

67,420Python
Analyze Code

#6FlowiseAI/Flowise

Build AI Agents, Visually

49,249TypeScript
Analyze Code

#7ray-project/ray

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

41,416Python
Analyze Code

#8google/langextract

A Python library for extracting structured information from unstructured text using LLMs with precise source grounding and interactive visualization.

33,427Python
Analyze Code

#9asgeirtj/system_prompts_leaks

Collection of extracted System Prompts from popular chatbots like ChatGPT, Claude & Gemini

32,374HTML
Analyze Code

#10HKUDS/LightRAG

[EMNLP2025] "LightRAG: Simple and Fast Retrieval-Augmented Generation"

28,485Python
Analyze Code

#11stanford-oval/storm

An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.

27,914Python
Analyze Code

#12aishwaryanr/awesome-generative-ai-guide

A one stop repository for generative AI research updates, interview resources, notebooks and much more!

24,796HTML
Analyze Code

#13deepset-ai/haystack

Open-source AI orchestration framework for building context-engineered, production-ready LLM applications. Design modular pipelines and agent workflows with explicit control over retrieval, routing, memory, and generation. Built for scalable agents, RAG, multimodal applications, semantic search, and conversational systems.

24,250MDX
Analyze Code

#14HandsOnLLM/Hands-On-Large-Language-Models

Official code repo for the O'Reilly Book - "Hands-On Large Language Models"

22,477Jupyter Notebook
Analyze Code

#15langfuse/langfuse

🪢 Open source LLM engineering platform: LLM Observability, metrics, evals, prompt management, playground, datasets. Integrates with OpenTelemetry, Langchain, OpenAI SDK, LiteLLM, and more. 🍊YC W23

22,137TypeScript
Analyze Code

#16QwenLM/Qwen

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

20,428Python
Analyze Code

#17ymcui/Chinese-LLaMA-Alpaca

中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)

18,962Python
Analyze Code

#18AI4Finance-Foundation/FinGPT

FinGPT: Open-Source Financial Large Language Models! Revolutionize 🔥 We release the trained model on HuggingFace.

18,630Jupyter Notebook
Analyze Code

#19NVIDIA/DeepLearningExamples

State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enterprise-grade infrastructure.

14,732Jupyter Notebook
Analyze Code

#20ggml-org/ggml

Tensor library for machine learning

14,035C++
Analyze Code

#21Lightning-AI/litgpt

20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.

13,174Python
Analyze Code

#22GoogleCloudPlatform/generative-ai

Sample code and notebooks for Generative AI on Google Cloud, with Gemini on Vertex AI

12,709Jupyter Notebook
Analyze Code

#23eugeneyan/open-llms

📋 A list of open LLMs available for commercial use.

12,643
Analyze Code

#24sapientinc/HRM

Hierarchical Reasoning Model Official Release

12,317Python
Analyze Code

#25neuml/txtai

💡 All-in-one AI framework for semantic search, LLM orchestration and language model workflows

12,192Python
Analyze Code

#26bigscience-workshop/petals

🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading

9,953Python
Analyze Code

#27KalyanKS-NLP/llm-engineer-toolkit

A curated list of 120+ LLM libraries category wise.

9,823
Analyze Code

#28FMInference/FlexLLMGen

Running large language models on a single GPU for throughput-oriented scenarios.

9,380Python
Analyze Code

#29yzhao062/anomaly-detection-resources

Anomaly detection related books, papers, videos, and toolboxes. Last update late 2025 for LLM and VLM works!

9,173Python
Analyze Code

#30OpenRLHF/OpenRLHF

An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & TIS & vLLM & Ray & Async RL)

9,017Python
Analyze Code

#31activeloopai/deeplake

Database for AI. Store Vectors, Images, Texts, Videos, etc. Use with LLMs/LangChain. Store, query, version, & visualize any AI data. Stream data in real-time to PyTorch/TensorFlow. https://activeloop.ai

9,008C++
Analyze Code

#32Tiiny-AI/PowerInfer

High-speed Large Language Model Serving for Local Deployment

8,714C++
Analyze Code

#33MineDojo/Voyager

An Open-Ended Embodied Agent with Large Language Models

6,680JavaScript
Analyze Code

#34QwenLM/Qwen-VL

The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.

6,534Python
Analyze Code