Best Open Source large language models Libraries
A curated list of the most popular GitHub repositories tagged with large language models. Select any project to visualize its architecture and dive into the codebase using RepoMind's AI engine.
#1langflow-ai/langflow
Langflow is a powerful tool for building and deploying AI-powered agents and workflows.
#2rasbt/LLMs-from-scratch
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
#3mlabonne/llm-course
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
#4binary-husky/gpt_academic
为GPT/GLM等LLM大语言模型提供实用化交互接口,特别优化论文阅读/润色/写作体验,模块化设计,支持自定义快捷按钮&函数插件,支持Python和C++等项目剖析&自译解功能,PDF/LaTex论文翻译&总结功能,支持并行问询多种LLM模型,支持chatglm3等本地模型。接入通义千问, deepseekcoder, 讯飞星火, 文心一言, llama2, rwkv, claude2, moss等。
#5hiyouga/LlamaFactory
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
#6FlowiseAI/Flowise
Build AI Agents, Visually
#7ray-project/ray
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
#8google/langextract
A Python library for extracting structured information from unstructured text using LLMs with precise source grounding and interactive visualization.
#9asgeirtj/system_prompts_leaks
Collection of extracted System Prompts from popular chatbots like ChatGPT, Claude & Gemini
#10HKUDS/LightRAG
[EMNLP2025] "LightRAG: Simple and Fast Retrieval-Augmented Generation"
#11stanford-oval/storm
An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.
#12aishwaryanr/awesome-generative-ai-guide
A one stop repository for generative AI research updates, interview resources, notebooks and much more!
#13deepset-ai/haystack
Open-source AI orchestration framework for building context-engineered, production-ready LLM applications. Design modular pipelines and agent workflows with explicit control over retrieval, routing, memory, and generation. Built for scalable agents, RAG, multimodal applications, semantic search, and conversational systems.
#14HandsOnLLM/Hands-On-Large-Language-Models
Official code repo for the O'Reilly Book - "Hands-On Large Language Models"
#15langfuse/langfuse
🪢 Open source LLM engineering platform: LLM Observability, metrics, evals, prompt management, playground, datasets. Integrates with OpenTelemetry, Langchain, OpenAI SDK, LiteLLM, and more. 🍊YC W23
#16QwenLM/Qwen
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
#17ymcui/Chinese-LLaMA-Alpaca
中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)
#18AI4Finance-Foundation/FinGPT
FinGPT: Open-Source Financial Large Language Models! Revolutionize 🔥 We release the trained model on HuggingFace.
#19NVIDIA/DeepLearningExamples
State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enterprise-grade infrastructure.
#20ggml-org/ggml
Tensor library for machine learning
#21Lightning-AI/litgpt
20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.
#22GoogleCloudPlatform/generative-ai
Sample code and notebooks for Generative AI on Google Cloud, with Gemini on Vertex AI
#23eugeneyan/open-llms
📋 A list of open LLMs available for commercial use.
#24sapientinc/HRM
Hierarchical Reasoning Model Official Release
#25neuml/txtai
💡 All-in-one AI framework for semantic search, LLM orchestration and language model workflows
#26bigscience-workshop/petals
🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading
#27KalyanKS-NLP/llm-engineer-toolkit
A curated list of 120+ LLM libraries category wise.
#28FMInference/FlexLLMGen
Running large language models on a single GPU for throughput-oriented scenarios.
#29yzhao062/anomaly-detection-resources
Anomaly detection related books, papers, videos, and toolboxes. Last update late 2025 for LLM and VLM works!
#30OpenRLHF/OpenRLHF
An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & TIS & vLLM & Ray & Async RL)
#31activeloopai/deeplake
Database for AI. Store Vectors, Images, Texts, Videos, etc. Use with LLMs/LangChain. Store, query, version, & visualize any AI data. Stream data in real-time to PyTorch/TensorFlow. https://activeloop.ai
#32Tiiny-AI/PowerInfer
High-speed Large Language Model Serving for Local Deployment
#33MineDojo/Voyager
An Open-Ended Embodied Agent with Large Language Models
#34QwenLM/Qwen-VL
The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.