back to home

Best Open Source llama Libraries

A curated list of the most popular GitHub repositories tagged with llama. Select any project to visualize its architecture and dive into the codebase using RepoMind's AI engine.

#1ollama/ollama

Get up and running with Kimi-K2.5, GLM-5, MiniMax, DeepSeek, gpt-oss, Qwen, Gemma and other models.

165,352Go
Explore Repo

#2vllm-project/vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

73,416Python
Explore Repo

#3hiyouga/LlamaFactory

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

68,588Python
Explore Repo

#4unslothai/unsloth

Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek, Qwen, Llama, Gemma, TTS 2x faster with 70% less VRAM.

54,106Python
Explore Repo

#5mudler/LocalAI

:robot: The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement, running on consumer-grade hardware. No GPU required. Runs gguf, transformers, diffusers and many more. Features: Generate Text, MCP, Audio, Video, Images, Voice Cloning, Distributed, P2P and decentralized inference

43,766Go
Explore Repo

#6Aider-AI/aider

aider is AI pair programming in your terminal

42,046Python
Explore Repo

#7chatchat-space/Langchain-Chatchat

Langchain-Chatchat(原Langchain-ChatGLM)基于 Langchain 与 ChatGLM, Qwen 与 Llama 等语言模型的 RAG 与 Agent 应用 | Langchain-Chatchat (formerly langchain-ChatGLM), local knowledge based LLM (like ChatGLM, Qwen and Llama) RAG and Agent app with langchain

37,542Python
Explore Repo

#8sgl-project/sglang

SGLang is a high-performance serving framework for large language models and multimodal models.

28,834Python
Explore Repo

#9fishaudio/fish-speech

SOTA Open Source TTS

27,926Python
Explore Repo

#10AstrBotDevs/AstrBot

Agentic IM Chatbot infrastructure that integrates lots of IM platforms, LLMs, plugins and AI feature, and can be your openclaw alternative. ✨

25,496Python
Explore Repo

#11ymcui/Chinese-LLaMA-Alpaca

中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)

18,970Python
Explore Repo

#12meta-llama/llama-cookbook

Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We also show you how to solve end to end problems using Llama model family and using them on various provider services

18,258Jupyter Notebook
Explore Repo

#13cheahjs/free-llm-api-resources

A list of free LLM inference resources accessible via API.

16,049Python
Explore Repo

#14GaiZhenbiao/ChuanhuChatGPT

GUI for ChatGPT API and many LLMs. Supports agents, file-based QA, GPT finetuning and query with web search. All with a neat UI.

15,369Python
Explore Repo

#15LlamaFamily/Llama-Chinese

Llama中文社区,实时汇总最新Llama学习资料,构建最好的中文Llama大模型开源生态,完全开源可商用

14,739Python
Explore Repo

#16modelscope/ms-swift

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3.6, DeepSeek-V4, GLM-5.1, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Gemma4, Llava, Phi4, ...) (AAAI 2025).

14,351Python
Explore Repo

#17xorbitsai/inference

Swap GPT for any LLM by changing a single line of code. Xinference lets you run open-source, speech, and multimodal models on cloud, on-prem, or your laptop — all through one unified, production-ready inference API.

9,135Python
Explore Repo

#18oumi-ai/oumi

Easily fine-tune, evaluate and deploy gpt-oss, Qwen3, DeepSeek-R1, or any open source LLM / VLM!

8,911Python
Explore Repo

#19Tiiny-AI/PowerInfer

High-speed Large Language Model Serving for Local Deployment

8,830C++
Explore Repo

#20SCIR-HI/Huatuo-Llama-Med-Chinese

Repo for BenTsao [original name: HuaTuo (华驼)], Instruction-tuning Large Language Models with Chinese Medical Knowledge. 本草(原名:华驼)模型仓库,基于中文医学知识的大语言模型指令微调

4,943Python
Explore Repo

#21h2oai/h2o-llmstudio

H2O LLM Studio - a framework and no-code GUI for fine-tuning LLMs. Documentation: https://docs.h2o.ai/h2o-llmstudio/

4,900Python
Explore Repo

#22transformerlab/transformerlab-app

The open source research environment for AI researchers to seamlessly train, evaluate, and scale models from local hardware to GPU clusters.

4,825Python
Explore Repo

#23mostlygeek/llama-swap

Reliable model swapping for any local OpenAI/Anthropic compatible server - llama.cpp, vllm, etc

4,347Go
Explore Repo

#24ModelTC/LightLLM

LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.

4,082Python
Explore Repo

#25superiorlu/AITreasureBox

🤖 Automatically collected AI repos, tools, websites, papers & tutorials. 实用AI百宝箱 💎

810Ruby
Explore Repo

#26FuJacob/cotabby

Cotabby is local AI autocomplete for your entire Mac. Open source. On device. Everywhere you type.

633Swift
Explore Repo

#27mgonzs13/llama_ros

llama.cpp (GGUF LLMs) and llava.cpp (GGUF VLMs) for ROS 2

253C++
Explore Repo