Best Open Source NLP Libraries

A curated list of the most popular GitHub repositories tagged with the "nlp" topic. Select any project to visualize its architecture and dive into the codebase using RepoMind's AI engine.

#1 huggingface/transformers

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

157,977 stars · Python

#2 hiyouga/LlamaFactory

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

68,588 stars · Python

#3 microsoft/AI-For-Beginners

12 Weeks, 24 Lessons, AI for All!

46,112 stars · Jupyter Notebook

#4 apachecn/ailearning

AiLearning: Data Analysis + Machine Learning in Action + Linear Algebra + PyTorch + NLTK + TF2

42,128 stars · Python

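#5 666ghj/BettaFish

微舆 (BettaFish): a multi-agent public opinion analysis assistant that anyone can use. It breaks through information cocoons, restores the original picture of public opinion, predicts future trends, and supports decision-making. Implemented from scratch, with no dependency on any framework.

39,254 stars · Python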

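#6 hankcs/HanLP

Chinese word segmentation, part-of-speech tagging, named entity recognition, dependency parsing, constituency parsing, semantic dependency parsing, semantic role labeling, coreference resolution, style transfer, semantic similarity, new word discovery, keyphrase extraction, automatic summarization, text classification and clustering, pinyin and Simplified/Traditional Chinese conversion, natural language processing.

36,205 stars · Python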

#7 google/langextract

A Python library for extracting structured information from unstructured text using LLMs with precise source grounding and interactive visualization.

34,749 stars · Python

#8 explosion/spaCy

💫 Industrial-strength Natural Language Processing (NLP) in Python

33,343 stars · Python

#9 ashishpatel26/500-AI-Machine-learning-Deep-learning-Computer-vision-NLP-Projects-with-code

500 AI Machine learning Deep learning Computer vision NLP Projects with code

32,271 stars

#10 stanford-oval/storm

An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.

28,008 stars · Python

#11 ymcui/Chinese-LLaMA-Alpaca

Chinese LLaMA & Alpaca large language models + local CPU/GPU training and deployment

18,970 stars · Python

#12 AI4Finance-Foundation/FinGPT

FinGPT: Open-Source Financial Large Language Models! Revolutionize 🔥 We release the trained model on HuggingFace.

18,844 stars · Jupyter Notebook

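#13 AccumulateMore/CV

✔ (Completed) Extremely comprehensive deep learning notes: [Tudui (土堆), PyTorch] [Mu Li (李沐), Dive into Deep Learning] [Andrew Ng (吴恩达), Deep Learning] [Dafei (大飞), LLM Agents]

18,306 stars · Jupyter Notebook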

#14 keon/awesome-nlp

📖 A curated list of resources dedicated to Natural Language Processing (NLP)

18,291 stars

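#15 NLP-LOVE/ML-NLP

This project covers the knowledge points and code implementations that frequently come up in Machine Learning, Deep Learning, and NLP interviews, which are also the theoretical fundamentals every algorithm engineer needs to know.

17,559 stars · Jupyter Notebook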

#16 dair-ai/ML-YouTube-Courses

📺 Discover the latest machine learning / AI courses on YouTube.

17,120 stars

#17 bharathgs/Awesome-pytorch-list

A comprehensive list of PyTorch-related content on GitHub, such as different models, implementations, helper libraries, tutorials, etc.

16,419 stars

#18 piskvorky/gensim

Topic Modelling for Humans

16,373 stars · Python

#19 oxford-cs-deepnlp-2017/lectures

Oxford Deep NLP 2017 course

15,864 stars

#20 graykode/nlp-tutorial

Natural Language Processing Tutorial for Deep Learning Researchers

14,871 stars · Jupyter Notebook

#21 NVIDIA/DeepLearningExamples

State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enterprise-grade infrastructure.

14,744 stars · Jupyter Notebook

#22 openvinotoolkit/openvino

OpenVINO™ is an open source toolkit for optimizing and deploying AI inference

9,899 stars · C++

#23 brightmart/nlp_chinese_corpus

Large Scale Chinese Corpus for NLP

9,867 stars

#24 jadore801120/attention-is-all-you-need-pytorch

A PyTorch implementation of the Transformer model in "Attention is All You Need".

9,652 stars · Python

#25 sloria/TextBlob

Simple, Pythonic text processing: sentiment analysis, part-of-speech tagging, noun phrase extraction, translation, and more.

9,515 stars · Python

#26 modelscope/modelscope

ModelScope: bring the notion of Model-as-a-Service to life.

8,798 stars · Python

#27 microsoft/presidio

An open-source framework for detecting, redacting, masking, and anonymizing sensitive data (PII) across text, images, and structured data. Supports NLP, pattern matching, and customizable pipelines.

7,662 stars · Python

#28 Nyandwi/machine_learning_complete

A comprehensive machine learning repository containing 30+ notebooks on different concepts, algorithms and techniques.

4,990 stars · Jupyter Notebook

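#29 shibing624/text2vec

text2vec, text to vector. A text vector representation tool that converts text into vector matrices. It implements text representation and text similarity models such as Word2Vec, RankBM25, Sentence-BERT, and CoSENT, and works out of the box.

4,952 stars · Python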

#30 SCIR-HI/Huatuo-Llama-Med-Chinese

Repo for BenTsao [original name: HuaTuo (华驼)], Instruction-tuning Large Language Models with Chinese Medical Knowledge.

4,943 stars · Python

#31 argilla-io/argilla

Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets

4,895 stars · Python

#32 thunlp/OpenPrompt

An Open-Source Framework for Prompt-Learning.

4,841 stars · Python