Best Open Source NLP Libraries
A curated list of the most popular GitHub repositories tagged with nlp. Select any project to visualize its architecture and dive into the codebase using RepoMind's AI engine.
#1 huggingface/transformers
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models across text, vision, audio, and multimodal domains, for both inference and training.
#2 hiyouga/LlamaFactory
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
#3 microsoft/AI-For-Beginners
12 Weeks, 24 Lessons, AI for All!
#4 apachecn/ailearning
AiLearning: Data Analysis + Machine Learning in Action + Linear Algebra + PyTorch + NLTK + TF2
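#5 666ghj/BettaFish
微舆: a multi-agent public opinion analysis assistant that anyone can use. It breaks through information cocoons, restores the full picture of public opinion, predicts future trends, and supports decision-making. Implemented from scratch, with no dependency on any framework.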
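#6 hankcs/HanLP
Chinese word segmentation, part-of-speech tagging, named entity recognition, dependency parsing, constituency parsing, semantic dependency parsing, semantic role labeling, coreference resolution, style transfer, semantic similarity, new word discovery, keyword and phrase extraction, automatic summarization, text classification and clustering, pinyin and Simplified/Traditional Chinese conversion, natural language processing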
#7 google/langextract
A Python library for extracting structured information from unstructured text using LLMs with precise source grounding and interactive visualization.
#8 explosion/spaCy
💫 Industrial-strength Natural Language Processing (NLP) in Python
#9 ashishpatel26/500-AI-Machine-learning-Deep-learning-Computer-vision-NLP-Projects-with-code
500 AI Machine learning Deep learning Computer vision NLP Projects with code
#10 stanford-oval/storm
An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.
#11 ymcui/Chinese-LLaMA-Alpaca
Chinese LLaMA & Alpaca large language models + local CPU/GPU training and deployment
#12 AI4Finance-Foundation/FinGPT
FinGPT: Open-Source Financial Large Language Models! 🔥 The trained model is released on Hugging Face.
#13 AccumulateMore/CV
✔ (Completed) Extremely comprehensive deep learning notes [Tudui: PyTorch] [Li Mu: Dive into Deep Learning] [Andrew Ng: Deep Learning] [Dafei: LLM Agents]
#14 keon/awesome-nlp
📖 A curated list of resources dedicated to Natural Language Processing (NLP)
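#15 NLP-LOVE/ML-NLP
This project covers the knowledge points and code implementations commonly tested in Machine Learning, Deep Learning, and NLP interviews. They are also the theoretical fundamentals every algorithm engineer must know.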
#16 dair-ai/ML-YouTube-Courses
📺 Discover the latest machine learning / AI courses on YouTube.
#17 bharathgs/Awesome-pytorch-list
A comprehensive list of PyTorch-related content on GitHub, such as different models, implementations, helper libraries, tutorials, etc.
#18 piskvorky/gensim
Topic Modelling for Humans
#19 oxford-cs-deepnlp-2017/lectures
Oxford Deep NLP 2017 course
#20 graykode/nlp-tutorial
Natural Language Processing Tutorial for Deep Learning Researchers
#21 NVIDIA/DeepLearningExamples
State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enterprise-grade infrastructure.
#22 openvinotoolkit/openvino
OpenVINO™ is an open source toolkit for optimizing and deploying AI inference
#23 brightmart/nlp_chinese_corpus
Large Scale Chinese Corpus for NLP
#24 jadore801120/attention-is-all-you-need-pytorch
A PyTorch implementation of the Transformer model in "Attention is All You Need".
#25 sloria/TextBlob
Simple, Pythonic text processing: sentiment analysis, part-of-speech tagging, noun phrase extraction, translation, and more.
#26 modelscope/modelscope
ModelScope: bring the notion of Model-as-a-Service to life.
#27 microsoft/presidio
An open-source framework for detecting, redacting, masking, and anonymizing sensitive data (PII) across text, images, and structured data. Supports NLP, pattern matching, and customizable pipelines.
#28 Nyandwi/machine_learning_complete
A comprehensive machine learning repository containing 30+ notebooks on different concepts, algorithms and techniques.
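#29 shibing624/text2vec
text2vec, text to vector. A text vector representation tool that converts text into vector matrices. It implements text representation and text similarity models including Word2Vec, RankBM25, Sentence-BERT, and CoSENT, and works out of the box.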
#30 SCIR-HI/Huatuo-Llama-Med-Chinese
Repo for BenTsao (本草) [original name: HuaTuo (华驼)], Instruction-tuning Large Language Models with Chinese Medical Knowledge.
#31 argilla-io/argilla
Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets
#32 thunlp/OpenPrompt
An Open-Source Framework for Prompt-Learning.