PyPI Stats

Natural Language Processing Python Packages

Python packages with the GitHub topic natural-language-processing. Sorted by relevance, with stars and monthly downloads.
huggingface
huggingface-hub

The official Python client for the Hugging Face Hub.

244.7M 4K 1K
huggingface
tokenizers

💥 Fast State-of-the-Art Tokenizers optimized for Research and Production

163.2M 11K 1K
huggingface
transformers

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

143.5M 160K 33K
huggingface
datasets

🤗 The largest hub of ready-to-use datasets for AI models with fast, easy-to-use and efficient data manipulation tools

118.9M 21K 3K
nltk
nltk

NLTK Source

60.5M 15K 3K
google
sentencepiece

Unsupervised text tokenizer for Neural Network-based text generation.

32.8M 12K 1K
explosion
thinc

🔮 A refreshing functional take on deep learning, compatible with your favorite libraries

24.8M 3K 294
explosion
spacy

💫 Industrial-strength Natural Language Processing (NLP) in Python

22.1M 34K 5K
explosion
spacy-loggers

📟 Logging utilities for spaCy

17.7M 12 17
adbar
htmldate

Fast and robust date extraction from web pages, with Python or on the command-line

9.6M 148 30
datamade
usaddress

🇺🇸 A Python library for parsing unstructured United States address strings into address components

6.5M 2K 308
Unstructured-IO
unstructured

Convert documents to structured data effortlessly. Unstructured is an open-source ETL solution for transforming complex documents into clean, structured formats for language models. Visit our website to learn more about our enterprise-grade Platform product for production-grade workflows, partitioning, enrichments, chunking and embedding.

5.3M 15K 1K
RaRe-Technologies
gensim

Topic Modelling for Humans

5.1M 16K 4K
sloria
textblob

Simple, Pythonic text processing: sentiment analysis, part-of-speech tagging, noun phrase extraction, translation, and more.

2.2M 10K 1K
pemistahl
lingua-language-detector

The most accurate natural language detection library for Python, suitable for short text and mixed-language text

1.7M 2K 59
openvinotoolkit
openvino

OpenVINO™ is an open source toolkit for optimizing and deploying AI inference

1.4M 10K 3K
autogluon
autogluon-core

Fast and Accurate ML in 3 Lines of Code

1.3M 10K 1K
autogluon
autogluon-features

Fast and Accurate ML in 3 Lines of Code

1.2M 10K 1K
PyThaiNLP
pythainlp

Thai natural language processing in Python

1.2M 1K 295
autogluon
autogluon-tabular

Fast and Accurate ML in 3 Lines of Code

1.1M 10K 1K
autogluon
autogluon-common

Fast and Accurate ML in 3 Lines of Code

1.1M 10K 1K
JohnSnowLabs
spark-nlp

State of the Art Natural Language Processing

1.1M 4K 743
autogluon
autogluon

Fast and Accurate ML in 3 Lines of Code

1M 10K 1K
microsoft
flaml

A fast library for AutoML and tuning. Join our Discord: https://discord.gg/Cppx2vSPVP.

950K 4K 560
Data from PyPI, GitHub, ClickHouse, and BigQuery