PyPI Stats
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About
Home

Search Packages

Find Python packages by name, description, GitHub topic, or filter by metrics
adbar
htmldate

Fast and robust date extraction from web pages, with Python or on the command-line

9.4M 148 30
run-llama
llama-cloud

Python SDK for OCR and document parsing in the cloud with LlamaParse

8.1M 28 7
urchade
gliner

Generalist and Lightweight Model for Named Entity Recognition (Extract any entity types from texts)

556K 3K 271
EventRegistry
eventregistry

Python package for API access to news articles and events in the Event Registry

125K 256 57
elyase
geotext

Geotext extracts country and city mentions from text

79K 139 50
natasha
yargy

Rule-based facts extraction for Russian language

54K 332 44
modelscope
adaseq

AdaSeq: An All-in-One Library for Developing State-of-the-Art Sequence Understanding Models

53K 453 44
philgooch
abbreviations

Python3 implementation of the Schwartz-Hearst algorithm for extracting abbreviation-definition pairs

49K 89 21
jaidevd
numerizer

A Python module to convert natural language numerics into ints and floats.

44K 233 24
PaddlePaddle
paddlenlp

Easy-to-use and powerful LLM and SLM library with awesome model zoo.

36K 13K 3K
PaddlePaddle
tool-helpers

Easy-to-use and powerful LLM and SLM library with awesome model zoo.

10K 13K 3K
marijnkoolen
fuzzy-search

Fuzzy search modules for searching lists of words in low quality OCR and HTR text.

9K 23 1
jackboyla
glirel

Generalist and Lightweight Model for Relation Extraction (Extract any relationship types from text)

9K 273 23
PaddlePaddle
fast-dataindex

Easy-to-use and powerful LLM and SLM library with awesome model zoo.

8K 13K 3K
brevia-ai
brevia

Extensible API and framework to build your Retrieval Augmented Generation (RAG) and Information Extraction (IE) applications with LLMs

6K 32 3
krlabsorg
lettucedetect

Lightweight hallucination detection framework for RAG applications

5K 568 39
vmenger
deduce

Deduce: de-identification method for Dutch medical text

5K 64 27
arnebinder
pytorch-ie

PyTorch-IE: State-of-the-art Information Extraction in PyTorch

5K 77 6
dpasse
extr

Named Entity Recognition (NER) and Relation Extraction (RE) library using Regular Expressions

4K 10 0
huspacy
huspacy

HuSpaCy: industrial-strength Hungarian natural language processing

3K 182 18
huspacy
huspacy-nightly

HuSpaCy: industrial-strength Hungarian natural language processing

3K 182 18
zjunlp
deepke

[EMNLP 2022] An Open Toolkit for Knowledge Graph Extraction and Construction

3K 4K 742
lasigeBioTM
bent

Biomedical Term Annotator

2K 9 1
yifanfeng97
hyperextract

Transform unstructured text into structured knowledge with LLMs. Graphs, hypergraphs, and spatio-temporal extractions — with one command.

2K 816 85
    • Data from PyPI, GitHub, ClickHouse, and BigQuery