PyPI Stats

Search Packages

Find Python packages by name, description, or GitHub topic, or filter them by metrics
chatnoir-eu
fastwarc

A robust web archive analytics toolkit

1.3M 137 18
chatnoir-eu
resiliparse

A robust web archive analytics toolkit

1.3M 137 18
chrismattmann
tika

Tika-Python is a Python binding to the Apache Tika™ REST services allowing Tika to be called natively in the Python community.

412K 2K 251
NeelShah18
emot

Open source Emoticons and Emoji detection library: emot

214K 196 78
viraptor
arpy

ar archive extraction library written in Python

82K 13 13
yobix-ai
extractous

Fast and efficient unstructured data extraction. Written in Rust with bindings for many languages.

61K 2K 96
Trusted-AI
adversarial-robustness-toolbox

Adversarial Robustness Toolbox (ART) - Python Library for Machine Learning Security - Evasion, Poisoning, Extraction, Inference - Red and Blue Teams

31K 6K 1K
nightlark
python-msi

Pure Python library for reading, parsing, and extracting the contents of Windows installer (.msi) files

13K 57 7
nazywam
autoit-ripper

Extract AutoIt scripts embedded in PE binaries

8K 237 43
aubio
aubio-ledfx

a library for audio and music analysis

7K 4K 414
Lattyware
unrpa

A program to extract files from the RPA archive format.

7K 735 85
bug-ops
exarch

Secure archive library: TAR/ZIP/7z extraction & creation with CVE protection. Type-safe Rust core, Python/Node.js bindings, zero unsafe code.

4K 4 2
ARKlab
artesian-sdk

Python Library for Artesian

4K 3 1
retospect
acatome-extract

PDF extraction pipeline for acatome — Marker/fitz, metadata, block chunking

3K 0 0
KnowledgeCaptureAndDiscovery
somef

SOftware Metadata Extraction Framework: A tool for automatically extracting relevant software information from code repositories (using README files, package metadata, etc.)

2K 72 30
Crowlingo
pycrowlingo

Python SDK to use Crowlingo APIs

2K 4 1
ASukhanov
apstrim

Logger and extractor of time-series data.

2K 0 0
JoshuaMKW
pyisotools

Python library for working with GameCube ISOs (GCM)

1K 45 9
rossumai
docile-benchmark

DocILE: Document Information Localization and Extraction Benchmark

1K 146 12
0xMassi
webclaw

Python SDK for the Webclaw web extraction API

1K 1 0
Colearo
huhuseg

Simple Chinese segmentator, keywords extractor and other examples

1K 8 1
skblaz
rakun2

RaKUn 2.0 - A fast keyword detection algorithm

1K 73 7
usc-isi-i2
etk

Extraction Toolkit

915 83 48
tenuo-ai
safe-unzip

Secure zip extraction. Prevents Zip Slip and Zip Bombs.

792 2 1
Data from PyPI, GitHub, ClickHouse, and BigQuery