PyPI Stats
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About
Home

Search Packages

Find Python packages by name, description, GitHub topic, or filter by metrics
lipoja
urlextract

URLExtract is python class for collecting (extracting) URLs from given text based on locating TLD.

801K 277 64
fhamborg
news-please

news-please - an integrated web crawler and information extractor for news that just works

118K 2K 452
nexB
extractcode

A mostly universal file extraction library and CLI tool to extract almost any archive in a reasonably safe way on Linux, macOS and Windows.

73K 38 23
taojinmin
spparser

ETL tools

37K 33 1
offici5l
fcetool

Extract specific files from remote ROM.ZIP archives without downloading the full ROM

18K 32 8
diogo-alves
eml-extractor

A CLI tool to extract attachments from .eml files (email messages saved as files)

7K 20 7
mefistotelis
pylabview

Python reader of LabVIEW RSRC files (VI, CTL, LLB). File format description on the Wiki.

5K 126 30
DanielJDufour
date-extractor

Extract dates from text

3K 66 14
tatuylonen
wiktextract

Wiktionary dump file parser and multilingual data extractor

2K 1K 110
CBICA
openpatchminer

OpenSlide Patch Manager

1K 4 6
myifeng
article-parser

Extract article or news by url or html, parse the title and content, output in markdown format.

824 50 6
aphp
edspdf-poppler

Poppler extension for EDS-PDF

651 0 0
mutalyzer
mutalyzer-algebra

A Boolean Algebra for Genetic Variants

634 13 2
Arech
yacce

Non-intrusive bazel compile_commands.json extractor

593 5 0
vishaltanwar96
aadhaar-py

Extract embedded information from Aadhaar Secure QR Code.

570 15 1
rs-develop
forioccrawler

A forensic ioc crawler and parser.

435 5 2
sgl-umons
gigawork

A tool for extracting GitHub Actions workflows

428 8 2
febos
contactextractor

Contact Extractor from PDB/mmCIF coordinate files

256 0 0
Coskon
ytget

Easily get data and download youtube videos, focused on speed and simplicity.

220 0 0
nanaelie
mailgrab

Email Extractor est un outil Python permettant d'extraire des adresses email à partir de pages web ou de fichiers texte. Il utilise les expressions régulières pour identifier et extraire les emails, et Playwright pour le scraping web. Ce projet est idéal pour récupérer des adresses email à partir de différentes sources.

206 2 0
marcpage
pylavi

Python LabVIEW resource file parser

203 11 4
laxyapahuja
fontin

A better font extractor and installer for bulk fonts in one archive.

192 7 0
MikeMeliz
torcrawl

Crawl and extract (regular or onion) webpages through TOR network

189 507 88
aboutcode-org
android-inspector

A collection of ScanCode.io pipelines dedicated to Android APK analysis.

178 1 0
    • Data from PyPI, GitHub, ClickHouse, and BigQuery