PyPI Stats

Search Packages

Find Python packages by name, description, or GitHub topic, or filter by metrics
adbar
htmldate

Fast and robust date extraction from web pages, with Python or on the command-line

9.4M 148 30
kreuzberg-dev
kreuzberg

A polyglot document intelligence framework with a Rust core. Extract text, metadata, images, and structured information from PDFs, Office documents, images, and 97+ formats. Available for Rust, Python, Ruby, Java, Go, PHP, Elixir, C#, R, C, and TypeScript (Node/Bun/Wasm/Deno), or use via CLI, REST API, or MCP server.

165K 8K 472
oduwsdl
aiu

A library for interacting with web archive collections at Archive-It, Trove, Pandora, and more.

7K 8 1
iluvcapra
wavinfo

Probe WAVE Files for all metadata

6K 43 10
kobaltcore
pymage-size

A utility package for getting image dimensions without loading files into memory. No dependencies!

4K 16 1
jakiki6
ruminant

Recursive metadata extraction tool

2K 5 1
tern-tools
tern

Tern is a software composition analysis tool and Python library that generates a Software Bill of Materials for container images and Dockerfiles. The SBOM that Tern generates will give you a layer-by-layer view of what's inside your container in a variety of formats including human-readable, JSON, HTML, SPDX and more.

2K 1K 188
fvaleye
metadata-guardian

Provides an easy way with Python to protect your data sources by searching their metadata. 🛡️

905 18 1
radusuciu
traktor-nowplaying

traktor_nowplaying uses Traktor's broadcast functionality to extract metadata about the currently playing song.

832 66 8
lstein
photomapai

A modern image browser and search tool that uses AI to generate a "semantic map" of your collection.

831 70 4
DanTsai0903
namingpaper

CLI tool to rename academic papers using AI-extracted metadata

770 7 1
d3x-at
sd-parsers

A Python library to read metadata from images created by Stable Diffusion.

744 45 4
rsmvdl
metaspector

Python library to inspect and export metadata from MP4/M4V/M4A, MP3 and FLAC media files.

654 3 0
baughmann
tikara

The metadata and text content extractor for almost every file type.

537 9 0
sdsc-ordes
gimie

Extract linked metadata from repositories

531 14 2
lttkgp
music-metadata-extractor

Extract song metadata from YouTube links with Spotify API

498 16 7
VritraSecz
gitspyx

Advanced OSINT tool for GitHub reconnaissance — get profiles, repo insights & metadata instantly.

478 6 1
mauricelambert
spyware

This package implements a complete SpyWare.

415 154 32
m8sec
pymetasec

Utility to download and extract document metadata from an organization. This technique can be used to identify: domains, usernames, software/version numbers and naming conventions.

353 513 88
shantanubafna
geotcha

Extract and harmonize RNA-seq metadata from NCBI GEO

352 0 0
meysam81
sitemap-harvester

Crawl sitemap of a given website and export metadata of its pages recursively into CSV format.

324 5 0
itsbigspark
pymetagen

Metadata Generator

289 0 0
ankit-chaubey
surgery

Offline CLI tool for inspecting and modifying media metadata.

200 10 1
ymrohit
openscenesense

OpenSceneSense is a Python library that harnesses AI for advanced video analysis, offering customizable frame and audio insights for dynamic applications in media, education, and content moderation.

196 22 1
    • Data from PyPI, GitHub, ClickHouse, and BigQuery