PyPI Stats
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About
Home

Search Packages

Find Python packages by name, description, GitHub topic, or filter by metrics
scrapy-plugins
scrapy-playwright

🎭 Playwright integration for Scrapy

922K 1K 160
vgalin
html2image

A package acting as a wrapper around the headless mode of existing web browsers to generate images from URLs and from HTML+CSS strings or files.

428K 452 49
scrapfly
scrapfly-sdk

Official Python SDK for the Scrapfly platform: web scraping, screenshots, AI extraction, crawling, and a remote anti-bot browser. Integrates with Scrapy, LlamaIndex, and LangChain.

313K 55 15
CloakHQ
cloakbrowser

Stealth Chromium that passes every bot detection test. Drop-in Playwright replacement with source-level fingerprint patches. 30/30 tests passed.

126K 2K 125
ArchiveBox
abx-plugins

🧩 Plugins and extractors that ArchiveBox + abx-dl use: chrome, ytdlp, wget, singlefile, readability, forum-dl, gallery-dl, papers-dl, and more...

26K 4 0
ArchiveBox
archivebox

🗃 Open source self-hosted web archiving. Takes URLs/browser history/bookmarks/Pocket/Pinboard/etc., saves HTML, JS, PDFs, media, and more...

6K 27K 2K
scrapy-plugins
scrapy-playwright-full

Playwright integration for Scrapy

551 1K 160
elacuesta
scrapy-pyppeteer

Pyppeteer integration for Scrapy

540 58 13
plasmate-labs
plasmate

The browser engine for agents. HTML in, Semantic Object Model out. 10x token compression, V8 JS rendering, CDP compatible. Apache-2.0.

418 21 3
plasmate-labs
plasmate-browser-use

The browser engine for agents. HTML in, Semantic Object Model out. 10x token compression, V8 JS rendering, CDP compatible. Apache-2.0.

278 21 3
bkauto3
conduit-browser

Headless browser with SHA-256 hash chain + Ed25519 audit trails. MCP server for AI agents. Stealth. Self-verifiable proof bundles.

169 3 0
ArchiveBox
archivebox-likn

The self-hosted internet archive.

132 27K 2K
ivan-sincek
scrapy-scraper

Web crawler and scraper based on Scrapy and Playwright's headless browser.

108 17 7
plasmate-labs
som-parser

The browser engine for agents. HTML in, Semantic Object Model out. 10x token compression, V8 JS rendering, CDP compatible. Apache-2.0.

94 21 3
    • Data from PyPI, GitHub, ClickHouse, and BigQuery