PyPI Stats
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About
Home

Webscraping Python Packages

Python packages with the GitHub topic webscraping. Sorted by relevance, with stars and monthly downloads.
requests-cache
requests-cache

Persistent HTTP cache for python requests

21.2M 1K 156
adbar
htmldate

Fast and robust date extraction from web pages, with Python or on the command-line

9.8M 148 30
firecrawl
firecrawl-py

🔥 The API to search, scrape, and interact with the web for AI

7M 114K 7K
Kaliiiiiiiiii-Vinyzu
patchright

Undetected Python version of the Playwright testing and automation library.

4.9M 1K 96
daijro
camoufox

🦊 Anti-detect browser

904K 8K 679
firecrawl
firecrawl

🔥 The API to search, scrape, and interact with the web for AI

746K 114K 7K
D4Vinci
scrapling

🕷️ An adaptive Web Scraping framework that handles everything from a single request to a full-scale crawl!

612K 47K 4K
AliAkhtari78
spotifyscraper

Spotify Scraper to extract all the information from spotify, download mp3 with cover of the song

544K 252 28
ZenRows
zenrows

SDK to access ZenRows API directly from Python. We handle proxies rotation, headless browsers and CAPTCHAs for you.

158K 18 9
CloakHQ
cloakbrowser

Stealth Chromium that passes every bot detection test. Drop-in Playwright replacement with source-level fingerprint patches. 30/30 tests passed.

130K 2K 125
assafelovic
gpt-researcher

An autonomous agent that conducts deep research on any data using any LLM providers

75K 27K 4K
jpjacobpadilla
stealth-requests

Undetected web-scraping & seamless HTML parsing in Python!

57K 467 48
openzim
libzim

Libzim binding for Python: read/write ZIM files in Python

39K 102 30
ZacharyHampton
homeharvest

Python package for scraping real estate property data

21K 675 156
Hyper-Solutions
hyper-sdk

Python SDK for Bot Protection Bypass - Automate Akamai, Incapsula, Kasada, and DataDome. No browsers required. Solve challenges and generate valid sensors/cookies via API.

13K 58 3
0xMH
pyfunda

Reverse Engineering funda (funda.nl) mobile APIs

10K 128 12
vypivshiy
ssc-codegen

python-dsl code converter to html parser for web scraping

9K 3 0
ScrapingAnt
scrapingant-client

ScrapingAnt API client for Python.

9K 43 5
maxhumber
gazpacho

🥫 The simple, fast, and modern web scraping library

8K 769 55
phoenixthrush
aniworld

AniWorld Downloader is a cross-platform tool for streaming and downloading anime from aniworld.to, as well as series from s.to. It runs on Windows, macOS, and Linux, providing a seamless experience for offline viewing or instant playback. If you enjoy using it, feel free to leave a ⭐!

7K 241 40
openzim
zimscraperlib

Collection of Python code to re-use across Python-based scrapers

6K 28 22
mov-cli
mov-cli

Watch everything from your terminal.

5K 1K 52
AlexMili
reachable

Check if a URL exists and is reachable

4K 6 0
kameleo-io
kameleo-local-api-client

Anti-detect browser for web scraping and automation. Engine-level fingerprint masking for Chromium and Firefox. Self-hosted, Docker-ready. Integrates with Selenium, Playwright, and Puppeteer via SDKs in Python, JavaScript, and C#.

4K 104 22
    • Data from PyPI, GitHub, ClickHouse, and BigQuery