PyPI Stats

Scraper Python Packages

Python packages tagged with the GitHub topic "scraper". Sorted by relevance, with stars and monthly downloads.
firecrawl
firecrawl-py

🔥 The API to search, scrape, and interact with the web for AI

7M 114K 7K
rushter
selectolax

Python binding to Modest and Lexbor engines. Fast HTML5 parser with CSS selectors for Python.

5.7M 2K 91
codelucas
newspaper3k

newspaper3k is a news, full-text, and article metadata extraction library for Python 3.

1M 15K 2K
firecrawl
firecrawl

🔥 The API to search, scrape, and interact with the web for AI

718K 114K 7K
JoMingyu
google-play-scraper

Google Play scraper for Python inspired by <facundoolano/google-play-scraper>

583K 967 246
AliAkhtari78
spotifyscraper

Spotify scraper to extract all available information from Spotify and download MP3s with the song's cover art

544K 252 28
apify
crawlee

Crawlee—A web scraping and browser automation library for Python to build reliable crawlers. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with Parsel, BeautifulSoup, Playwright, and raw HTTP. Both headful and headless mode. With proxy rotation.

541K 9K 712
0x676e67
rnet

An ergonomic Python HTTP Client with TLS fingerprint

496K 1K 104
spider-rs
spider-client

Python, Javascript, and Rust libraries for the Spider Cloud API.

413K 25 9
scrapfly
scrapfly-sdk

Official Python SDK for the Scrapfly platform: web scraping, screenshots, AI extraction, crawling, and a remote anti-bot browser. Integrates with Scrapy, LlamaIndex, and LangChain.

313K 55 15
d60
twikit

Twitter API Scraper | Without an API key | Twitter Internal API | Free | Twitter scraper | Twitter Bot

162K 4K 526
ZenRows
zenrows

SDK to access the ZenRows API directly from Python. We handle proxy rotation, headless browsers, and CAPTCHAs for you.

160K 18 9
JustAnotherArchivist
snscrape

A social networking service scraper in Python

98K 5K 780
cinemagoer
cinemagoer

Cinemagoer is a Python package for retrieving and managing data from the IMDb movie database (with which we are not affiliated in any way) about movies, people, characters, and companies

89K 1K 376
vladkens
twscrape

2025! X / Twitter API scraper with authorization support. Allows you to scrape search results, user profiles (followers/following), tweets (favoriters/retweeters), and more.

61K 2K 289
henrique-coder
perplexity-webui-scraper

An advanced, high-performance Python client, MCP server, and REST API for reverse-engineering Perplexity AI's WebUI.

44K 76 19
dermasmid
scrapetube

A YouTube scraper for scraping channels, playlists, and searching 🔎

43K 508 67
outscraper
outscraper

The library provides convenient access to the Outscraper API from applications written in the Python language. Allows using Outscraper's services from your code.

40K 92 21
isaackogan
tiktoklive

The definitive Python library to receive livestream events (comments, gifts, etc.) in realtime from TikTok LIVE.

33K 1K 270
0x676e67
wreq

An ergonomic Python HTTP Client with TLS fingerprint

31K 1K 104
cinemagoer
imdbpy

Cinemagoer is a Python package for retrieving and managing data from the IMDb movie database (with which we are not affiliated in any way) about movies, people, characters, and companies

29K 1K 376
tn3w
is-crawler

Crawler detection from User-Agent strings in 50 ns. Issues and pull requests welcome!

29K 0 0
BrianWeiHaoMa
misoreports

A comprehensive Python library for downloading Midcontinent Independent System Operator (MISO) public reports into pandas dataframes.

26K 8 0
cowboy-bebug
app-store-scraper

Single API ☝ App Store Review Scraper 🧹

22K 100 61
    • Data from PyPI, GitHub, ClickHouse, and BigQuery