PyPI Stats
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About
Home

Webscraper Python Packages

Python packages with the GitHub topic webscraper. Sorted by relevance, with stars and monthly downloads.
AliAkhtari78
spotifyscraper

Spotify Scraper to extract all the information from spotify, download mp3 with cover of the song

544K 252 28
EchterAlsFake
phub

A lightweight API for Pornhub

5K 141 36
nmcassa
letterboxdpy

A letterboxd webscraper

5K 155 28
foreandr
hypersel

HyperSel is a Python-based framework for web automation and data scraping. It simplifies crawling, automating interactions, and data extraction from web pages. It is built with various robust tools to handle dynamic and complex web interactions efficiently, offering features such as headless browsing, proxy support, and API sniffing.

3K 1 0
rootVIII
proxy-requests

a class that uses scraped proxies to make http GET/POST requests (Python requests)

2K 388 43
xtream1101
scraperx

Library for scraping websites or apis at any scale

1K 54 11
HelloThereMatey
tedata

Download data from Trading Economics for free and without any account or API key. Uses selenium, javascript & bs4 to scrape data. Trading Economics is one of the greatest stores of economic data on the web and contains millions of time-series for hundreds of different countries.

1K 26 7
MichaelYochpaz
isubrip

A Python package for scraping and downloading subtitles from AppleTV / iTunes movie pages.

1K 207 23
vypivshiy
scrape-schema

Structuring parsed data into python objects

1K 4 1
JonathanVusich
pcpartpicker

This is an unofficial API for the website pcpartpicker.com.

909 119 11
eugen1j
aioscrapy

Python asynchronous library for web scraping

824 10 3
thicccat688
mintospy

Mintos web scraper (Updated for 2023)

818 9 4
imyourboyroy
web-scraper-toolkit

A powerful, standalone web scraping toolkit using Playwright and various parsers.

808 5 2
GeminidSystems
googlenewsscraper

A Python package that scrapes Google News article data while remaining undetected by Google. Our scraper can scrape page data up until the last page and never trigger a CAPTCHA (download stats: https://pepy.tech/project/GoogleNewsScraper)

778 11 5
reppon97
soccer-data-api

A Python web-scrap package to get soccer data/stats.

766 8 0
ishan-surana
metadatascraper

MetaDataScraper is a Python package designed to automate the extraction of follower counts and post details from a public Facebook page. It uses Selenium WebDriver for web automation and scraping. Official documentation at https://metadatascraper.readthedocs.io

715 13 1
JaonHax
scpscraper

A Python library designed for scraping data from the SCP wiki.

700 16 4
HimashaHerath
llm-webextract

AI-powered web content extraction — turn any website into structured JSON using LLMs and LangChain

609 2 0
zembrodt
py-mdb

Python package to both parse datsets provided by IMDb and scrape information from imdb.com

527 6 0
brandonrobertz
autoscrape

An automated, programming-free web scraper for interactive sites

505 111 20
tyleracorn
covid-alberta

looking at some of the alberta specific covid data

494 1 0
mrwan200
pytwitterscraper

Twitter Scraper With Python

456 12 3
J-J-B-J
sentraltimetable

A simple Python function to summon your timetable from Sentral.

448 8 0
Leinadium
microhorario-dl

PUC-Rio Microhorario Downloader

441 4 0
    • Data from PyPI, GitHub, ClickHouse, and BigQuery