PyPI Stats
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About
Home

Scraping Websites Python Packages

Python packages with the GitHub topic scraping-websites. Sorted by relevance, with stars and monthly downloads.
kennethreitz
requests-html

Pythonic HTML Parsing for Humans™

816K 328 42
Anorov
cfscrape

A Python module to bypass Cloudflare's anti-bot page.

43K 4K 451
outscraper
outscraper

The library provides convenient access to the Outscraper API from applications written in the Python language. Allows using Outscraper's services from your code.

40K 92 21
TeamKillerX
tgcore

TgCore • Designed for complex systems. Made simple. A fluent DSL for building Telegram bots, APIs, and AI workflows.

4K 1 0
multimodal-ai-lab
scrapemm

LLM-friendly scraper for media and text from social media and the open web.

2K 5 0
crawlbase-source
crawlbase

Fast python library for the Crawlbase API

2K 25 2
sujitmandal
scrape-search-engine

Search anything on the different Search Engine's it will collect all the links.

2K 14 5
proxycrawl
proxycrawl

ProxyCrawl Python library for scraping and crawling

2K 58 19
outscraper
google-services-api

The library provides convenient access to the Outscraper API from applications written in the Python language. Allows using Outscraper's services from your code.

848 92 21
outscraper
google-maps-reviews

Google Maps Reviews API SDK

755 14 5
fedecalendino
nintendeals

Library with a set of tools for scraping information about Nintendo games and its prices across all regions (NA, EU and JP).

745 129 18
pyporn-san
multporn

A python library used to scrape and download from Multporn.net

709 28 1
proxymesh
scrapy-proxy-headers

Handle custom proxy headers when making HTTPS requests through proxies in scrapy

700 4 0
okaits
nicovideo-py

ニコニコ動画に投稿された動画の情報を取得するライブラリです。(動画をダウンロードすることはできません。)

690 0 0
andriystr
lst

Declarative Scraping Library

524 0 0
JordanAllen101
aioprox

aioprox – Asynchronous proxy manager for Python. Fetch, test, and filter HTTP/SOCKS proxies with optional latency measurement and custom sources.

506 1 0
danangfir
indoquake

A Latest Earthquake Detection Package Taken Based on BMKG | Meteorological, Climatological, and Geophysical Agency

444 0 0
sarartur
liquidcss

Alters css selector names across css files and html templates.

428 3 0
erikqu
newsdatascraper

Python package that helps you easily retrieve complete web articles, new and old

262 5 0
HubTou
pnu-libgh

GitHub scraping tool and library

247 1 1
Javinator9889
g-pygle

A tool for searching the entire web with the Google technology

247 5 1
Indigo-Coder-github
korean-news-crawler

Python Library for Crawling News Artircles in Korean Top 10 News Websites with Utilities

228 1 0
edwardseley
lyricscorpora

An unofficial Python API that allows users to create a corpus of lyrical text from their favorite artists and billboard charts

194 18 1
ivbeg
lazyscraper

Lazy helper tool to make easier scraping with simple tasks

157 19 2
    • Data from PyPI, GitHub, ClickHouse, and BigQuery