PyPI Stats
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About
Home

Webcrawler Python Packages

Python packages with the GitHub topic webcrawler. Sorted by relevance, with stars and monthly downloads.
saying121
decrypt-cookies

Get browser cookies and logins. Easily make a request using the authorization data from your browser.

12K 10 2
AIMLPM
markcrawl

Fast Python web crawler for RAG and AI ingestion. Extracts clean Markdown from any site for LLMs and vector stores.

8K 2 0
umrashrf
catch

Web crawler with built in parsers using latest Python technologies

3K 0 1
scrapinghub
scrapyrt

HTTP API for Scrapy spiders

3K 881 162
GeneralNewsExtractor
gne

新闻网页正文通用抽取器 Beta 版.

2K 4K 540
simonsdave
cloudfeaster

Cloudfeaster Spider Development

1K 3 0
imyourboyroy
web-scraper-toolkit

A powerful, standalone web scraping toolkit using Playwright and various parsers.

777 5 2
GeminidSystems
googlenewsscraper

A Python package that scrapes Google News article data while remaining undetected by Google. Our scraper can scrape page data up until the last page and never trigger a CAPTCHA (download stats: https://pepy.tech/project/GoogleNewsScraper)

746 11 5
cgq-qgc
hydrosensorreader

This project provides tools to read files from probes, sensors, or anything used in hydrogeology.

685 8 2
superjcd
spydy

light-weight high-level web-crawling framework

447 2 0
riquedev
sslproxies24

Captura e validação de Proxys (Python).

400 0 0
YUChoe
noizze-crawler

A web page crawler which returns (title, og:image, og:description).

399 0 1
rrmerugu
trawler

A data gathering framework to search and get information from web sources

382 2 2
EdmundMartin
scrapio

Asyncio web crawling framework. Work in progress.

334 19 4
ScrapeGraphAI
scrapegraph-mcp

ScapeGraph MCP Server

316 66 19
kingname
generalnewsextractor

新闻网页正文通用抽取器 Beta 版.

230 4K 540
Jack-Tilley
webscraping-tools

Tools to make webscraping easier

216 2 1
Indigo-Coder-github
korean-news-crawler

Python Library for Crawling News Artircles in Korean Top 10 News Websites with Utilities

215 1 0
ScrapeGraphAI
mseep-scrapegraph-mcp

ScapeGraph MCP Server

194 66 19
albert-marrero
yugioh-scraper

Yu-Gi-Oh! Scraper is a project that crawls websites and APIs and extracts Yu-Gi-Oh! related data from their pages.

173 1 0
Aravindha1234u
socialscraper

Social Scraper is a python tool meant for Detection of Child Predators/Cyber Harassers on Social Media

164 60 12
A-Bak
webpage-image-downloader

Tool for extracting and saving specific images from websites.

162 0 0
dearopen
django-easy-scraper

Django apps to scrape data from web page easily

151 2 1
VictorAlessander
smith-the-crawler

A toolkit to make easy web scraping the world.

138 3 0
    • Data from PyPI, GitHub, ClickHouse, and BigQuery