Dependents of scrapy

386 dependents

Package	Description	Downloads/month
scrapy-playwright	🎭 Playwright integration for Scrapy	922K
advertools	advertools - online marketing productivity and analysis tools	147K
news-please	news-please - an integrated web crawler and information extractor for news that ...	118K
scrapy-zyte-api	Zyte API integration for Scrapy	109K
llama-index-readers-web	llama-index readers web integration	86K
scrapy-splash	Scrapy+Splash for JavaScript integration	70K
scrapyd	A service daemon to run Scrapy spiders	47K
scrapy-redis	Redis-based components for Scrapy.	35K
scrapyd-client	Command line client for Scrapyd server	34K
scrapy-impersonate	Scrapy download handler that can impersonate browser' TLS signatures or JA3 fing...	34K
scrapy-selenium	Scrapy middleware to handle javascript pages using selenium	26K
scrapinghub-entrypoint-scrapy	Scrapy entrypoint for Scrapinghub job runner	23K
scrapy-zyte-smartproxy	Zyte Smart Proxy Manager (formerly Crawlera) middleware for Scrapy	22K
shub-workflow	Workflow manager for Zyte ScrapyCloud tasks.	21K
inspire-schemas	Inspire JSON schemas and utilities to use them.	20K
scrapy-poet	Page Object pattern for Scrapy	15K
scrapy-calyprium	Anti-detection Scrapy middleware — proxy routing and browser rendering for web s...	13K
fzutils	A Python utils for spider	9K
aiecs	AI Execute Services - A middleware framework for AI-powered task execution and t...	9K
scrapy-spider-metadata	Utilities to extend Scrapy spiders with usable metadata.	9K
scrapyscript	Run a Scrapy spider programmatically from a script or a Celery task - no project...	9K
e-models	Tools for helping build of extraction models with scrapy spiders.	8K
ayugespidertools	使 scrapy 开发不用在意 item，pipeline，middleware 等通用场景下模块的编写，解放开发者的双手。	8K
city-scrapers-core	Core functionality for City Scrapers projects	8K
scrapy-random-useragent-pro	A random user-agent for all your needs	7K
scrapy-zen		6K
scrapy-inline-requests	A decorator to write coroutine-like spider callbacks.	5K
scrapy-useragents	A downloader middleware to change user-agent of scrapy	5K
scrapy-settings-log	An extension that allows a user to display all or some of their scrapy spider se...	5K
plexflow	A short description of the package.	5K
clappscrapers	Clappform Python scraper	5K
zyte-spider-templates	Spider templates for automatic crawlers.	4K
scrapy-zenrows	A Scrapy middleware for accessing ZenRows Scraper API with minimal setup.	4K
duplicate-url-discarder	Discarding duplicate URLs based on rules.	4K
comicguispider	支持拷贝漫画, Māngabz, 禁漫天堂, wnacg, exhentai, hitomi.la, h-comic , kemono, danbooru \| ...	4K
scrapy-deltafetch	Scrapy spider middleware to ignore requests to pages containing items seen in pr...	4K
modis-crawler-utils	Scrapy utils for Modis crawlers projects.	3K
scrapy-cloudflare-middleware	A Scrapy middleware to bypass the CloudFlare's anti-bot protection	3K
scrapy-rss	Tools to easy generate RSS feed that contains each scraped item using Scrapy fra...	3K
scrapyrt	HTTP API for Scrapy spiders	3K
assetutilities	Standardized project configuration for assetutilities	2K
scrapy-frontera	More flexible and featured Frontera scheduler for Scrapy	2K
board-game-scraper	Board games data scraping and processing from BoardGameGeek and more!	2K
docrawl	Do automated crawling of pages using scrapy	2K
scrapy-wayback-middleware	Scrapy middleware for submitting URLs to the Internet Archive Wayback Machine	2K
gerapy	Distributed Crawler Management Framework Based on Scrapy, Scrapyd, Django and Vu...	2K
scraper-hj3415	Gathering the stock data	2K
canvasrobot	Library which uses Canvasapi (see https://canvasapi.readthedocs.io) to provide a...	2K
scrapy-proxy-pool	Simple scrapy proxy pool	2K
xx		2K