PyPI Stats

Scrapy Python Packages

Python packages with the GitHub topic scrapy. Sorted by relevance, with stars and monthly downloads.
scrapy
itemadapter

Common interface for data container classes

2.4M 69 13
scrapy-plugins
scrapy-playwright

🎭 Playwright integration for Scrapy

804K 1K 160
Luqman-Ud-Din
random-user-agent

A package to get a list of user agents based on filters such as operating system, software name, etc.

428K 103 12
scrapfly
scrapfly-sdk

Official Python SDK for the Scrapfly platform: web scraping, screenshots, AI extraction, crawling, and a remote anti-bot browser. Integrates with Scrapy, LlamaIndex, and LangChain.

313K 55 15
eliasdabbas
advertools

advertools - online marketing productivity and analysis tools

148K 1K 240
scrapy-plugins
scrapy-zyte-api

Zyte API integration for Scrapy

110K 40 21
hellock
icrawler

A multi-threaded crawler framework with many built-in image crawlers.

73K 921 179
scrapy-plugins
scrapy-splash

Scrapy+Splash for JavaScript integration

71K 3K 456
rmax
scrapy-redis

Redis-based components for Scrapy.

36K 6K 2K
jxlil
scrapy-impersonate

Scrapy download handler that can impersonate browsers' TLS signatures or JA3 fingerprints.

34K 232 27
scrapy-plugins
scrapy-crawlera

Zyte Smart Proxy Manager (formerly Crawlera) middleware for Scrapy

33K 365 91
clemfromspace
scrapy-selenium

Scrapy middleware to handle JavaScript pages using Selenium

27K 953 353
alecxe
scrapy-fake-useragent

Random User-Agent middleware based on fake-useragent

24K 688 94
scrapy-plugins
scrapy-zyte-smartproxy

Zyte Smart Proxy Manager (formerly Crawlera) middleware for Scrapy

23K 365 91
TeamHG-Memex
scrapy-rotating-proxies

Use multiple proxies with Scrapy

13K 773 157
my8100
logparser

A tool for parsing Scrapy log files periodically and incrementally, extending the HTTP JSON API of Scrapyd.

10K 92 24
Boris-code
feapder

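🚀🚀🚀 feapder is an easy-to-use, powerful Python crawler framework. It ships with four built-in spiders (AirSpider, Spider, TaskSpider, and BatchSpider) to cover different scenarios, and supports resumable crawling, monitoring and alerting, browser rendering, and large-scale data deduplication. The feaplat crawler management system provides convenient deployment and scheduling for it.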

9K 4K 541
ScrapingAnt
scrapingant-client

ScrapingAnt API client for Python.

9K 43 5
City-Bureau
city-scrapers-core

Core functionality for City Scrapers projects

8K 8 10
orangain
scrapy-s3pipeline

Scrapy pipeline to store chunked items into Amazon S3 or Google Cloud Storage bucket.

7K 76 12
shengchenyang
ayugespidertools

Lets Scrapy developers skip writing modules for common scenarios such as item, pipeline, and middleware, freeing up their hands.

7K 99 16
ScrapeOps
scrapeops-scrapy

Scrapy extension that gives you all the scraping monitoring, alerting, scheduling, and data validation you will need straight out of the box.

6K 38 13
scrapingbee
scrapy-scrapingbee

JavaScript support and proxy rotation for Scrapy with ScrapingBee.

5K 152 6
grammy-jiang
scrapy-useragents

A downloader middleware to change the user-agent of Scrapy

5K 21 5
    • Data from PyPI, GitHub, ClickHouse, and BigQuery