268 dependents
| Package | Description | Downloads/month |
|---|---|---|
| Parsel lets you extract data from XML/HTML documents using XPath or CSS selector... | 4.2M | |
| Complete lxml external type annotation | 4M | |
| APIs for browser automation, testing, and bypassing bot-detection. | 3.5M | |
| Scrapy, a fast high-level web crawling & scraping framework for Python. | 3.4M | |
| A jquery-like library for python | 2.1M | |
| fast python port of arc90's readability tool, updated to match latest readabilit... | 1.5M | |
| 🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join ... | 1.5M | |
| newspaper3k is a news, full-text, and article metadata extraction in Python 3. A... | 999K | |
| 🕷️ An adaptive Web Scraping framework that handles everything from a single requ... | 564K | |
| Web based localization tool with tight version control integration. | 298K | |
| Python based web automation tool. It can control the browser and send and receiv... | 283K | |
| A library with support functions to be called from Odoo migration scripts. | 240K | |
| A python package for embedding pandas DataFrames as images into pdf and markdown... | 240K | |
| Python implementation of core ProseMirror modules | 104K | |
| A Python 3 compatible version of goose http://goose3.readthedocs.io/en/latest/in... | 81K | |
| İşlerimizi kolaylaştıracak fonksiyonların el altında durduğu kütüphane.. | 73K | |
| Diazo implements a Deliverance like language using a pure | 54K | |
| An API to scrape American court websites for metadata. | 41K | |
| Simple tool for getting geolocation information on given IP address from various... | 38K | |
| Pubmed / NCBI / eutils interaction library, handling the metadata of pubmed pape... | 36K | |
| ⚽ High-performance football analytics: build data pipelines, scrape data, model ... | 19K | |
| Library to support testing Splunk Add-on UX | 15K | |
| The Python Package Index Project | 14K | |
| Unofficial API for finviz.com | 12K | |
| :bike: A preprocessor for anyone writing specifications that converts source fil... | 11K | |
| A realtime logging and aggregation server. | 11K | |
| Create HTML with python 3 using a standard DOM API. Includes a python port of Ja... | 10K | |
| python-dsl code converter to html parser for web scraping | 9K | |
| A web scraping library based on LangChain which uses LLM and direct graph logic ... | 8K | |
| Anicli API implemetention | 8K | |
| A highly efficient, fast, powerful and light-weight anime downloader and streame... | 8K | |
| Guiguts version 2, Python/tkinter version | 6K | |
| A fast and simple declarative JSON/XML deserializer | 5K | |
| Extract HTML elements from the command line using CSS selectors or XPath. Pipe-f... | 5K | |
| Watch (parts of) webpages and get notified when something changes via e-mail, on... | 5K | |
| Console tools to download online novel and convert to text file | 5K | |
| Complete lxml external type annotation | 5K | |
| Fetch and format historical price data | 5K | |
| A very simple news crawler with a funny name | 5K | |
| Allows you to create browser automations with natural language | 5K | |
| Intellectual property data tools for AI agents | 4K | |
| RecipeDSL — domain-specific language for data gathering and transformations | 4K | |
| generate word documents in a sexy way | 4K | |
| Python extensions for Inkscape core, separated out from main repository. | 4K | |
| Render Plantuml codeblocks in mkdocs without sending sensitive diagrams to a pub... | 4K | |
| Manage appointments and resource booking | 3K | |
| Grep Python Abstract Syntax Trees (AST) using XPath | 3K | |
| 📜 The Archive Query Log. | 3K | |
| Distributed Python web crawling framework | 3K | |
| 基于 asyncio 的高性能异步分布式爬虫框架,支持单机和分布式部署 | 3K |