268 dependents
Package Description Downloads/month
Parsel lets you extract data from XML/HTML documents using XPath or CSS selector... 4.2M
Complete lxml external type annotation 4M
APIs for browser automation, testing, and bypassing bot-detection. 3.5M
Scrapy, a fast high-level web crawling & scraping framework for Python. 3.4M
A jquery-like library for python 2.1M
fast python port of arc90's readability tool, updated to match latest readabilit... 1.5M
🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join ... 1.5M
newspaper3k is a news, full-text, and article metadata extraction in Python 3. A... 999K
🕷️ An adaptive Web Scraping framework that handles everything from a single requ... 564K
Web based localization tool with tight version control integration. 298K
Python based web automation tool. It can control the browser and send and receiv... 283K
A library with support functions to be called from Odoo migration scripts. 240K
A python package for embedding pandas DataFrames as images into pdf and markdown... 240K
Python implementation of core ProseMirror modules 104K
A Python 3 compatible version of goose http://goose3.readthedocs.io/en/latest/in... 81K
A library that keeps functions that make our work easier close at hand. 73K
Diazo implements a Deliverance like language using a pure 54K
An API to scrape American court websites for metadata. 41K
Simple tool for getting geolocation information on given IP address from various... 38K
Pubmed / NCBI / eutils interaction library, handling the metadata of pubmed pape... 36K
⚽ High-performance football analytics: build data pipelines, scrape data, model ... 19K
Library to support testing Splunk Add-on UX 15K
The Python Package Index Project 14K
Unofficial API for finviz.com 12K
🚲 A preprocessor for anyone writing specifications that converts source fil... 11K
A realtime logging and aggregation server. 11K
Create HTML with python 3 using a standard DOM API. Includes a python port of Ja... 10K
python-dsl code converter to html parser for web scraping 9K
A web scraping library based on LangChain which uses LLM and direct graph logic ... 8K
Anicli API implementation 8K
A highly efficient, fast, powerful and light-weight anime downloader and streame... 8K
Guiguts version 2, Python/tkinter version 6K
A fast and simple declarative JSON/XML deserializer 5K
Extract HTML elements from the command line using CSS selectors or XPath. Pipe-f... 5K
Watch (parts of) webpages and get notified when something changes via e-mail, on... 5K
Console tools to download online novel and convert to text file 5K
Complete lxml external type annotation 5K
Fetch and format historical price data 5K
A very simple news crawler with a funny name 5K
Allows you to create browser automations with natural language 5K
Intellectual property data tools for AI agents 4K
RecipeDSL — domain-specific language for data gathering and transformations 4K
generate word documents in a sexy way 4K
Python extensions for Inkscape core, separated out from main repository. 4K
Render Plantuml codeblocks in mkdocs without sending sensitive diagrams to a pub... 4K
Manage appointments and resource booking 3K
Grep Python Abstract Syntax Trees (AST) using XPath 3K
📜 The Archive Query Log. 3K
Distributed Python web crawling framework 3K
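A high-performance asynchronous distributed crawler framework based on asyncio, supporting both single-machine and distributed deployment 3K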