373 dependents
Package Description Downloads/month
An open-source framework for detecting, redacting, masking, and anonymizing sens... 4.5M
Scrapy, a fast high-level web crawling & scraping framework for Python. 3.4M
Turn any computer or edge device into a command center for your computer vision ... 1.1M
newspaper3k is a news, full-text, and article metadata extraction in Python 3. A... 999K
Crawlee—A web scraping and browser automation library for Python to build reliab... 538K
Find and use proxy auto-config (PAC) files with Python and Requests. 344K
Simplified python article discovery & extraction. 319K
Python-based web automation tool. It can control the browser and send and receiv... 283K
Python SDK for SuperTokens 171K
Turn any computer or edge device into a command center for your computer vision ... 119K
The new inference engine for Computer Vision models 117K
Label Studio is a multi-type data labeling and annotation tool with standardized... 114K
Organize Your Audiobook Collection With Beets 105K
The recursive internet scanner for hackers. 🧡 93K
Manipulate DNS records on various DNS providers in a standardized way. 90K
Integration layer between Requests and Selenium for automation of web actions. 72K
ingestr is a CLI tool to copy data between any databases with a single command s... 69K
Bright Data's Python SDK; use it to call Bright Data's scrape and search tools. ... 57K
Check subdomains for subdomain takeovers and other DNS tomfoolery 55K
Microsoft Threat Intelligence Security Tools 53K
An API to scrape American court websites for metadata. 41K
Return a normalized email-address stripping ISP specific behaviors 41K
Collection of utils used at Inoopa. 40K
Python WHOIS and RDAP utility for querying and parsing information about Domains... 36K
URL matching library that relates URLs with resources 36K
Extends Selenium WebDriver classes to include the request function from the Requ... 36K
Certbot plugin enabling dns-01 challenge on the Hetzner DNS API 33K
A Python SDK for Dhisana AI Platform 21K
19K
Multisite in django — use one Django app to serve multiple domains 19K
Quokko is a cute search engine and web crawler featuring Quokkas! 19K
Atomic functions and classes to make developer life easier 18K
detect technologies with wappalyzer alternative 17K
Browser testing via live content 16K
Find way more from the Wayback Machine, Common Crawl, Alien Vault OTX, URLScan, ... 14K
Generate list of potential typo squatting domains with domain name permutation e... 13K
Simple CertificateAuthority and host certificate creation, useful for man-in-the... 13K
clinicedc edc A framework for multisite longitudinal clinical trials built on Django 13K
Core Python Web Archiving Toolkit for replay and recording of web archives 11K
Faraday plugins package 11K
Plugin for certbot to obtain certificates using a DNS TXT record for Porkbun dom... 11K
Extract IOCs from text. 10K
This is a Certbot DNS plugin for the new Hetzner Cloud DNS, which allows you to ... 10K
The web browser for LLMs agents 10K
python library for getting metadata 10K
Formasaurus tells you the type of an HTML form and its fields using machine lear... 9K
Web tools and interfaces for Internet data processing. 9K
Mail hosting made simple 9K
Python library for SPF, DKIM, and DMARC email protections. 9K
a domain ssl cert admin 9K