7,503 dependents
Package Description Downloads/month
Create and modify Word documents with Python 51.7M
Redshift Python Connector. It supports Python Database API Specification v2.0. 49.6M
A Python SOAP client 40.3M
Create Open XML PowerPoint documents in Python 31.5M
Read Google Cloud Storage, Azure Blobs, and local paths with the same interface 14.3M
Separate project for HTML cleaning functionalities copied from lxml.html.clean. 12.9M
Python bindings for the XML Security Library. 12.7M
Fast and robust date extraction from web pages, with Python or on the command-li... 9.4M
Command line tool and async library to perform basic file operations on local pa... 9M
Saml Python Toolkit. Add SAML support to your Python software using this library 8.9M
A Python library for reading and writing PDF, powered by QPDF 8.1M
Python & Command-line tool to gather text and metadata on the Web: Crawling, scr... 7.2M
unittest-based test runner with Ant/JUnit like XML reporting. 6.4M
Heuristic based boilerplate removal tool 6.1M
pyHanko: sign and stamp PDF files 5.8M
Convert documents to structured data effortlessly. Unstructured is open-source E... 5.2M
A metasearch library that aggregates results from diverse web search services 5.2M
Read SVG files and convert them to other formats. 4.5M
Parsel lets you extract data from XML/HTML documents using XPath or CSS selector... 4.2M
A metasearch library that aggregates results from diverse web search services 4.1M
Reference BLEU implementation that auto-downloads test sets and reports a versio... 4M
Python XML Signature and XAdES library 3.9M
Scrapy, a fast high-level web crawling & scraping framework for Python. 3.4M
Generate code coverage reports with gcc/gcov 3.1M
AKShare is an elegant and simple financial data interface library for Python, bu... 2.7M
Publish Markdown files to Confluence wiki 2.3M
A jquery-like library for python 2.1M
Use a docx as a jinja2 template 2.1M
Pythonic SharePoint 2M
Python API for https://vespa.ai, the open big data serving engine 1.9M
A python based HTML to text conversion library, command line client and Web serv... 1.9M
Python client for Microsoft Exchange Web Services (EWS) 1.7M
Reverse engineering and pentesting for Android applications 1.6M
🌎💪 BrowserGym, a Gym environment for web task automation 1.6M
A simple HTML content extractor in Python. Can be run as a wrapper for Mozilla's... 1.6M
fast python port of arc90's readability tool, updated to match latest readabilit... 1.5M
🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join ... 1.5M
Create web-based user interfaces with Python. The nice way. 1.2M
Extract text from HTML 1.2M
Python library to access and analyze SEC Edgar filings, XBRL financial statement... 1.1M
Append/Concatenate .docx documents 1.1M
A simple converter from ASCIIMath to LaTeX or MathML and from MathML to LaTeX 1.1M
Client library for Enterprise h2oGPTe 1M
newspaper3k is a news, full-text, and article metadata extraction in Python 3. A... 999K
Python library for NETCONF clients 985K
Easy to use WebDAV Client for Python 3.x 944K
A versatile Python library for EPUB2/EPUB3 manipulation and processing. 935K
🦊 Anti-detect browser 918K
Python parser for URDFs 900K
Label Studio SDK 899K