176 dependents
Package Description Downloads/month
fast python port of arc90's readability tool, updated to match latest readabilit... 1.5M
Create web-based user interfaces with Python. The nice way. 1.2M
Extract text from HTML 1.2M
Allowlist-based HTML cleaner 617K
Extract embedded metadata from HTML markup 341K
Module for automatic summarization of text documents and HTML pages. 151K
news-please - an integrated web crawler and information extractor for news that ... 118K
llama-index readers web integration 86K
A Playwright-based web scraper with persistent caching, parallel scraping, progr... 69K
MIME based content transformations 57K
Base OAREPO package freezing versions of libraries 39K
Pubmed / NCBI / eutils interaction library, handling the metadata of pubmed pape... 36K
A faster cpu machine learning library 19K
Invenio module for previewing files. 19K
Local Deep Research achieves ~95% on SimpleQA benchmark (tested with Qwen 3.6). ... 15K
Official Oxylabs MCP integration 15K
Aient: The Awakening of Agent. 14K
Python library for crawling websites and building a graph of their structure. 10K
Formasaurus tells you the type of an HTML form and its fields using machine lear... 9K
a repository of entities / relations for knowledge management 9K
Mail hosting made simple 9K
modelmerge is a multi-large language model API aggregator. 9K
A web scraping library based on LangChain which uses LLM and direct graph logic ... 8K
Locally saves webpages to your hard disk with images, css, js & links as is. 6K
Agent-first semantic grep — filesystem-native retrieval 5K
A tool for parsing and extracting RSS/Atom feed article content from Feedland OPML. 5K
Simplify interactions with Large Language Models 5K
A NOMAD plugin containing the schema for the Perovskite Solar Cell Database. 5K
Add your description here 4K
Automation library for desktop and web environments and system utilities. 4K
Self-hosted AI-powered personal knowledge base 4K
Advanced Pipeline for Simple yet Comprehensive AnaLysEs of DNA metabarcoding dat... 4K
A lightweight library for working with pandas dataframes using natural language ... 3K
Python internal and external DSL for writing generative AI analytics 3K
Example PyPI (Python Package Index) package set up with automated tests and publ... 3K
Move from idea to production in hours with policy-driven autonomous AI agents. U... 3K
Creates dynamic html report from jupyter notebook. 3K
Creates a complete full text historical archive for an RSS or ATOM feed. 3K
Modern Data Centric AI system for Large Language Models 3K
Large Action Model framework to develop AI Web Agents 3K
create a podcast feed from any text source 3K
Lucterios framework. 2K
A napari plugin for counting organoids from brightfield microscopy images with d... 2K
BERTrend analyses topic evolution over time using state-of-the-art transformer m... 2K
A library to get garbage collection dates in Hamburg 2K
An application for interactive tracking with motile 2K
Core library for FeedSummary. 2K
FastAPI wrapper for Meta AI with chat, image generation & video generation. Easy... 2K
A combined news/weblog application for Aldryn and django CMS – part of the Essen... 2K
A CLI tool for downloading subtitles from www.subdivx.com with the best possible m... 2K