15 dependents
| Package | Description | Downloads/month |
|---|---|---|
| Python & Command-line tool to gather text and metadata on the Web: Crawling, scr... | 7.2M | |
| Local Deep Research achieves ~95% on SimpleQA benchmark (tested with Qwen 3.6). ... | 15K | |
| :sunrise: next generation web crawling using machine intelligence | 3K | |
| Scio v2 is a reimplementation of Scio in Python3 | 2K | |
| The knowledge agent shell (core) | 1K | |
| 浏览器代理服务器 | 1K | |
| A Python package to manage delphai machine learning operations. | 722 | |
| Entity Market Research | 334 | |
| Ingestion (web/PDF/DOCX/TXT), cleaning, paragraph-level LID (PT/EN/ES), and spaC... | 193 | |
| Scalable Data Preprocessing Tool for Training Large Language Models | 185 | |
| Modern Python library for browser automation and intelligent content extraction ... | 147 | |
| Scalable data pre processing and curation toolkit for LLMs | 71 | |
| A Python package to extract data from unstructured into structured format | 70 | |
| hogwarts browser use 霍格沃兹测试开发学社学员定制版 | 18 | |
| Scalable Data Preprocessing Tool for Training Large Language Models | 1 |