15 dependents
Package Description Downloads/month
Python & Command-line tool to gather text and metadata on the Web: Crawling, scr... 7.2M
Local Deep Research achieves ~95% on SimpleQA benchmark (tested with Qwen 3.6). ... 15K
kootenpv sky
:sunrise: next generation web crawling using machine intelligence 3K
Scio v2 is a reimplementation of Scio in Python3 2K
The knowledge agent shell (core) 1K
Browser proxy server 1K
A Python package to manage delphai machine learning operations. 722
Entity Market Research 334
Ingestion (web/PDF/DOCX/TXT), cleaning, paragraph-level LID (PT/EN/ES), and spaC... 193
Scalable Data Preprocessing Tool for Training Large Language Models 185
Modern Python library for browser automation and intelligent content extraction ... 147
Scalable data preprocessing and curation toolkit for LLMs 71
A Python package to extract data from unstructured into structured format 70
hogwarts browser use: custom edition for students of the Hogwarts Test Development Society 18
Scalable Data Preprocessing Tool for Training Large Language Models 1