43 dependents
| Package | Description | Downloads/month |
|---|---|---|
| AI Execute Services - A middleware framework for AI-powered task execution and t... | 9K | |
| ktrain is a Python library that makes deep learning and AI more accessible and e... | 7K | |
| A course management system currently used at DTU | 3K | |
| Smart local file search app that understands your files | 3K | |
| Scio v2 is a reimplementation of Scio in Python3 | 2K | |
| Add your description here | 1K | |
| A command-line interface for interacting with Distant Reader study carrels | 1K | |
| The Asynchronous Data Dynamo and Graph Neural Network Catalyst | 1K | |
| Owlsight is a command-line tool combining open-source AI models with Python func... | 893 | |
| LlamaIndex Legacy Office Reader, handles .doc files loading with Apache Tika | 766 | |
| The Harmony Python library: a research tool for psychologists to harmonise data ... | 468 | |
| Easily create semantic search based LLM applications on your own data | 433 | |
| Automatic CLI tool for generating outline of PDFs based on parsing the table of ... | 399 | |
| OCRUSREX takes a PDF (either by path or as a file-like object) and makes it sear... | 356 | |
| Document parsing tool for LLM training and Rag | 327 | |
| utils for html parsing | 302 | |
| Parsers and ingestors for different file types and formats | 266 | |
| DLC2Action is an action segmentation package that makes running and tracking of ... | 259 | |
| Python utils for the Camai CHC COVID Datasystem. | 226 | |
| Genie Flow Invoker Document Process | 224 | |
| 217 | ||
| Scraper and PDF text processor for domsdatabasen.dk | 192 | |
| Question Answering System for Plants | 178 | |
| Simple script for extracting business data from PDFs. | 154 | |
| Beautiful and interactive visualisations for NLP Topics | 151 | |
| 146 | ||
| Package to process documents of any format | 145 | |
| It a simple package for training and classification of resumes. | 132 | |
| 124 | ||
| Open source plagiarism checker | 119 | |
| A Model Context Protocol (MCP) server for reading and summarizing file content w... | 103 | |
| Benchmark PDF extraction tools for use with RAG applications | 101 | |
| This SDK is for Data Digitization. | 93 | |
| Documment Extraction library for Python | 88 | |
| A Python tool for extracting table of contents from EPUB files with hierarchical... | 81 | |
| A small package to extract text from pdf | 77 | |
| fetch, munge, and parse résumés and job postings | 75 | |
| This SDK is for Data Digitization | 74 | |
| Script to check local folders for GDPR-relevant information in the TUM context | 63 | |
| 62 | ||
| Convert pdf to plain string (multiline if needed) | 59 | |
| texta-parsers-lite | 52 | |
| 6 |