43 dependents
Package Description Downloads/month
AI Execute Services - A middleware framework for AI-powered task execution and t... 9K
ktrain is a Python library that makes deep learning and AI more accessible and e... 7K
A course management system currently used at DTU 3K
Smart local file search app that understands your files 3K
Scio v2 is a reimplementation of Scio in Python3 2K
Add your description here 1K
A command-line interface for interacting with Distant Reader study carrels 1K
The Asynchronous Data Dynamo and Graph Neural Network Catalyst 1K
Owlsight is a command-line tool combining open-source AI models with Python func... 893
LlamaIndex Legacy Office Reader, handles .doc files loading with Apache Tika 766
The Harmony Python library: a research tool for psychologists to harmonise data ... 468
Easily create semantic search based LLM applications on your own data 433
Automatic CLI tool for generating outline of PDFs based on parsing the table of ... 399
OCRUSREX takes a PDF (either by path or as a file-like object) and makes it sear... 356
Document parsing tool for LLM training and Rag 327
utils for html parsing 302
Parsers and ingestors for different file types and formats 266
DLC2Action is an action segmentation package that makes running and tracking of ... 259
Python utils for the Camai CHC COVID Datasystem. 226
Genie Flow Invoker Document Process 224
217
Scraper and PDF text processor for domsdatabasen.dk 192
Question Answering System for Plants 178
Simple script for extracting business data from PDFs. 154
Beautiful and interactive visualisations for NLP Topics 151
146
Package to process documents of any format 145
It a simple package for training and classification of resumes. 132
124
Open source plagiarism checker 119
A Model Context Protocol (MCP) server for reading and summarizing file content w... 103
Benchmark PDF extraction tools for use with RAG applications 101
This SDK is for Data Digitization. 93
Documment Extraction library for Python 88
A Python tool for extracting table of contents from EPUB files with hierarchical... 81
A small package to extract text from pdf 77
fetch, munge, and parse résumés and job postings 75
This SDK is for Data Digitization 74
Script to check local folders for GDPR-relevant information in the TUM context 63
62
Convert pdf to plain string (multiline if needed) 59
texta-parsers-lite 52
6