39 dependents
Package Description Downloads/month
Utilities that extend Standard Python. 22K
Beancount Tools 5K
Cross-platform python wrapper around ENIAM (http://eniam.nlp.ipipan.waw.pl/) 2K
PDF text and table search 2K
A web interface to extract tabular data from PDFs 1K
Python SDK for checking bathing water quality in Nouméa 1K
Python package to convert spaCy and Stanza documents to NLP Annotation Format (N... 988
973
3m
3m 928
Data import tools 840
OFXStatement plugin for AltaBanka (Serbia) 690
llama-index readers pdf_table integration 559
Template for AI chatbots & document management using Retrieval-Augmented Generat... 356
Parse OCBC or DBS banks statements into individual transactions 330
Alice PDF is a CLI that extracts tables from PDFs—native or scanned—using Camelo... 328
This Package is for extracting tables, table_image, and text from pdf files. 324
Obtain New Molecular Entities (NME) Drug Approval Data 322
Extract PDF table data using camelot; use OCR to extract text from image-based pages 314
CLI to get information about Ironman professional races 303
A Python library for extracting text content from any document format. 277
Qt application to transform raw tables into clean geographic data. 264
Extracts and interprets the records and fields of the tables in the SPED manuals (Si... 239
Simple Python CLI to extract voting results from PDFs published by the Swiss par... 197
A collection of useful python programs. 195
Tool for extracting text and tables from PDF files and saving this data in docx ... 195
WithPano built using Marzipano, Python and React 184
Extract and process text from images and PDFs 177
A collection of useful python programs. 136
Agent for extracting structured content from PDFs using LangGraph 127
Convert PDF to structured data 117
An AI co-pilot for the mortgage industry speeds up the loan application process. 104
This is pipeline code for accelerating solution accelerators 103
corona chan scraper for gob mx 90
80
Advanced Table Extraction and Text Recognition Library 78
A modular document-processing pipeline for AI-powered document intelligence. 70
Extract data from donor PDF documents for pulmonary transplantation 66
Extract tabular data from PDFs 60
This add-on includes a suite of widgets designed to extract and parse data from ... 2