28 dependents
Package Description Downloads/month
ConvoKit is a toolkit for extracting conversational features and analyzing socia... 4K
II Researcher Package 4K
Powerful Python tool for modern NLP processing 1K
Scratchpad for scraper development and general utilities. 862
A Python package to manage delphai machine learning operations. 722
NLP Application to parse RH Curriculum Vitae for the RH department 587
📜 Dehyphenation of broken text (mainly German), i.e., extracted from a PDF 540
A package for tracking historical sentiment data from Twitter over certain keywo... 518
A CxG Induction Framework. 342
CLI & Python API to easily summarize text-based files with transformers 308
A straightforward tool to get information from the Docker command line and try to p... 303
Fulltext search for linkding 250
Construction Grammars for Natural Language Processing and Computational Linguist... 250
Understanding and Prioritizing Flaky Job Failure Categories 234
📑 Python Package to reconstruct the original continuous text from PDFs with lang... 225
Ingestion (web/PDF/DOCX/TXT), cleaning, paragraph-level LID (PT/EN/ES), and spaC... 193
rnm 184
Measure the similarity of text corpora for 74 languages 148
epubsum. 146
Preprocess German texts for serious NLP. 143
121
Edu-ConvoKit: An Open-Source Framework for Education Conversation Data 117
II Researcher Package 114
Add your description here 107
II Researcher Package 98
matthewdurward vmp: Generating Vocabulary Management Profiles in Python 80
Basic computational linguistics and natural language processing in Python 78
Geographically-informed language identification 72