47 dependents
Package Description Downloads/month
Python runtime for WeTextProcessing (does not depend on Pynini) 270K
An easy python package to run quick basic QA evaluations. This package includes ... 11K
Modern Data Centric AI system for Large Language Models 3K
SOftware Metadata Extraction Framework: A tool for automatically extracting rele... 2K
Chinese text analysis library, which can perform word frequency statistics, dict... 2K
Chinese news word-frequency analysis and trending word cloud tool 1K
Data cleaning made easy with swachhdata 1K
Easy Data Preparation with latest LLMs-based Operators and Pipelines. 1K
EVERSE Research Software Fairness Checks 931
Topic modeling using Transformers 773
A package for automated hyper parameter tuning and machine learning workflows. B... 761
A Python library for building and training Seq2Seq models 761
a modular Python package for cleaning text, categorical, numerical, and datetime... 668
Laboratory-specific discourse analysis tools in the DIAAD lineage 637
A lightweight library for normalizing speech transcripts before computing WER 621
A lightweight and reusable text preprocessing package for NLP tasks 485
A modular, fully-configurable NLP text cleaning function with 15+ toggleable ste... 433
Use this library to transform raw text into different graph representations. 415
A Python library for cleaning and preprocessing text data with asynchronous and ... 348
341
A lightweight Python library for constructing, processing, and visualizing const... 319
Python library for concurrent text preprocessing 245
Custom classes, functions, and scripts for working with Python. 231
Python library that collects tweets about movies, performs a sentiment analysis ... 225
A Python chatbot that learns as you speak to it. 217
A community identification module for Reddit conversations 194
Internet-ML: Allowing ML to connect to the internet 186
A comprehensive pipeline for sentiment analysis using deep learning models 185
An easy-to-use library with advanced preprocessing features to streamline and ac... 184
eXplainable Inference & Search: an industry-ready library for advanced data retr... 173
164
Scientific analysis of collaborative communities 154
A Python library for measuring the style. 139
A simple, configurable NLP preprocessing toolkit. 130
Programming For Data Engineering course final project 123
A collection of preprocessing functions for text data 108
A package for cleaning and preprocessing text data 102
Utilities for AnTeDe course. 101
A lightweight text preprocessing toolkit with tokenization, stopword removal, st... 100
A collection of modular and reusable machine learning utilities, tools, and help... 89
A Python script for counting word families in a text file using advanced morphol... 89
UoT Rotman NLP Case Study 60
This package provides standard and classifier-based short form QA evaluation met... 8
Awesome Data Science Package 5
Quality profiler library 3
A package for cleaning and preprocessing text data 1
Simple, scikit-learn-style NLP classifiers with one-liner preprocessing. 1