118 dependents
Package Description Downloads/month
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrai... 357K
Task-based datasets, preprocessing, and evaluation for sequence models. 347K
Task-based datasets, preprocessing, and evaluation for sequence models. 216K
NeMo text processing for ASR and TTS 102K
Python package and data files for manipulating phonological segments (phones, ph... 80K
An open-source NLP research library, built on PyTorch. 72K
A streamlined and customizable framework for efficient large model (LLM, VLM, AI... 45K
A packaged and flexible version of the CRAFT text detector and Keras CRNN recogn... 44K
Official Python client library for the OpenReview API 40K
End-to-End Speech Processing Toolkit 30K
google-research t5: Code for the paper "Exploring the Limits of Transfer Learning with a Unified Tex... 28K
A suite of Arabic natural language processing tools developed by the CAMeL Lab a... 24K
FAIR Sequence Modeling Toolkit 2 23K
✔️Contextual word checker for better suggestions (not actively maintained) 22K
Python SDK to configure and run evaluations for your LLM-based application 19K
TextAttack 🐙 is a Python framework for adversarial attacks, data augmentation, ... 14K
Runsight Agent OS Core Engine 12K
OneGov Cloud Framework based on Morepath 11K
Libraries for developing the arivo openmodule 8K
Perfect hash based Index for genomic data 5K
FASR: Fast Automatic Speech Recognition Pipeline 4K
Prosodic: a metrical-phonological parser, written in Python. For English and Fin... 4K
ASR text preprocessing utility 4K
ExpoSeq is a pipeline to process and analyze in various visualizations ngs data ... 4K
immuneML is a platform for machine learning analysis of adaptive immune receptor... 4K
Document reader with OCR & image detection support. 3K
Kaleidoscope public release edition 3K
Search engine for address 3K
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Stream... 3K
Transcript discovery and quantification with long RNA reads (Nanopores and PacBi... 2K
Versatile pipeline for processing protein structure data for deep learning appli... 2K
An efficient and easy-to-use morpheme segmentation library 2K
Evaluation of Docling 2K
Data processing and analysis tools for fuel market research 2K
FunCodec is a research-oriented toolkit for audio quantization and downstream ap... 2K
A package for audio transcription and speaker diarization using Whisper and NeMo... 2K
Python library for measuring string similarity in a smarter way 1K
GenET: Genome Editing Toolkit 1K
Preprocessing and Extraction of Linguistic Information for Computational Analysi... 1K
package for numerically integrating differential equations (front-end for scipy.... 1K
1K
Toolkit for symbolic regression/equation discovery 1K
mmf: a modular framework for vision and language multimodal research. 1K
Export LibreLingo courses in the JSON format used by the web app 1K
Generic interface for hooking up to any Interactive Theorem Prover (ITP) and col... 1K
Natural Language Understanding (text processing) for math symbols, digits, and w... 1K
A package for curating doc file collections, with ability to sync with youtube a... 1K
Library to characterise tandem repeats in genomic sequences in terms of evolutio... 964
A Hackable speech recognition library. 959
A flexible normalizer for user-generated content http://thalesbertaglia.com/enel... 828