88 dependents
Package Description Downloads/month
删库 150K
The Privacy Engineering & Compliance Framework 87K
A Python 3 compatible version of goose http://goose3.readthedocs.io/en/latest/in... 83K
:mag: ScanCode detects licenses, copyrights, dependencies by "scanning code" ...... 77K
Find legal citations in any block of text 62K
Quickly match many regexes against a string 39K
Open Source Neural Machine Translation and (Large) Language Models in PyTorch 23K
find any kind of occupation or job title in a text or file 13K
Repository for JAAT: efficient and accurate analysis of job ads for task matchin... 12K
Open source tools for Estonian natural language processing 11K
mvt-project mvt
MVT (Mobile Verification Toolkit) helps with conducting forensics of mobile devi... 11K
Infrastructure of AlphaX ecosystem 7K
The Logger that will prevent your data leak 5K
Fast, world class biomedical NER 5K
:mag: ScanCode detects licenses, copyrights, dependencies by "scanning code" ...... 5K
Compare two fonts 5K
Taiwan Traditional Chinese quality tool for AI-generated content (CLI + 6-langua... 4K
PDF craft can convert PDF files into various other formats. This project will fo... 4K
The surveillance and enforcement layer for AI. Audits code against a cryptograph... 3K
pycorrector is a toolkit for text error correction. 文本纠错,实现了Kenlm,T5,MacBERT,Cha... 3K
Evaluation and analysis framework for automatic speech recognition. 3K
Geo-referencing of text 3K
A personal toolset built over time by Ricco 2K
2K
Python toolkit for ML, CV, NLP and multimodal AI development 2K
A fast and simple Named Entity Recognition (NER) tool based on the Aho-Corasick ... 2K
A simple HTML Parser 2K
Fast relational access to openly-available publication data sets 2K
automatically design sgRNA for exon skipping with many base editors 2K
Dingo: A Comprehensive AI Data, Model and Application Quality Evaluation Tool 2K
MONAPipe provides natural-language-processing tools for German, implemented in P... 1K
LLM Secrets Leak Detector LLM Secrets Leak Detector is a security tool designed ... 1K
OpenVoiceOS's multilingual text color parsing and formatting library 1K
一个简单快速的分词、命名实体识别工具 1K
LEKCut (เล็ก คัด) is a Thai tokenization library that ports the deep learning mo... 1K
Lokale Pseudonymisierung personenbezogener Daten in Textdateien vor der Verarbei... 1K
computational chemistry toolkit 987
Utility functions for proteomics data analysis 945
EVERSE Research Software Fairness Checks 928
Python tools for proteogenomics analysis toolkit 803
Vital AI Agent Ecosystem, Ensemble Reasoning 768
Open language modeling toolkit based on PyTorch 678
CLI tool to process and analyze MAF files ( Multiple alignment format ) 646
Runtime Reliability Infrastructure for LLM Pipelines 629
Extract names of places from text and determine which country they may refer to 614
Domain Adaptation of Thai Word Segmentation Models using Stacked Ensemble (EMNLP... 578
Haplotyping 575
Protein-DNA binding site caller from kmer data 563
Rule-based toolkit for Chinese NLP tasks 534
A tool for the analysis of bisulfite-free and base-resolution sequencing data ge... 507