88 dependents
| Package | Description | Downloads/month |
|---|---|---|
| 删库 | 150K | |
| The Privacy Engineering & Compliance Framework | 87K | |
| A Python 3 compatible version of goose http://goose3.readthedocs.io/en/latest/in... | 83K | |
| :mag: ScanCode detects licenses, copyrights, dependencies by "scanning code" ...... | 77K | |
| Find legal citations in any block of text | 62K | |
| Quickly match many regexes against a string | 39K | |
| Open Source Neural Machine Translation and (Large) Language Models in PyTorch | 23K | |
| find any kind of occupation or job title in a text or file | 13K | |
| Repository for JAAT: efficient and accurate analysis of job ads for task matchin... | 12K | |
| Open source tools for Estonian natural language processing | 11K | |
| MVT (Mobile Verification Toolkit) helps with conducting forensics of mobile devi... | 11K | |
| Infrastructure of AlphaX ecosystem | 7K | |
| The Logger that will prevent your data leak | 5K | |
| Fast, world class biomedical NER | 5K | |
| :mag: ScanCode detects licenses, copyrights, dependencies by "scanning code" ...... | 5K | |
| Compare two fonts | 5K | |
| Taiwan Traditional Chinese quality tool for AI-generated content (CLI + 6-langua... | 4K | |
| PDF craft can convert PDF files into various other formats. This project will fo... | 4K | |
| The surveillance and enforcement layer for AI. Audits code against a cryptograph... | 3K | |
| pycorrector is a toolkit for text error correction. 文本纠错,实现了Kenlm,T5,MacBERT,Cha... | 3K | |
| Evaluation and analysis framework for automatic speech recognition. | 3K | |
| Geo-referencing of text | 3K | |
| A personal toolset built over time by Ricco | 2K | |
| 2K | ||
| Python toolkit for ML, CV, NLP and multimodal AI development | 2K | |
| A fast and simple Named Entity Recognition (NER) tool based on the Aho-Corasick ... | 2K | |
| A simple HTML Parser | 2K | |
| Fast relational access to openly-available publication data sets | 2K | |
| automatically design sgRNA for exon skipping with many base editors | 2K | |
| Dingo: A Comprehensive AI Data, Model and Application Quality Evaluation Tool | 2K | |
| MONAPipe provides natural-language-processing tools for German, implemented in P... | 1K | |
| LLM Secrets Leak Detector LLM Secrets Leak Detector is a security tool designed ... | 1K | |
| OpenVoiceOS's multilingual text color parsing and formatting library | 1K | |
| 一个简单快速的分词、命名实体识别工具 | 1K | |
| LEKCut (เล็ก คัด) is a Thai tokenization library that ports the deep learning mo... | 1K | |
| Lokale Pseudonymisierung personenbezogener Daten in Textdateien vor der Verarbei... | 1K | |
| computational chemistry toolkit | 987 | |
| Utility functions for proteomics data analysis | 945 | |
| EVERSE Research Software Fairness Checks | 928 | |
| Python tools for proteogenomics analysis toolkit | 803 | |
| Vital AI Agent Ecosystem, Ensemble Reasoning | 768 | |
| Open language modeling toolkit based on PyTorch | 678 | |
| CLI tool to process and analyze MAF files ( Multiple alignment format ) | 646 | |
| Runtime Reliability Infrastructure for LLM Pipelines | 629 | |
| Extract names of places from text and determine which country they may refer to | 614 | |
| Domain Adaptation of Thai Word Segmentation Models using Stacked Ensemble (EMNLP... | 578 | |
| Haplotyping | 575 | |
| Protein-DNA binding site caller from kmer data | 563 | |
| Rule-based toolkit for Chinese NLP tasks | 534 | |
| A tool for the analysis of bisulfite-free and base-resolution sequencing data ge... | 507 |