13 dependents
| Package | Description | Downloads/month |
|---|---|---|
| | Evaluation tools for TREC AutoJudge: meta-evaluate, qrel-evaluate, leaderboard s... | 3K |
| | Conveniently process a dictionary of anndatas | 2K |
| | Inference-time scaling for LLMs-as-a-judge. | 2K |
| | Match recall segments with story segments. | 1K |
| | soak: graph-based pipelines and tools for LLM-assisted qualitative text analysis | 1K |
| | A benchmark for feature attribution techniques | 500 |
| | A Python framework for constructing, deploying, and analyzing large-scale lingui... | 241 |
| | An LLM annotation experiment pipeline for computational social science. | 209 |
| | Package to calculate inter annotator agreement based on krippendorff's alpha for... | 144 |
| | A comprehensive Python package tools for Balinese Natural Language Processing | 137 |
| | XL-DURel Utility Functions | 111 |
| | Word-level Quality Estimation Toolkit | 109 |
| | Package for distributing annotations and calculating annotator agreement/reliabi... | 106 |