13 dependents
Package Description Downloads/month
Evaluation tools for TREC AutoJudge: meta-evaluate, qrel-evaluate, leaderboard s... 3K
Conveniently process a dictionary of anndatas 2K
Inference-time scaling for LLMs-as-a-judge. 2K
Match recall segments with story segments. 1K
soak: graph-based pipelines and tools for LLM-assisted qualitative text analysis 1K
A benchmark for feature attribution techniques 500
A Python framework for constructing, deploying, and analyzing large-scale lingui... 241
An LLM annotation experiment pipeline for computational social science. 209
Package to calculate inter-annotator agreement based on Krippendorff's alpha for... 144
A comprehensive Python package of tools for Balinese Natural Language Processing 137
XL-DURel Utility Functions 111
Word-level Quality Estimation Toolkit 109
Package for distributing annotations and calculating annotator agreement/reliabi... 106