187 dependents
Package Description Downloads/month
Leveraging BERT and c-TF-IDF to create easily interpretable topics. 381K
This is an open-source version of the representation engineering framework for s... 167K
QuerySource is a tool for querying different databases (or REST endpoints) using... 77K
A lightweight framework for benchmarking multimodal AI agents with parallel exec... 30K
YData allows to use the *Data-Centric* tools from the YData ecosystem to acceler... 29K
exprmat: Routines for expression matrices 15K
Kura is a tool for analysing and visualising chat data 9K
Top2Vec learns jointly embedded topic, document and word vectors. 8K
A Python library for the Reflexio 8K
Hail helper functions for the gnomAD project and Translational Genomics Group 4K
Very fast, accurate speaker diarization 4K
Visualization Package Leveraging SVG in Jupyter Notebooks 4K
Behavioral sequencing and phenotyping with lightweight task specific adaptation 3K
A comprehensive desktop application for visualizing, querying, and managing vect... 3K
A Python package for audio analysis and machine learning-based audio classificat... 2K
Sliced Detection and Clustering Analysis Toolkit - Developed by MBARI 2K
Neural model for next clinical event prediction from EHR sequences using the Nar... 2K
2nd brain for YouTube 2K
BERTrend analyses topic evolution over time using state-of-the-art transformer m... 2K
Explore how your notes connect to each other and surface real-time clusters from... 2K
Interpretable clustering and graph-based visualization of painting collections 2K
Identifying methlyation motifs in nanopore data 2K
MEGAN: Multi Explanation Graph Attention Network 2K
NarrativeMapper is a text analysis pipeline that uncovers the dominant narrative... 2K
Image processing pipeline to automatically crop and quantify in vivo bioluminesc... 2K
Flare-Sensitive Clustering based on HDBSCAN*. 2K
A friendly way to do link, aggregate, cluster and de-duplicate dataframes using ... 2K
offline/online spike sorting with french touch that light the barbecue 2K
A Repository for Single- and Multi-modal Speaker Verification, Speaker Recogniti... 2K
Spatio-Temporal Tag and Photo Location Clustering for generating Tag Maps 1K
Nordlys is an AI lab building a Mixture of Models. This repository contains the ... 1K
ATaRVa - Analysis of Tandem Repeat Variation 1K
Insert Description 1K
Package to analyze calcium fluorescence events in astrocytes 1K
Dead-simple local vector database powered by usearch HNSW. 1K
Clustering for mixed-type data 1K
The Asynchronous Data Dynamo and Graph Neural Network Catalyst 1K
Aligned Neural Topic Model (ANTM) for Exploring Evolving Topics: a dynamic neura... 1K
A materials discovery algorithm geared towards exploring high-performance candid... 1K
soak: graph-based pipelines and tools for LLM-assisted qualitative text analysis 1K
Publication Literature Miner using PubMed API, Document Ingestion, embedding-bas... 972
Tools for interactive visual inspection of semantic embeddings. 956
ArchiTXT is an open source Python library that transforms unstructured text into... 931
Episodiq: Pattern mining tool for agentic trajectories. 875
Kosh allows codes to store, query, share data via an easy-to-use Python API. Kos... 870
A professional tool for cleaning duplicate or near-duplicate image frames using ... 853
A toolbox for text analysis, clustering, topic modelling, information and keywor... 834
Python library for analyzing data quality and its impact on model performance ac... 827
A python library for social event detection 793
TriTan: An efficient triple non-negative matrix factorisation method for integra... 754