PyPI Stats
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About
Home

Search Packages

Find Python packages by name, description, GitHub topic, or filter by metrics
michaelscutari
protclust

protclust is a Python library for protein sequence analysis that integrates MMseqs2 for fast clustering and provides tools for creating robust machine learning datasets. It offers cluster-aware data splitting to prevent sequence similarity bias in model evaluation, along with comprehensive protein embedding capabilities for feature generation.

381 4 0
michaelscutari
mmseqspy

Python utilities for protein sequence clustering and dataset splitting with MMseqs2

81 4 0
    • Data from PyPI, GitHub, ClickHouse, and BigQuery