PyPI Stats

Search Packages

Find Python packages by name, description, GitHub topic, or filter by metrics
cereja-project
cereja

Cereja is a bundle of useful functions we don't want to rewrite and... just pure fun!

9K 29 12
Karan-Malik
prepdata

Automating the process of Data Preprocessing for Data Science

893 8 3
vkreat-tech
ctrl4ai

A helper package for Machine Learning and Deep Learning Algorithms

855 10 2
ChenTaHung
mobpy

Monotonic Optimal Binning algorithm is a statistical approach to transform continuous variables into optimal and monotonic categorical variables.

649 19 2
mannasoumya
imputerapi

Data Imputer API

297 0 0
sayanmondal2098
easytoken

Tokenizer is an independent, open-source natural language processing Python library that implements a tokenizer to create tokens from both sentences and paragraphs.

222 1 5
Kawai-Senpai
ultraclean

UltraClean is a fast and efficient Python library for cleaning and preprocessing text data for AI/ML tasks and data processing.

192 2 0
vishallmaurya
veda-lib

veda_lib: A Python library designed to streamline the transition from raw data to machine learning models. It automates and simplifies data preprocessing, cleaning, and balancing, addressing the time-consuming and complex aspects of these tasks to provide clean and ready-to-use data.

153 0 0
ENGRZULQARNAIN
scrapysub

ScrapySub is a Python library designed to recursively scrape website content, including subpages. It fetches the visible text from web pages and stores it in a structured format for easy access and analysis. This library is particularly useful for NLP and AI developers who need to gather large amounts of web content for their projects.

136 4 0
AntoinePinto
stringpairfinder

Package designed to match strings by similarity

135 4 1
    • Data from PyPI, GitHub, ClickHouse, and BigQuery