PyPI Stats

Search Packages

Find Python packages by name, description, or GitHub topic, or filter by metrics
Microsoft
presidio-analyzer

An open-source framework for detecting, redacting, masking, and anonymizing sensitive data (PII) across text, images, and structured data. Supports NLP, pattern matching, and customizable pipelines.

4.5M 8K 1K
Microsoft
presidio-anonymizer

An open-source framework for detecting, redacting, masking, and anonymizing sensitive data (PII) across text, images, and structured data. Supports NLP, pattern matching, and customizable pipelines.

3.3M 8K 1K
ethyca
ethyca-fides

The Privacy Engineering & Compliance Framework

90K 454 89
datafog
datafog

Python SDK for PII detection and redaction in text and images, combining regex + NLP pipelines for production privacy workflows.

54K 54 13
Microsoft
presidio-image-redactor

An open-source framework for detecting, redacting, masking, and anonymizing sensitive data (PII) across text, images, and structured data. Supports NLP, pattern matching, and customizable pipelines.

30K 8K 1K
microsoft
presidio-structured

An open-source framework for detecting, redacting, masking, and anonymizing sensitive data (PII) across text, images, and structured data. Supports NLP, pattern matching, and customizable pipelines.

14K 8K 1K
IBM
diffprivlib

Diffprivlib: The IBM Differential Privacy Library

13K 911 208
ashutoshrana
enterprise-rag-patterns

Cross-industry compliance patterns for RAG pipelines: FERPA, HIPAA, GDPR, NIST AI RMF, OWASP LLM Top 10, and more. Vector store adapters, framework integrations, and audit logging.

13K 0 0
Microsoft
presidio

An open-source framework for detecting, redacting, masking, and anonymizing sensitive data (PII) across text, images, and structured data. Supports NLP, pattern matching, and customizable pipelines.

7K 8K 1K
IFCA-Advanced-Computing
pycanon

pyCANON is a Python library and CLI to assess the values of the parameters associated with the most common privacy-preserving techniques.

3K 52 9
AI-SDC
acro

Tools for the Semi-Automatic Checking of Research Outputs. These are tools for researchers to use as drop-in replacements for common analysis commands.

3K 22 12
martaajonees
clip-protocol

This repository contains an adaptation of differential privacy algorithms for learning analytics

2K 0 1
dataxid
dataxid

The Synthetic Data API. Generate privacy-safe synthetic data with 5 lines of code.

2K 26 7
ashutoshrana
ferpa-haystack

FERPA-compliant document filter for Haystack RAG pipelines — identity-scoped pre-filtering before LLM context

1K 0 0
ethyca
fidesctl

CLI for Fides

1K 454 89
Tatarinho
llm-safe-pl

[DEPRECATED — use pii-toolkit] Reversible Polish PII anonymization for LLM workflows. Successor packages: pii-veil, pii-core, pii-presidio.

1K 2 0
AI-SDC
aisdc

Tools for the statistical disclosure control of machine learning models

1K 36 8
brootware
pyredactkit

Python CLI tool to redact and un-redact sensitive data from text files. 🔐📝

1K 50 7
mahadillahm4di-cyber
mh-gdpr-ai

Your LLM prompt has a name in it. It just crossed the Atlantic. That's a GDPR violation. This fixes it in 3 lines.

795 5 0
nightfallai
nightfall

Python Data Loss Prevention (DLP) SDK - Nightfall Developer Platform

581 25 13
Samuel-Maddock
pure-ldp

Python package for simple implementations of state-of-the-art LDP frequency estimation algorithms. Contains code for our VLDB 2021 Paper.

501 78 14
senzing-garage
sz-semantics

Transform JSON output from Senzing SDK for use with graph technologies, semantics, and downstream LLM integrations

438 18 3
martaajonees
privadjust

This repository contains an adaptation of differential privacy algorithms for learning analytics

434 0 1
AI-SDC
sacroml

Tools for the statistical disclosure control of machine learning models

395 36 8
    • Data from PyPI, GitHub, ClickHouse, and BigQuery