11 dependents
Package Description Downloads/month
Prototype record matching database. 3K
A package for matching UK addresses using a pretrained Splink model 3K
Fast and simple probabilistic data matching package 2K
The Public Utility Data Liberation Project provides analysis-ready energy system... 1K
Allows ClickHouse to be used as the execution engine for Splink 700
ARC: data linking solution for Databricks with Splink 651
Generate synthetic data with a specified data generating process 475
Snowflake backend support for Splink 336
The XML-to-OCDS parser for the TEDective project based on lxml 192
Emulates the methods the US Census Bureau uses to link people across multiple da... 148
Command-line tool for deduplicating healthcare provider data using probabilistic... 56