Lists (1)
Sort Name ascending (A-Z)
Stars
An open source, high scalability toolkit in Java for Entity Resolution.
Deepparse is a state-of-the-art library for parsing multinational street addresses using deep learning
A Python package for pool-based active evaluation
Similarity and distance measures for clustering and record linkage applications in R
Bayesian Entity Resolution with Exchangeable Random Partition Priors
Distributed Bayesian Entity Resolution in Apache Spark
pyspark-parallelised functions producing graph-theoretical metrics in connected component clusters for use in record-linkage (or other domains)
R package fastLink: Fast Probabilistic Record Linkage
Fast, accurate and scalable probabilistic data linkage with support for multiple SQL backends
multilink is an R package which implements methodology presented in the manuscript "Multifile Partitioning for Record Linkage and Duplicate Detection", available on arXiv: https://arxiv.org/abs/211…
A distill blog and showcase about building distill websites and blogs!