Skip to content

microclustr: Entity Resolution with Random Partition Priors for Microclustering

Latest
Compare
Choose a tag to compare
@resteorts resteorts released this 19 Apr 16:04
· 4 commits to master since this release

An implementation of the model in Betancourt, Zanella, Steorts (2020) arXiv:2004.02008, which performs microclustering models for categorical data. The package provides a vignette for two proposed methods in the paper as well as two standard Bayesian non-parametric clustering approaches for entity resolution. The experiments are reproducible and illustrated using a simple vignette. LICENSE: GPL-3 + file license.