chomper: A Comprehensive Hit or Miss Probabilistic Entity Resolution Model

Provides Bayesian probabilistic methods for record linkage and entity resolution across multiple datasets using the Comprehensive Hit Or Miss Probabilistic Entity Resolution (CHOMPER) model. The package implements three inference approaches: (1) Evolutionary Variational Inference for record Linkage (EVIL), (2) Coordinate Ascent Variational Inference (CAVI), and (3) Markov chain Monte Carlo (MCMC) with a split-and-merge process. The model supports both discrete and continuous fields, and it applies a locally varying hit mechanism to attributes that can have multiple true values. It also provides tools for evaluating performance from either the approximated variational factors or posterior samples. EVIL supports multi-threaded parallel computation so that the linkage structure can be estimated faster.

Version: 0.1.3
Depends: R (≥ 3.5)
Imports: Rcpp
LinkingTo: Rcpp, RcppArmadillo, RcppThread
Suggests: blink, ggplot2, knitr, patchwork, rmarkdown, salso, spelling
Published: 2026-03-16
DOI: 10.32614/CRAN.package.chomper (may not be active yet)
Author: Hyungjoon Kim [aut, cre], Andee Kaplan [aut], Matthew Koslovsky [aut]
Maintainer: Hyungjoon Kim <hjkim8987 at gmail.com>
BugReports: https://github.com/hjkim8987/chomper/issues
License: GPL (≥ 3)
URL: https://github.com/hjkim8987/chomper
NeedsCompilation: yes
Language: en-US
Materials: README
CRAN checks: chomper results

Documentation:

Reference manual: chomper.html, chomper.pdf
Vignettes: Introduction to 'chomper' package (source, R code)

Downloads:

Package source: chomper_0.1.3.tar.gz
Windows binaries: r-devel: not available, r-release: not available, r-oldrel: not available
macOS binaries: r-release (arm64): not available, r-oldrel (arm64): not available, r-release (x86_64): not available, r-oldrel (x86_64): not available
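
Because no pre-built binaries are listed above and the package needs compilation, installation currently means building from source. The lines below are a minimal sketch using only base R functions; they assume a working C++ toolchain is already installed (for example Rtools on Windows or the Xcode command line tools on macOS), and the mirror URL is just one possible choice. No function from the package itself is called, since its exported API is not described on this page.

```r
# Build and install from the source tarball on CRAN
# (assumes a C++ toolchain is available; the mirror is an arbitrary choice).
install.packages("chomper", type = "source",
                 repos = "https://cloud.r-project.org")

# Attach the package, then browse its documentation.
library(chomper)
vignette(package = "chomper")  # lists the vignettes shipped with the package
help(package = "chomper")      # opens the index of documented functions
```

If compilation fails, the CRAN checks entry above is the place to see whether the failure is specific to a platform.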

Linking:

Please use the canonical form https://CRAN.R-project.org/package=chomper to link to this page.