reclin2: Record Linkage Toolkit

Functions to assist in performing probabilistic record linkage and deduplication: generating pairs, comparing records, em-algorithm for estimating m- and u-probabilities (I. Fellegi & A. Sunter (1969) <doi:10.1080/01621459.1969.10501049>, T.N. Herzog, F.J. Scheuren, & W.E. Winkler (2007), "Data Quality and Record Linkage Techniques", ISBN:978-0-387-69502-0), forcing one-to-one matching. Can also be used for pre- and post-processing for machine learning methods for record linkage. Focus is on memory, CPU performance and flexibility.

Version: 0.5.0
Depends: data.table, R (≥ 3.6.0)
Imports: stringdist, stats, utils, lpSolve, Rcpp, parallel
LinkingTo: Rcpp
Suggests: simplermarkdown
Published: 2024-02-09
DOI: 10.32614/CRAN.package.reclin2
Author: Jan van der Laan ORCID iD [aut, cre]
Maintainer: Jan van der Laan <r at>
License: GPL-3
NeedsCompilation: yes
Materials: README NEWS
In views: OfficialStatistics
CRAN checks: reclin2 results


Reference manual: reclin2.pdf
Vignettes: Deduplication using reclin2
Introduction to reclin2
Record linkage using machine learning
Using a cluster for record linkage


Package source: reclin2_0.5.0.tar.gz
Windows binaries: r-devel:, r-release:, r-oldrel:
macOS binaries: r-release (arm64): reclin2_0.5.0.tgz, r-oldrel (arm64): reclin2_0.5.0.tgz, r-release (x86_64): reclin2_0.5.0.tgz, r-oldrel (x86_64): reclin2_0.5.0.tgz
Old sources: reclin2 archive

Reverse dependencies:

Reverse imports: PriceIndices


Please use the canonical form to link to this page.