Package: reclin2 0.5.0
reclin2: Record Linkage Toolkit
Functions to assist in performing probabilistic record linkage and deduplication: generating pairs, comparing records, an EM algorithm for estimating m- and u-probabilities (I. Fellegi & A. Sunter (1969) <doi:10.1080/01621459.1969.10501049>; T.N. Herzog, F.J. Scheuren, & W.E. Winkler (2007), "Data Quality and Record Linkage Techniques", ISBN:978-0-387-69502-0), and forcing one-to-one matching. Can also be used for pre- and post-processing for machine learning methods for record linkage. The focus is on memory use, CPU performance, and flexibility.
Authors: Jan van der Laan
reclin2_0.5.0.tar.gz
reclin2_0.5.0.zip (r-4.5), reclin2_0.5.0.zip (r-4.4), reclin2_0.5.0.zip (r-4.3)
reclin2_0.5.0.tgz (r-4.4-x86_64), reclin2_0.5.0.tgz (r-4.4-arm64), reclin2_0.5.0.tgz (r-4.3-x86_64), reclin2_0.5.0.tgz (r-4.3-arm64)
reclin2_0.5.0.tar.gz (r-4.5-noble), reclin2_0.5.0.tar.gz (r-4.4-noble)
reclin2_0.5.0.tgz (r-4.4-emscripten), reclin2_0.5.0.tgz (r-4.3-emscripten)
reclin2.pdf | reclin2.html
reclin2/json (API)
NEWS
# Install 'reclin2' in R:
install.packages('reclin2', repos = c('https://djvanderlaan.r-universe.dev', 'https://cloud.r-project.org'))
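The workflow named in the package description (generating pairs, comparing records, estimating m- and u-probabilities with EM, forcing one-to-one matching) can be sketched with the exported functions and the bundled example datasets. This is a minimal sketch, not the author's reference usage: the argument names and the column names (`postcode`, `lastname`, `firstname`, `address`, `sex`) are assumptions based on the package's introduction vignette and should be checked against the reference manual. The blocking variable, the 0.9 similarity threshold, and the 0 weight threshold are arbitrary illustrative choices.

```r
library(reclin2)

# Bundled example datasets (see the dataset list on this page)
data("linkexample1", "linkexample2")

# 1. Generate candidate pairs, keeping only pairs that agree on postcode
#    (the blocking variable is an assumption for this sketch)
pairs <- pair_blocking(linkexample1, linkexample2, "postcode")

# 2. Compare the records in each pair on the linkage variables,
#    using Jaro-Winkler similarity with an illustrative threshold of 0.9
pairs <- compare_pairs(
  pairs,
  on = c("lastname", "firstname", "address", "sex"),
  default_comparator = cmp_jarowinkler(0.9)
)

# 3. Estimate m- and u-probabilities with the EM algorithm
m <- problink_em(~ lastname + firstname + address + sex, data = pairs)

# 4. Add a linkage weight for each pair to the pairs table
pairs <- predict(m, pairs = pairs, add = TRUE)

# 5. Force one-to-one matching among pairs with weight above 0
pairs <- select_n_to_m(pairs, variable = "ntom", score = "weights", threshold = 0)

# 6. Build the linked dataset from the selected pairs
linked <- link(pairs, selection = "ntom")
```

The resulting `linked` table holds the selected record pairs from both datasets. In practice the weight threshold would be chosen by inspecting the weight distribution rather than fixed at 0.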
Bug tracker: https://github.com/djvanderlaan/reclin2/issues
- linkexample1 - Tiny example dataset for probabilistic linkage
- linkexample2 - Tiny example dataset for probabilistic linkage
- town_names - Spelling variations of a set of town names
Last updated 10 months ago from: 1739f2db25. Checks: OK: 7, NOTE: 2. Indexed: yes.
Target | Result | Date |
---|---|---|
Doc / Vignettes | OK | Nov 05 2024 |
R-4.5-win-x86_64 | NOTE | Nov 05 2024 |
R-4.5-linux-x86_64 | NOTE | Nov 05 2024 |
R-4.4-win-x86_64 | OK | Nov 05 2024 |
R-4.4-mac-x86_64 | OK | Nov 05 2024 |
R-4.4-mac-aarch64 | OK | Nov 05 2024 |
R-4.3-win-x86_64 | OK | Nov 05 2024 |
R-4.3-mac-x86_64 | OK | Nov 05 2024 |
R-4.3-mac-aarch64 | OK | Nov 05 2024 |
Exports: add_from_x, add_from_y, cluster_call, cluster_collect, cluster_modify_pairs, cluster_pair, cluster_pair_blocking, cluster_pair_minsim, cmp_identical, cmp_jaccard, cmp_jarowinkler, cmp_lcs, compare_pairs, compare_vars, deduplicate_equivalence, get_inspect_pairs, greedy, jaccard, jaro_winkler, lcs, link, match_n_to_m, merge_pairs, pair, pair_blocking, pair_minsim, problink_em, score_simple, select_greedy, select_n_to_m, select_threshold, select_unique, tabulate_patterns
Dependencies: data.table, lpSolve, Rcpp, stringdist
Deduplication using reclin2
Rendered from deduplication.md using simplermarkdown::mdweave_to_html on Nov 05 2024.
Last update: 2023-07-06
Started: 2021-11-08
Introduction to reclin2
Rendered from introduction.md using simplermarkdown::mdweave_to_html on Nov 05 2024.
Last update: 2023-08-25
Started: 2021-12-19
Record linkage using machine learning
Rendered from record_linkage_using_machine_learning.md using simplermarkdown::mdweave_to_html on Nov 05 2024.
Last update: 2023-07-06
Started: 2021-11-09
Using a cluster for record linkage
Rendered from using_a_cluster_for_record_linkage.md using simplermarkdown::mdweave_to_html on Nov 05 2024.
Last update: 2023-07-06
Started: 2022-01-05