Package: fuzzystring 0.0.5

fuzzystring: Fast Fuzzy String Joins for Data Frames

Perform fuzzy joins on data frames using approximate string matching. Implements inner, left, right, full, semi, and anti joins with string distance metrics from the 'stringdist' package, including Optimal String Alignment, Levenshtein, Damerau-Levenshtein, Jaro-Winkler, q-gram, cosine, Jaccard, and Soundex. Uses a 'data.table' backend plus compiled 'C++' result assembly to reduce overhead in large joins, while adaptive candidate planning avoids unnecessary distance evaluations in single-column string joins. Suitable for reconciling misspellings, inconsistent labels, and other near-match identifiers while optionally returning the computed distance for each match. Bibliographic references include Van der Loo, M. P. J. (2014) <https://CRAN.R-project.org/package=stringdist> and Robinson, D. (2015) <https://github.com/dgrtwo/fuzzyjoin>.

Authors:Paul E. Santos Andrade [aut, cre, cph], David Robinson [ctb]

fuzzystring_0.0.5.tar.gz
fuzzystring_0.0.5.zip(r-4.7)fuzzystring_0.0.5.zip(r-4.6)fuzzystring_0.0.5.zip(r-4.5)
fuzzystring_0.0.5.tgz(r-4.6-x86_64)fuzzystring_0.0.5.tgz(r-4.6-arm64)fuzzystring_0.0.5.tgz(r-4.5-x86_64)fuzzystring_0.0.5.tgz(r-4.5-arm64)
fuzzystring_0.0.5.tar.gz(r-4.7-arm64)fuzzystring_0.0.5.tar.gz(r-4.7-x86_64)fuzzystring_0.0.5.tar.gz(r-4.6-arm64)fuzzystring_0.0.5.tar.gz(r-4.6-x86_64)
fuzzystring_0.0.5.tgz(r-4.6-emscripten)
manual.pdf |manual.html
DESCRIPTION |NEWS
card.svg |card.png
fuzzystring/json (API)

# Install 'fuzzystring' in R:
install.packages('fuzzystring', repos = c('https://paulesantos.r-universe.dev', 'https://cloud.r-project.org'))

Bug tracker:https://github.com/paulesantos/fuzzystring/issues

Pkgdown/docs site:https://paulesantos.github.io

Uses libs:
  • c++– GNU Standard C++ Library v3
Datasets:
  • misspellings - A corpus of common misspellings, for examples and practice

On CRAN:

Conda:

cpp

5.48 score 5 stars 499 downloads 7 exports 3 dependencies

Last updated from:8af2f2baf1. Checks:13 OK. Indexed: yes.

TargetResultTimeFilesSyslog
linux-devel-arm64OK138
linux-devel-x86_64OK140
source / vignettesOK196
linux-release-arm64OK143
linux-release-x86_64OK126
macos-release-arm64OK95
macos-release-x86_64OK178
macos-oldrel-arm64OK95
macos-oldrel-x86_64OK277
windows-develOK121
windows-releaseOK91
windows-oldrelOK102
wasm-releaseOK116

Exports:fuzzystring_anti_joinfuzzystring_full_joinfuzzystring_inner_joinfuzzystring_joinfuzzystring_left_joinfuzzystring_right_joinfuzzystring_semi_join

Dependencies:data.tableRcppstringdist

Benchmarking fuzzystring against fuzzyjoin
Overview | Benchmark Script | Results | Absolute Runtime | Relative Runtime | Interpretation

Last update: 2026-05-25
Started: 2026-03-27

Getting Started with fuzzystring
Introduction | Installation | Quick Start | Key Features | All Join Types Supported | Inner join | Left join | Right join | Full join | Semi join (rows from x with a match in y) | Anti join (rows from x without a match in y) | Using the generic fuzzystring_join() | Multiple Distance Methods | Case-Insensitive Matching | Advanced Usage | Multiple Column Joins | Performance

Last update: 2026-03-27
Started: 2026-01-25

Readme and manuals

Help Manual

Help pageTopics
fuzzystring: Fast fuzzy string joins for data framesfuzzystring-package fuzzystring
Join two tables based on fuzzy string matchingfuzzystring_anti_join fuzzystring_full_join fuzzystring_inner_join fuzzystring_join fuzzystring_left_join fuzzystring_right_join fuzzystring_semi_join
A corpus of common misspellings, for examples and practicemisspellings