stringdist: Approximate String Matching and String Distance Functions

Implements an approximate string matching version of R's native 'match' function. Can calculate various string distances based on edits (Damerau-Levenshtein, Hamming, Levenshtein, optimal sting alignment), qgrams (q- gram, cosine, jaccard distance) or heuristic metrics (Jaro, Jaro-Winkler). An implementation of soundex is provided as well. Distances can be computed between character vectors while taking proper care of encoding or between integer vectors representing generic sequences.

Depends: R (≥ 2.15.3)
Imports: parallel
Suggests: testthat
Published: 2017-07-31
Author: Mark van der Loo [aut, cre], Jan van der Laan [ctb], R Core Team [ctb], Nick Logan [ctb]
Maintainer: Mark van der Loo <mark.vanderloo at>
License: GPL-3
NeedsCompilation: yes
Citation: stringdist citation info
Materials: NEWS
In views: NaturalLanguageProcessing, OfficialStatistics
CRAN checks: stringdist results


Reference manual: stringdist.pdf
Package source: stringdist_0.9.4.6.tar.gz
Windows binaries: r-devel:, r-release:, r-oldrel:
OS X El Capitan binaries: r-release: stringdist_0.9.4.6.tgz
OS X Mavericks binaries: r-oldrel: stringdist_0.9.4.6.tgz
Old sources: stringdist archive

Reverse dependencies:

Reverse depends: AurieLSHGaussian, blink, vwr
Reverse imports: available, bcRep, bdlp, bibliometrix, deductive, diffrprojects, fastLink, fcuk, flora, genBaRcode, GetLattesData, lime, lingtypology, lintr, PGRdup, qdap, rabi, refinr, revtools, SentimentAnalysis, sjmisc, tcR, tidystringdist, TSTr, utilsIPEA
Reverse suggests: googleLanguageR, rlist, spew


Please use the canonical form to link to this page.