stringdist: Approximate String Matching and String Distance Functions

Implements an approximate string matching version of R's native 'match' function. Can calculate various string distances based on edits (Damerau-Levenshtein, Hamming, Levenshtein, optimal sting alignment), qgrams (q- gram, cosine, jaccard distance) or heuristic metrics (Jaro, Jaro-Winkler). An implementation of soundex is provided as well. Distances can be computed between character vectors while taking proper care of encoding or between integer vectors representing generic sequences.

Version: 0.9.4.6
Depends: R (≥ 2.15.3)
Imports: parallel
Suggests: testthat
Published: 2017-07-31
Author: Mark van der Loo [aut, cre], Jan van der Laan [ctb], R Core Team [ctb], Nick Logan [ctb]
Maintainer: Mark van der Loo <mark.vanderloo at gmail.com>
BugReports: https://github.com/markvanderloo/stringdist/issues
License: GPL-3
URL: https://github.com/markvanderloo/stringdist
NeedsCompilation: yes
Citation: stringdist citation info
Materials: NEWS
In views: NaturalLanguageProcessing, OfficialStatistics
CRAN checks: stringdist results

Downloads:

Reference manual: stringdist.pdf
Package source: stringdist_0.9.4.6.tar.gz
Windows binaries: r-devel: stringdist_0.9.4.6.zip, r-release: stringdist_0.9.4.6.zip, r-oldrel: stringdist_0.9.4.6.zip
OS X El Capitan binaries: r-release: stringdist_0.9.4.6.tgz
OS X Mavericks binaries: r-oldrel: stringdist_0.9.4.6.tgz
Old sources: stringdist archive

Reverse dependencies:

Reverse depends: AurieLSHGaussian, blink, vwr
Reverse imports: available, bcRep, bdlp, bibliometrix, deductive, diffrprojects, fastLink, fcuk, flora, genBaRcode, GetLattesData, lime, lingtypology, lintr, PGRdup, qdap, rabi, refinr, revtools, SentimentAnalysis, sjmisc, tcR, tidystringdist, TSTr, utilsIPEA
Reverse suggests: googleLanguageR, rlist, spew

Linking:

Please use the canonical form https://CRAN.R-project.org/package=stringdist to link to this page.