SnowballC: Snowball stemmers based on the C libstemmer UTF-8 library

An R interface to the C libstemmer library that implements Porter's word stemming algorithm for collapsing words to a common root to aid comparison of vocabulary. Currently supported languages are Danish, Dutch, English, Finnish, French, German, Hungarian, Italian, Norwegian, Portuguese, Romanian, Russian, Spanish, Swedish and Turkish.
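
As a minimal sketch of what collapsing words to a common root looks like in practice, the snippet below calls the package's wordStem() and getStemLanguages() functions. The sample words and the choice of "english" are illustrative assumptions, not taken from this page, and the set of available languages depends on the installed version.

```r
library(SnowballC)

# List the stemmer languages available in the installed version
langs <- getStemLanguages()

# Illustrative input (assumed for this sketch): related word forms
words <- c("win", "winning", "winner")

# Stem each word with the English stemmer
stems <- wordStem(words, language = "english")

# Group the original words by stem so that forms sharing a root
# end up in the same bucket
groups <- split(words, stems)
print(groups)
```

Grouping by stem is one simple way to compare vocabulary across documents: forms that reduce to the same root are counted together rather than as separate terms.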

Version: 0.5.1
Published: 2014-08-09
Author: Milan Bouchet-Valat [aut, cre]
Maintainer: Milan Bouchet-Valat <nalimilan at>
License: BSD_2_clause + file LICENSE
Copyright: Dr Martin Porter (2001) for the libstemmer C library, and Milan Bouchet-Valat (2013) for the R package contents
NeedsCompilation: yes
Materials: NEWS
In views: NaturalLanguageProcessing
CRAN checks: SnowballC results


Reference manual: SnowballC.pdf
Package source: SnowballC_0.5.1.tar.gz
Windows binaries: r-devel:, r-release:, r-oldrel:
OS X El Capitan binaries: r-release: SnowballC_0.5.1.tgz
OS X Mavericks binaries: r-oldrel: SnowballC_0.5.1.tgz
Old sources: SnowballC archive

Reverse dependencies:

Reverse depends: lsa, RWBP
Reverse imports: available, bibliometrix, corpustools, DeducerText, gofastr, goldi, inpdfr, lexRankr, NLPutils, proustr, ptstem, quanteda, revtools, SentimentAnalysis, slowraker, stmCorrViz, textmineR, textmining, textstem, tokenizers
Reverse suggests: koRpus, movMF, qdap, rattle, RcmdrPlugin.temis, stm, textreg, tm, topicmodels

