Top "tm" questions

The `tm` package (shorthand for Text Mining Infrastructure in R) provides a framework for text mining applications within R.

Are Snowball & SnowballC packages different in R?

I am using `stemDocument` to stem a text document with the tm package in R. Example code: `data("crude")` `crude[[1]]` `stemDocument(crude[[1]])` …

r stemming tm snowball
Dictionary() is not supported anymore in tm package. How do I amend my code?

I just noticed that after updating to tm v. 0.5-10 the function Dictionary() is not supported anymore. Is this an …

r tm
Shiny app fails with "argument 1 (type 'closure') cannot be handled by 'cat'" - what does this mean?

I am building a Shiny app that takes a user's text input, compares the last two words to a data …

r shiny tm
R: Calculate cosine distance from a term-document matrix with tm and proxy

I want to calculate the cosine distance among authors of a corpus. Let's take a corpus of 20 documents. require(tm) …

r matrix proxy tm
Removing overly common words (occur in more than 80% of the documents) in R

I am working with the 'tm' package to create a corpus. I have done most of the preprocessing steps. …

r text-mining tm
R stemming a string/document/corpus

I'm trying to do some stemming in R but it only seems to work on individual documents. My end goal …

r nlp stemming tm
How to remove punctuation exactly when using R with the tm package

Update: I think I may have a workaround for this problem, which is to add one line of code: `dtms = removeSparseTerms(dtm, 0.1)` …

r customization text-mining tm punctuation
R and tm package: create a term-document matrix with a dictionary of one or two words?

Purpose: I want to create a term-document matrix using a dictionary which has compound words, or bigrams, as some of …

r tm n-gram term-document-matrix rweka
R Corpus Is Messing Up My UTF-8 Encoded Text

I am simply trying to create a corpus from Russian, UTF-8 encoded text. The problem is, the Corpus method from …

r encoding utf-8 tm corpus
no applicable method for 'tm_map' applied to an object of class "character"

My data looks like this: 1. Good quality, love the taste, the only ramen noodles we buy but they're available at …

r matrix text-mining tm