Top "Tm" questions

The `tm` package (shorthand for Text Mining Infrastructure in R) provides a framework for text mining applications within R.

Adding custom stopwords in R tm

I have a Corpus in R using the tm package. I am applying the removeWords function to remove stopwords tm_…

r text-mining stop-words corpus tm
R tm removeWords function not removing words

I am trying to remove some words from a corpus I have built but it doesn't seem to be working. …

r text text-mining tm corpus
R tm package vcorpus: Error in converting corpus to data frame

I am using the tm package to clean up some data using the following code: mycorpus <- Corpus(VectorSource(…

r tm corpus
Use R to convert PDF files to text files for text mining

I have nearly one thousand pdf journal articles in a folder. I need to text mine on all article's abstracts …

r text-mining tm pdftotext
How does the removeSparseTerms in R work?

I am using the removeSparseTerms method in R and it required a threshold value to be input. I also read …

r tm lda
Removing non-English text from Corpus in R using tm()

I am using tm() and wordcloud() for some basic data-mining in R, but am running into difficulties because there are …

r tm
Topic models: cross validation with loglikelihood or perplexity

I'm clustering documents using topic modeling. I need to come up with the optimal topic numbers. So, I decided to …

r tm cross-validation topic-modeling
Finding ngrams in R and comparing ngrams across corpora

I'm getting started with the tm package in R, so please bear with me and apologies for the big ol' …

r text-mining n-gram tm
LDA with topicmodels, how can I see which topics different documents belong to?

I am using LDA from the topicmodels package, and I have run it on about 30.000 documents, acquired 30 topics, and got …

r lda topic-modeling tm
Trying to get tf-idf weighting working in R

I am trying to do some very basic text analysis with the tm package and get some tf-idf scores; I'm …

r tm tf-idf text-analysis