Top "Term-document-matrix" questions

A document-term matrix or term-document matrix is a mathematical matrix that describes the frequency of terms that occur in a collection of documents.

Error converting text to lowercase with tm_map(..., tolower)

I tried using the tm_map. It gave the following error. How can I get around this? require(tm) byword&…

r tm lowercase term-document-matrix
list of word frequencies using R

I have been using the tm package to run some text analysis. My problem is with creating a list with …

r text-mining word-frequency term-document-matrix
efficient Term Document Matrix with NLTK

I am trying to create a term document matrix with NLTK and pandas. I wrote the following function: def fnDTM_…

python pandas nltk term-document-matrix
How to build a Term-Document-Matrix from a set of texts and a specific set of terms (tags)?

I have two sets of data: a set of tags (single words like php, html, etc) a set of texts …

r term-document-matrix
TermDocumentMatrix errors in R

I have been working through numerous online examples of the {tm} package in R, attempting to create a TermDocumentMatrix. Creating …

r text-mining tm corpus term-document-matrix
More efficient means of creating a corpus and DTM with 4M rows

My file has over 4M rows and I need a more efficient way of converting my data to a corpus …

r data.table corpus term-document-matrix qdap
R and tm package: create a term-document matrix with a dictionary of one or two words?

Purpose: I want to create a term-document matrix using a dictionary which has compound words, or bigrams, as some of …

r tm n-gram term-document-matrix rweka
Creating a Term Document Matrix from Text File

I'm trying to read one text file and create a term document matrix using textmining packages. I can create term …

python matrix term-document-matrix
Frequency Per Term - R TM DocumentTermMatrix

I'm very new to R and cannot quite wrap my head around DocumentTermMatrixs. I have a DocumentTermMatrix created with the …

r tm term-document-matrix
findAssocs for multiple terms in R

In R I used the [tm package][1] for building a term-document matrix from a corpus of documents. My goal is …

r text-mining term-document-matrix