Top "Corpus" questions

A corpus most commonly refers to a collection of structured text.

How can I change the default MySQL connection timeout when connecting through Python?

I connected to a MySQL database using Python: con = _mysql.connect('localhost', 'dell-pc', '', 'test'). The program that I wrote …

python mysql corpus
How to create a word cloud from a corpus in Python?

From "Creating a subset of words from a corpus in R", the answerer can easily convert a term-document matrix into …

python nltk corpus gensim word-cloud
Creating a new corpus with NLTK

I reckon that often the answer to my title is to go and read the documentation, but I ran through …

python nlp nltk corpus
DocumentTermMatrix error on Corpus argument

I have the following code: # returns string w/o leading or trailing whitespace trim <- function (x) gsub("^\\s+|\\…

r tm corpus
Programmatically install NLTK corpora / models, i.e. without the GUI downloader?

My project uses the NLTK. How can I list the project's corpus & model requirements so they can be automatically …

installation packages nltk requirements corpus
How to show corpus text in R tm package?

I'm completely new to R and the tm package, so please excuse my stupid question ;-) How can I show the …

r tm corpus
Adding custom stopwords in R tm

I have a Corpus in R using the tm package. I am applying the removeWords function to remove stopwords tm_…

r text-mining stop-words corpus tm
R tm removeWords function not removing words

I am trying to remove some words from a corpus I have built but it doesn't seem to be working. …

r text text-mining tm corpus
R tm package VCorpus: Error in converting corpus to data frame

I am using the tm package to clean up some data using the following code: mycorpus <- Corpus(VectorSource(…

r tm corpus
Make dataframe of top N frequent terms for multiple corpora using tm package in R

I have several TermDocumentMatrix objects created with the tm package in R. I want to find the 10 most frequent terms in …

r text-mining corpus tm