Top "Topic-modeling" questions

Topic models describe the frequency of topics in documents and text.

Removing stopwords from a user-defined corpus in R

I have a set of documents: documents = c("She had toast for breakfast", "The coffee this morning was excellent", "For …

r tm topic-modeling

How to print the LDA topics models from gensim? Python

Using gensim I was able to extract topics from a set of documents in LSA but how do I access …

python nlp lda topic-modeling gensim

Gensim: KeyError: "word not in vocabulary"

I have a trained Word2vec model using Python's Gensim Library. I have a tokenized list as below. The vocab …

python nlp gensim word2vec topic-modeling

Remove empty documents from DocumentTermMatrix in R topicmodels?

I am doing topic modelling using the topicmodels package in R. I am creating a Corpus object, doing some basic …

r lda topic-modeling topicmodels

Understanding LDA implementation using gensim

I am trying to understand how gensim package in Python implements Latent Dirichlet Allocation. I am doing the following: Define …

python topic-modeling gensim dirichlet

Topic models: cross validation with loglikelihood or perplexity

I'm clustering documents using topic modeling. I need to come up with the optimal topic numbers. So, I decided to …

r tm cross-validation topic-modeling

LDA with topicmodels, how can I see which topics different documents belong to?

I am using LDA from the topicmodels package, and I have run it on about 30,000 documents, acquired 30 topics, and got …

r lda topic-modeling tm

LDA topic modeling - Training and testing

I have read LDA and I understand the mathematics of how the topics are generated when one inputs a collection …

lda topic-modeling

Using Word2Vec for topic modeling

I have read that the most common technique for topic modeling (extracting possible topics from text) is Latent Dirichlet allocation (…

nlp topic-modeling word2vec

How to avoid decoding to str: need a bytes-like object error in pandas?

Here is my code : data = pd.read_csv('asscsv2.csv', encoding = "ISO-8859-1", error_bad_lines=False); data_text = data[[…

python python-3.x pandas gensim topic-modeling