Top "Gensim" questions

Gensim is a free Python framework designed to automatically extract semantic topics from documents, as efficiently (computer-wise) and painlessly (human-wise) as possible.

Chunkize warning while installing gensim

I have installed gensim (through pip) in Python. After the installation is over I get the following warning: C:\Python27\…

python gensim
Topic distribution: How do we see which document belong to which topic after doing LDA in python

I am able to run the LDA code from gensim and got the top 10 topics with their respective keywords. Now …

python nltk lda gensim
Get bigrams and trigrams in word2vec Gensim

I am currently using uni-grams in my word2vec model as follows. def review_to_sentences( review, tokenizer, remove_stopwords=…

python tokenize word2vec gensim n-gram
Error while loading Word2Vec model in gensim

I'm getting an AttributeError while loading the gensim model available at word2vec repository: from gensim import models w = models.…

python gensim word2vec
How to speed up Gensim Word2vec model load time?

I'm building a chatbot so I need to vectorize the user's input using Word2Vec. I'm using a pre-trained model …

deep-learning gensim word2vec
Gensim: How to save LDA model's produced topics to a readable format (csv,txt,etc)?

last parts of the code: lda = LdaModel(corpus=corpus,id2word=dictionary, num_topics=2) print lda bash output: INFO : adding …

python lda gensim
Gensim: TypeError: doc2bow expects an array of unicode tokens on input, not a single string

I am starting with some python task, I am facing a problem while using gensim. I am trying to load …

python gensim
Python: gensim: RuntimeError: you must first build vocabulary before training the model

I know that this question has been asked already, but I was still not able to find a solution for …

python gensim word2vec
Word2Vec: Effect of window size used

I am trying to train a word2vec model on very short phrases (5 grams). Since each sentence or example is …

gensim word2vec
How to avoid decoding to str: need a bytes-like object error in pandas?

Here is my code : data = pd.read_csv('asscsv2.csv', encoding = "ISO-8859-1", error_bad_lines=False); data_text = data[[…

python python-3.x pandas gensim topic-modeling