Top "Text-mining" questions

Text mining is the process of deriving high-quality information from unstructured text.

What is CoNLL data format?

I am new to text mining. I am using an open-source JAR (Mate Parser) which gives me output in …

nlp text-parsing text-mining information-extraction
R tm removeWords function not removing words

I am trying to remove some words from a corpus I have built but it doesn't seem to be working. …

r text text-mining tm corpus
Use R to convert PDF files to text files for text mining

I have nearly one thousand PDF journal articles in a folder. I need to text-mine all of the articles' abstracts …

r text-mining tm pdftotext
How to find the closest word to a vector using word2vec

I have just started using word2vec and I was wondering how we can find the closest word to a …

python text-mining data-analysis word2vec
Finding 2 & 3 word Phrases Using R TM Package

I am trying to find code that actually works to find the most frequently used two- and three-word …

r data-mining text-mining
Finding ngrams in R and comparing ngrams across corpora

I'm getting started with the tm package in R, so please bear with me and apologies for the big ol' …

r text-mining n-gram tm
Count the number of elements in a string separated by comma

I am dealing with text strings such as the following: LN1 2DW, DN21 5BJ, DN21 5BL, ... In Python, how can …

python text text-mining comma
Save and reuse TfidfVectorizer in scikit learn

I am using TfidfVectorizer in scikit-learn to create a matrix from text data. Now I need to save this …

python nlp scikit-learn pickle text-mining
Data sets for emotion detection in text

I'm implementing a system that can detect human emotion in text. Are there any manually annotated data sets available …

database dataset nlp text-mining emotion
How do I remove verbs, prepositions, conjunctions etc from my text?

Basically in my text I just want to keep nouns and remove other parts of speech. I do not think …

python r text-mining