Top "Text-analysis" questions

Natural language processing (NLP) is a subfield of artificial intelligence that involves transforming or extracting useful information from natural language data.

How to find common phrases in a large body of text

I'm working on a project at the moment where I need to pick out the most common phrases in a …

data-structures graph data-mining text-analysis
How do I use sklearn CountVectorizer with both 'word' and 'char' analyzer? - python

How do I use sklearn CountVectorizer with both 'word' and 'char' analyzer? http://scikit-learn.org/stable/modules/generated/sklearn.feature_…

python machine-learning scikit-learn analyzer text-analysis
Clustering words into groups

This is a homework question. I have a huge document full of words. My challenge is to classify these words …

cluster-analysis k-means text-analysis
Check if a string is a possible abbreviation for a name

I'm trying to develop a Python algorithm to check if a string could be an abbreviation for another word. For …

python string-matching slug abbreviation text-analysis
Use brain.js neural network to do text analysis

I'm trying to do some text analysis to determine if a given string is... talking about politics. I'm thinking I …

neural-network text-analysis brain.js
Error when using the tm package's VCorpus in R

I am getting the error below while working with the tm package in R. library("tm") Loading required package: NLP …

r text-mining tm text-analysis
Stemming unstructured text in NLTK

I tried the regex stemmer, but I get hundreds of unrelated tokens. I'm just interested in the "play" stem. Here …

nltk tokenize text-analysis lemmatization
Create dfm step by step with quanteda

I want to analyze a big (n=500,000) corpus of documents. I am using quanteda in the expectation that it will be …

r text-analysis term-document-matrix quanteda