Top "Document-classification" questions

Document classification is the act of assigning documents from a given set of documents to any of a number of classes, where those classes are known a priori.

scikit-learn TfidfVectorizer meaning?

I was reading about TfidfVectorizer implementation of scikit-learn, i don´t understand what´s the output of the method, for …

machine-learning nlp scikit-learn feature-extraction document-classification
Basic text classification with Weka in Java

Im trying to build a text classifier in JAVA with Weka. I have read some tutorials, and I´m trying …

java classification weka document-classification
Text classification/categorization algorithm

My objective is to [semi]automatically assign texts to different categories. There's a set of user defined categories and a …

algorithm text-mining document-classification
Supervised Latent Dirichlet Allocation for Document Classification?

I have a bunch of already human-classified documents in some groups. Is there a modified version of lda which I …

machine-learning nlp classification document-classification lda
How to calculate TF*IDF for a single new document to be classified?

I am using document-term vectors to represent a collection of document. I use TF*IDF to calculate the term weight …

machine-learning classification information-retrieval text-mining document-classification
How can i classify text documents with using SVM and KNN

Almost all of the examples are based on numbers. In text documents i have words instead of numbers. So can …

svm knn document-classification text-classification
How to change attribute type to String (WEKA - CSV to ARFF)

I'm trying to make an SMS SPAM classifier using the WEKA library. I have a CSV file with "label" and "…

java csv weka document-classification arff