Top "Data-mining" questions

Data mining is the process of analyzing large amounts of data in order to find patterns and commonalities.

how to determine the number of topics for LDA?

I am a freshman in LDA and I want to use it in my work. However, some problems appear. In …

nlp data-mining lda
clustering very large dataset in R

I have a dataset consisting of 70,000 numeric values representing distances ranging from 0 till 50, and I want to cluster these numbers; …

r machine-learning bigdata cluster-analysis data-mining
Using adaboost within R's caret package

I've been using the ada R package for a while, and more recently, caret. According to the documentation, caret's train() …

r machine-learning data-mining classification adaboost
Weka simple K-means clustering assignments

I have what feels like a simple problem, but I can't seem to find an answer. I'm pretty new to …

cluster-analysis data-mining weka k-means
WEKA Tutorials / Examples for a Newbie

In a follow-up to this answer I want to ask if any of you know any good (and more importantly …

machine-learning data-mining weka
In Rapidminer once I import a data set how do I change the type of a column?

I've imported a datset into Rapidminer 5 and one of the columns that was supposed to be nominal or polynomial was …

data-mining rapidminer
Hierarchical clustering of 1 million objects

Can anyone point me to a hierarchical clustering tool (preferable in python) that can cluster ~1 Million objects? I have tried …

python machine-learning cluster-analysis data-mining hierarchical-clustering
Data Mining open source tools

I'm due to take up a project which is into data mining. Before I jump in I wanted to probe …

open-source data-mining
Hierarchical Clustering: Determine optimal number of cluster and statistically describe Clusters

I could use some advice on methods in R to determine the optimal number of clusters and later on describe …

r data-mining cluster-analysis
What does the "Unexpected end of input" error mean?

I am currently building an application in R to calculate the QR matrix decomposition, the QR non negative matrix decomposition …

r data-mining qr-decomposition