Top "Cluster-analysis" questions

Cluster analysis is the process of grouping "similar" objects into groups known as "clusters", along with the analysis of these results.
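
As a rough illustration of what "grouping similar objects" can mean, here is a minimal sketch that clusters one-dimensional values by proximity. The function name `cluster_by_gap` and the `max_gap` threshold are hypothetical, chosen only for this example; it is not the method used in any of the questions listed below, and real data usually calls for an established algorithm such as k-means or hierarchical clustering.

```python
def cluster_by_gap(values, max_gap):
    """Group 1-D values into clusters.

    Assumption for this sketch: two values are "similar" when the gap
    between neighbouring sorted values is at most max_gap.
    """
    clusters = []
    for v in sorted(values):
        # Extend the current cluster if v is close to its last member,
        # otherwise start a new cluster.
        if clusters and v - clusters[-1][-1] <= max_gap:
            clusters[-1].append(v)
        else:
            clusters.append([v])
    return clusters


scores = [1.0, 1.2, 0.9, 5.1, 5.3, 9.8]
groups = cluster_by_gap(scores, max_gap=1.0)
print(groups)
```

The choice of `max_gap` determines how many clusters come out, which is the kind of parameter decision the "analysis of these results" part of the definition refers to.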

How does clustering (especially String clustering) work?

I heard about clustering to group similar data. I want to know how it works in the specific case for …

string cluster-analysis data-mining
How to identify Cluster labels in kmeans scikit learn

I am learning Python scikit. The example given here displays the top-occurring words in each cluster and not cluster …

python machine-learning scikit-learn cluster-analysis k-means
R - 'princomp' can only be used with more units than variables

I am using R software (R commander) to cluster my data. I have a smaller subset of my data containing 200 …

r cluster-analysis k-means pca r-commander
kmeans: Quick-TRANSfer stage steps exceeded maximum

I am running k-means clustering in R on a dataset with 636,688 rows and 7 columns using the standard stats package: kmeans(…

r cluster-analysis k-means
Java Clustering Library

I am looking for a lightweight clustering library in Java. I don't need 100s of clustering algorithms in that …

java math cluster-analysis
Clustering values by their proximity in python (machine learning?)

I have an algorithm that is running on a set of objects. This algorithm produces a score value that dictates …

python machine-learning cluster-analysis data-mining
Understanding concept of Gaussian Mixture Models

I'm trying to understand GMM by reading the sources available online. I have achieved clustering using K-Means and was seeing …

matlab machine-learning classification cluster-analysis mixture-model
Text clustering with Levenshtein distances

I have a set (2k - 4k) of small strings (3-6 characters) and I want to cluster them. Since I …

r matlab cluster-analysis levenshtein-distance hierarchical-clustering
sklearn agglomerative clustering linkage matrix

I'm trying to draw a complete-link scipy.cluster.hierarchy.dendrogram, and I found that scipy.cluster.hierarchy.linkage is slower …

python scikit-learn cluster-analysis dendrogram
How to perform clustering without removing rows where NA is present in R

I have data that contains some NA values. What I want to do is to perform …

r cluster-analysis bioconductor