Top "Data-mining" questions

Data mining is the process of analyzing large amounts of data in order to find patterns and commonalities.

How does clustering (especially String clustering) work?

I heard about clustering to group similar data. I want to know how it works in the specific case for …

string cluster-analysis data-mining
RapidMiner: How to add a 'label' attribute to a dataset?

I want to apply a decision tree learning algorithm to a dataset I have imported from a CSV. The problem …

machine-learning data-mining decision-tree rapidminer
pandas pivot table rename columns

How to rename columns with multiple levels after pandas pivot operation? Here's some code to generate test data: import pandas …

python pandas pivot pivot-table data-mining
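One common way to handle the multi-level columns that a pandas pivot produces is to join each column tuple into a single flat name. The sketch below is only an illustration: the frame `df` and its column names (`id`, `kind`, `x`, `y`) are made up here, since the question's own test data is cut off in the excerpt.

```python
import pandas as pd

# Hypothetical test data; the original question's data is not shown in the excerpt.
df = pd.DataFrame({
    "id": [1, 1, 2, 2],
    "kind": ["a", "b", "a", "b"],
    "x": [10, 20, 30, 40],
    "y": [1.0, 2.0, 3.0, 4.0],
})

# Pivoting on two value columns yields MultiIndex columns such as ('x', 'a').
wide = df.pivot_table(index="id", columns="kind", values=["x", "y"])

# Flatten each (value, kind) tuple into a single name like 'x_a'.
wide.columns = ["_".join(col) for col in wide.columns]

# Turn the 'id' index back into an ordinary column.
wide = wide.reset_index()
print(wide)
```

Joining with an underscore is an arbitrary choice; any separator or a custom mapping works the same way, as long as every level value is a string.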
Clustering values by their proximity in python (machine learning?)

I have an algorithm that is running on a set of objects. This algorithm produces a score value that dictates …

python machine-learning cluster-analysis data-mining
Scikit-learn: How to run KMeans on a one-dimensional array?

I have an array of 13,876 values between 0 and 1. I would like to apply sklearn.cluster.KMeans to only this vector …

python scikit-learn data-mining k-means
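For the one-dimensional case, scikit-learn estimators expect input of shape (n_samples, n_features), so a flat vector is usually reshaped into a single column first. The following is a minimal sketch under assumptions: the random `values` array stands in for the asker's data, and `n_clusters=3` is an arbitrary choice that the excerpt does not specify.

```python
import numpy as np
from sklearn.cluster import KMeans

# Hypothetical stand-in for the 13,876 values between 0 and 1.
rng = np.random.default_rng(0)
values = rng.random(13876)

# KMeans needs a 2-D array: one row per sample, one column per feature.
X = values.reshape(-1, 1)

# n_clusters=3 is an assumption made for illustration only.
kmeans = KMeans(n_clusters=3, n_init=10, random_state=0)
labels = kmeans.fit_predict(X)

# One centroid per cluster, flattened back to a 1-D array.
centers = kmeans.cluster_centers_.ravel()
print(np.sort(centers))
```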
K-means without knowing the number of clusters?

I am attempting to apply k-means on a set of high-dimensional data points (about 50 dimensions) and was wondering if there …

python machine-learning data-mining k-means
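When the number of clusters is unknown, one frequently used heuristic is to fit k-means for several candidate values of k and compare a quality measure such as the silhouette score. This is a sketch of that heuristic, not a definitive answer to the question: the synthetic 50-dimensional data from `make_blobs` and the candidate range 2 to 8 are assumptions made for illustration.

```python
from sklearn.cluster import KMeans
from sklearn.datasets import make_blobs
from sklearn.metrics import silhouette_score

# Synthetic 50-dimensional data; a stand-in for the asker's data set.
X, _ = make_blobs(n_samples=500, n_features=50, centers=4, random_state=0)

# Fit k-means for each candidate k and record the mean silhouette score.
scores = {}
for k in range(2, 9):
    labels = KMeans(n_clusters=k, n_init=10, random_state=0).fit_predict(X)
    scores[k] = silhouette_score(X, labels)

# Pick the k with the highest score (higher means better-separated clusters).
best_k = max(scores, key=scores.get)
print(best_k, round(scores[best_k], 3))
```

The silhouette score is only one option; the elbow method on inertia or a density-based algorithm that does not need k up front are alternatives worth comparing on real data.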
Sentiment Analysis Java Library

I have some unlabeled microblogging posts and I want to create a sentiment analysis module. To do this I have …

java machine-learning data-mining text-mining sentiment-analysis
Can k-means clustering do classification?

I want to know whether the k-means clustering algorithm can do classification. If I have done a simple k-means clustering. …

algorithm cluster-analysis data-mining k-means
Python Implementation of OPTICS (Clustering) Algorithm

I'm looking for a decent implementation of the OPTICS algorithm in Python. I will use it to form density-based clusters …

python machine-learning cluster-analysis data-mining optics-algorithm
Mixing categorical and continuous data in Naive Bayes classifier using scikit-learn

I'm using scikit-learn in Python to develop a classification algorithm to predict the gender of certain customers. Amongst others, I …

python machine-learning data-mining classification scikit-learn