Top "Data-mining" questions

Data mining is the process of analyzing large amounts of data in order to find patterns and commonalities.

How can I find the center of a cluster of data points?

Let's say I plotted the position of a helicopter every day for the past year and came up with the …

algorithm geocoding cluster-analysis data-mining markerclusterer
Best clustering algorithm? (simply explained)

Imagine the following problem: You have a database containing about 20,000 texts in a table called "articles". You want to connect …

algorithm text cluster-analysis data-mining text-mining
How to choose initial centroids for k-means clustering

I am working on implementing k-means clustering in Python. What is a good way to choose initial centroids for a …

python cluster-analysis data-mining k-means centroid
What is the meaning of jitter in the Visualize tab of Weka?

In Weka I load an ARFF file. I can view the relationship between attributes using the Visualize tab. However I …

java data-mining weka arff
Ways to calculate similarity

I am building a community website that requires me to calculate the similarity between any two users. Each user is …

statistics social-networking data-mining pattern-recognition similarity
What is evaluation of a cluster in WEKA?

What do we mean when we say that we are evaluating the clusters in the WEKA framework? Clustering is an unsupervised …

java machine-learning data-mining weka
How would you group/cluster these three areas in arrays in Python?

So you have an array: 1 2 3 60 70 80 100 220 230 250. For a better understanding: How would you group/cluster the three areas in arrays in …

python cluster-analysis data-mining pattern-recognition
Difference between closed and open sequential pattern mining algorithms

I want to use some algorithms to mine my log data. I found a pattern mining framework on: http://www.…

pattern-matching data-mining sequential apriori
Historical weather data from NOAA

I am working on a data mining project and I would like to gather historical weather data. I am able …

web-scraping data-mining weather-api
What FFT descriptors should be used as features to implement a classification or clustering algorithm?

I have some sampled geographical trajectories to analyze, and I calculated the histogram of the data in the spatial and temporal dimensions, …

machine-learning fft data-mining similarity feature-extraction