Top "Mahout" questions

Apache Mahout open source scalable machine learning project

Large scale machine learning - Python or Java?

I am currently embarking on a project that will involve crawling and processing huge amounts of data (hundreds of gigs), …

java python machine-learning nltk mahout
Mahout: CSV to vector and running the program

I'm analysing the k-means algorithm with Mahout. I'm going to run some tests, observe performance, and do some statistics with …

java apache hadoop mahout
Java's Mahout equivalent in Python

Java based Mahout's goal is to build scalable machine learning libraries. Are there any equivalent libraries in Python ?

java python scalability machine-learning mahout
How to directly send the output of a mapper-reducer to a another mapper-reducer without saving the output into the hdfs

Problem Solved Eventually check my solution in the bottom Recently I am trying to run the recommender example in the …

hadoop mahout
Using mahout and hadoop

I am a newbie trying to understand how will mahout and hadoop be used for collaborative filtering. I m having …

hadoop cassandra mahout
How to perform k-means clustering in mahout with vector data stored as CSV?

I have a file containing vectors of data, where each row contains a comma-separated list of values. I am wondering …

mahout k-means
Using the Apache Mahout machine learning libraries

I've been working with the Apache Mahout machine learning libaries in my free time a bit over the past few …

apache machine-learning hadoop mahout
Converting CSV to SequenceFile

I have a CSV file which I would like to convert to a SequenceFile, which I would ultimately use to …

hadoop mahout sequencefile
mahout lucene document clustering howto?

I'm reading that i can create mahout vectors from a lucene index that can be used to apply the mahout …

indexing lucene cluster-analysis k-means mahout
How to do an item based recommendation in spark mllib?

In Mahout, there is support for item based recommendation using API method: ItemBasedRecommender.mostSimilarItems(int productid, int maxResults, Rescorer rescorer) …

apache-spark mahout recommendation-engine apache-spark-mllib