I am trying to integrate Hadoop with R, but I get this error:
packages ‘rmr’, ‘rJava’, ‘RJSONIO’, ‘rhdfs’, ‘rhbase’, ‘plyrmr’ are not available (for R version 3.1.3)
Steps to integrate Hadoop with R:
Installed R and Hadoop on Ubuntu.
Added these three lines to the ~/.bashrc file:
export HADOOP_PREFIX=/Users/hadoop/hadoop-1.1.2
export HADOOP_CMD=/Users/hadoop/hadoop-1.1.2/bin/hadoop
export HADOOP_STREAMING=/Users/hadoop/hadoop-1.1.2/contrib/streaming/hadoop-streaming-1.1.2.jar
Installed the R packages using this command:
install.packages(c("rJava", "RJSONIO", "rmr", "rhdfs", "rhbase", "plyrmr"))
But I got the error above. What is the main problem, and how do I integrate R and Hadoop? I followed this link to integrate them.
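The RHadoop packages are not on CRAN, so install.packages() cannot find them by name. Download the rhdfs, rhbase, rmr2 and plyrmr packages from https://github.com/RevolutionAnalytics/RHadoop/wiki and install them from the downloaded files as shown below: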
install.packages("<path>/rhdfs_1.0.8.tar.gz", repos=NULL, type="source")
install.packages("<path>/rmr2_2.2.2.tar.gz", repos=NULL, type="source")
install.packages("<path>/plyrmr_0.2.0.tar.gz", repos=NULL, type="source")
install.packages("<path>/rhbase_1.2.0.tar.gz", repos=NULL, type="source")
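Before loading the packages, it may also be worth confirming that the Hadoop variables from ~/.bashrc are actually visible in your shell and point at real files, since rmr2 and rhdfs rely on HADOOP_CMD (and rmr2 on HADOOP_STREAMING). The following is only a minimal sketch: check_hadoop_var is a hypothetical helper name, not something shipped with Hadoop or RHadoop.

```shell
#!/bin/sh
# Hypothetical helper (not part of Hadoop or RHadoop): report whether the
# environment variable named by $1 is set and points at an existing path.
check_hadoop_var() {
    name="$1"
    # Look up the value of the variable whose name is stored in $name.
    eval "value=\${$name:-}"
    if [ -z "$value" ]; then
        echo "$name is not set"
        return 1
    elif [ ! -e "$value" ]; then
        echo "$name points to a missing path: $value"
        return 1
    fi
    echo "$name OK: $value"
}

# Check the two variables that the R packages read.
for var in HADOOP_CMD HADOOP_STREAMING; do
    check_hadoop_var "$var" || true
done
```

If either variable is reported as not set, run source ~/.bashrc (or open a new terminal) and start R from that same shell so it inherits the values.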