How to install RHadoop packages (Rmr, Rhdfs, Rhbase)?

Venu A Positive picture Venu A Positive · Apr 15, 2015 · Viewed 17.6k times · Source

Actually I am trying my level best to integrate with R, but I got this error.

packages ‘rmr’, ‘rJava‘, ‘RJSONIO‘, ‘rhdfs’, ‘rhbase’, ‘plyrmr’ are not available (for R version 3.1.3)

Steps to integrate Hadoop with R:

Installed R, and Hadoop in ubuntu.

Add these three lines in ~/.bashrc file.

*export HADOOP_PREFIX=/Users/hadoop/hadoop-1.1.2

export HADOOP_CMD=/Users/hadoop/hadoop-1.1.2/bin/hadoop

export HADOOP_STREAMING=/Users/hadoop/hadoop-1.1.2/contrib/streaming/hadoop-streaming-1.1.2.jar*

Installed R packages by using this command

install.packages(c("rJava", "RJSONIO", "rmr", "rhdfs", "rhbase", "plyrmr").

But i got above error. What is the main problem how to integrate R and Hadoop. I have followed this link to integrate.

Answer

Jinith picture Jinith · Nov 26, 2015

Download packages rhdfs, rhbase, rmr2 and plyrmr from https://github.com/RevolutionAnalytics/RHadoop/wiki and install them as below :

install.packages("<path>/rhdfs_1.0.8.tar.gz", repos=NULL, type="source")
install.packages("<path>/rmr2_2.2.2.tar.gz", repos=NULL, type="source")
install.packages("<path>plyrmr_0.2.0.tar.gz", repos=NULL, type="source")
install.packages("<path>/rhbase_1.2.0.tar.gz", repos=NULL, type="source")