I am new to R Programming.
I am using below code after setting all env variables. All connections are working well but I am getting an error in executing below of to.dfs.
Please guide me.
Sys.setenv("HADOOP_CMD"="/usr/local/hadoop/bin/hadoop")
Sys.setenv("HADOOP_CONF"="/usr/local/hadoop/conf")
Sys.setenv("HADOOP_STREAMING"="/usr/local/hadoop/share/hadoop/tool/lib/hadoop-streaming-2.6.0.jar")
library(rhdfs)
hdfs.init()
library(rmr2)
library(plyrmr)
hdfs.ls('/user/hduser/')
sample <- 1:500
small.ints <- to.dfs(sample)
I am getting the error:
Not a valid JAR: /usr/local/hadoop/share sh: 2: hadoop/tools/lib/hadoop-streaming-2.6.0.jar: not found