Questions tagged [hadoop-plugins]

Plugins for the Apache™ Hadoop™ project.

141 questions
21
votes
11 answers

DataNode is Not Starting in singlenode hadoop 2.6.0

I installed Hadoop 2.6.0 on my laptop running Ubuntu 14.04 LTS. I successfully started the Hadoop daemons by running start-all.sh and ran a WordCount example successfully. Then I tried to run a jar example that didn't work for me, so I decided to…
Firas M. Awaysheh
  • 211
  • 1
  • 2
  • 3
8
votes
2 answers

package org.apache.hadoop.conf does not exist after setting classpath

I am a beginner in Hadoop, using the Hadoop Beginner's Guide book as a tutorial. I am using Mac OS X 10.9.2 and Hadoop version 1.2.1. I have set all the appropriate classpaths. When I call echo $PATH in the terminal, here is the result I…
user3287477
  • 87
  • 1
  • 1
  • 6
7
votes
4 answers

Chaining multiple mapreduce tasks in Hadoop streaming

I am in a scenario where I have two MapReduce jobs. I am more comfortable with Python and plan to use it to write the MapReduce scripts, running them with Hadoop streaming. Is there a convenient way to chain both jobs in the following form when hadoop…
7
votes
1 answer

InvalidRequestException(why:empid cannot be restricted by more than one relation if it includes an Equal)

This is regarding an issue I am facing while querying Cassandra from Apache Spark. A normal query from Spark works fine without any issues; however, when I query with a condition on the key, I get the error below. Initially I tried querying…
user2081818
  • 309
  • 2
  • 10
6
votes
2 answers

Hadoop Security

I am trying to learn how Kerberos can be implemented in Hadoop. I have gone through this doc ( https://issues.apache.org/jira/browse/HADOOP-4487 ) and have also gone through basic Kerberos material ( https://www.youtube.com/watch?v=KD2Q-2ToloE ). After…
5
votes
1 answer

Datanode failing in Hadoop on single Machine

I set up and configured a pseudo-node Hadoop environment on Ubuntu 12.04 LTS using the following tutorial: http://www.michael-noll.com/tutorials/running-hadoop-on-ubuntu-linux-multi-node-cluster/#formatting-the-hdfs-filesystem-via-the-namenode After typing…
Jaipal R
  • 81
  • 1
  • 8
4
votes
1 answer

Loading protobuf format file into pig script using loadfunc pig UDF

I have very little knowledge of Pig. I have a protobuf-format data file that I need to load into a Pig script, so I need to write a LoadFunc UDF to load it. Say the function is Protobufloader(). My Pig script would be A = LOAD 'abc_protobuf.dat'…
Reddy
  • 1,620
  • 6
  • 26
  • 33
4
votes
1 answer

Hadoop task schedulers: Capacity vs Fair sharing or something else?

Background: My employer is progressively shifting our resource-intensive ETL and backend processing logic from MySQL to Hadoop (DFS & Hive). At the moment everything is still somewhat small and manageable (20 TB over 10 nodes), but we intend to…
David
  • 17,673
  • 10
  • 68
  • 97
4
votes
1 answer

Error in addInputPath of MapReduce Driver

I am getting an error in the addInputPath method of my MapReduce Driver. The error is "The method addInputPath(Job, Path) in the type FileInputFormat is not applicable for the arguments (JobConf, Path)". Here is my code for the driver: package…
Arun Kumar
  • 101
  • 2
  • 10
4
votes
3 answers

Install mahout on ubuntu 12.04 - E: Unable to locate package mahout

How do I install Mahout on Ubuntu 12.04? sudo apt-get install mahout Reading package lists... Done Building dependency tree Reading state information... Done E: Unable to locate package…
Tampa
  • 75,446
  • 119
  • 278
  • 425
3
votes
4 answers

Is it possible to run several map task in one JVM?

I want to share large in-memory static data (a RAM Lucene index) among my map tasks in Hadoop. Is there a way for several map/reduce tasks to share the same JVM?
yura
  • 14,489
  • 21
  • 77
  • 126
3
votes
2 answers

Which setting to use in Spark to specify compression of `Output`?

So, Spark has the file spark-defaults.xml for specifying settings, including which compression codec is to be used and at what stage (RDD, Shuffle). Most of the settings can be set at the application level. EDITED: conf = SparkConf() …
nikk
  • 2,627
  • 5
  • 30
  • 51
3
votes
1 answer

how to work with glusterfs-hadoop plugin?

I installed GlusterFS and it works fine. After that I installed Hadoop 1.x, which works fine with HDFS, but when I use the glusterfs-hadoop plugin to make GlusterFS the filesystem backend for Hadoop, I get an error. I use the GitHub site for glusterfs-hadoop…
mahsa frj
  • 31
  • 1
3
votes
1 answer

Adding a mongo query when importing data with pig and mongodb

How would you attach a query when importing data using MongoLoader in Apache Pig? I can see in the mongo-hadoop wiki that there is a reference to "mongo.input.query", but it seems to relate to the standard MapReduce functionality and not Apache Pig.…
3
votes
2 answers

The import org.apache.hadoop.mapreduce cannot be resolved

I am trying to execute the code below: package test; import java.io.IOException; import java.util.*; import org.apache.hadoop.fs.Path; import org.apache.hadoop.conf.*; import org.apache.hadoop.io.*; import org.apache.hadoop.util.*; import…
Indrajit Swain
  • 1,505
  • 1
  • 15
  • 22