Questions tagged [cascading]

Cascading is a Query API, Query Planner, and Process Scheduler used for defining and executing complex, scale-free, and fault-tolerant data processing workflows on a Hadoop cluster.

Cascading is a thin Java library that sits on top of Hadoop's MapReduce layer and is executed from the command line like any other Hadoop application. It is not a new text-based query syntax (like Pig) or another complex system that must be installed on a cluster and maintained (like Hive), though Cascading is both complementary to and a valid alternative to either application.

Cascading lets the developer quickly assemble complex distributed data-processing applications without having to "think" in MapReduce, and efficiently schedule them based on their dependencies. Simple data-processing applications are supported as well, since complex applications tend to start simple.
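To make "schedule them based on their dependencies" concrete, here is a minimal plain-Java sketch of dependency-ordered scheduling using Kahn's topological sort. It is a conceptual illustration only and is not the Cascading API or its implementation; the class name `DependencyOrder`, the method `schedule`, and the step names are all hypothetical.

```java
import java.util.ArrayDeque;
import java.util.ArrayList;
import java.util.Collections;
import java.util.Deque;
import java.util.HashMap;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

/** Conceptual sketch only: this is NOT the Cascading API. */
final class DependencyOrder {

    /**
     * Returns step names ordered so that every step comes after all of the
     * steps it depends on (Kahn's topological sort).
     *
     * @param dependsOn maps a step name to the names of the steps it needs first
     * @throws IllegalArgumentException if the dependencies contain a cycle
     */
    static List<String> schedule(Map<String, List<String>> dependsOn) {
        // Number of not-yet-scheduled dependencies for each step.
        Map<String, Integer> unmet = new TreeMap<>();
        // For each step, the steps that are waiting on it.
        Map<String, List<String>> waiters = new HashMap<>();

        for (Map.Entry<String, List<String>> e : dependsOn.entrySet()) {
            unmet.merge(e.getKey(), e.getValue().size(), Integer::sum);
            for (String dep : e.getValue()) {
                unmet.putIfAbsent(dep, 0);
                waiters.computeIfAbsent(dep, k -> new ArrayList<>()).add(e.getKey());
            }
        }

        // Steps with no unmet dependencies can run immediately.
        Deque<String> ready = new ArrayDeque<>();
        for (Map.Entry<String, Integer> e : unmet.entrySet()) {
            if (e.getValue() == 0) {
                ready.add(e.getKey());
            }
        }

        List<String> order = new ArrayList<>();
        while (!ready.isEmpty()) {
            String step = ready.remove();
            order.add(step);
            // Finishing this step may unblock the steps waiting on it.
            for (String waiter : waiters.getOrDefault(step, Collections.emptyList())) {
                if (unmet.merge(waiter, -1, Integer::sum) == 0) {
                    ready.add(waiter);
                }
            }
        }

        if (order.size() != unmet.size()) {
            throw new IllegalArgumentException("dependency cycle detected");
        }
        return order;
    }
}
```

For example, if a hypothetical `report` step depends on `join`, `join` on `clean`, and `clean` on `import`, `schedule` returns `[import, clean, join, report]`.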

Cascading is Open Source and dual licensed under the GPL and OEM/Commercial Licenses. OEM/Commercial Licenses and Developer Support can be obtained through Concurrent, Inc.

Cascading has a strong community of users and contributors; see our Cascading modules page for related projects and extensions.

Cascading, extensions, and related libraries are also hosted in the Conjars Maven repository maintained by Concurrent, Inc. The repository is open to the public.

Cascading application-stack overview: [image]

Links:

Cascading Homepage

364 questions

5 answers

Cascading examples failed to compile?

In shell I typed gradle cleanJar in the Impatient/part1 directory. The output is below. The error is "class file for org.apache.hadoop.mapred.JobConf not found". Why did it fail to compile? :clean UP-TO-DATE :compileJava Download…

asked Sep 20 '12 at 10:53

Treper

3,539
2
26
48

4 answers

Populating cascading dropdown lists in JSP/Servlet

Suppose I have three dropdownlist controls named dd1, dd2 and dd3. The value of each dropdownlist comes from the database. dd3's value depends on the value of dd2, and dd2's value depends on the value of dd1. Can anyone tell me how do I call servlet for…

jsp servlets drop-down-menu cascading

asked Feb 15 '10 at 04:01

deven

3 answers

Introducing FOREIGN KEY constraint may cause cycles or multiple cascade paths

I am getting this error Introducing FOREIGN KEY constraint 'FK_dbo.Regions_dbo.Countries_CountryId' on table 'Regions' may cause cycles or multiple cascade paths. Specify ON DELETE NO ACTION or ON UPDATE NO ACTION, or modify other FOREIGN KEY…

entity-framework cascading

asked Oct 15 '13 at 04:28

chobo2

83,322
195
530
832

8 answers

Does anyone find Cascading for Hadoop MapReduce useful?

I've been trying Cascading, but I cannot see any advantage over the classic MapReduce approach for writing jobs. MapReduce jobs give me more freedom, and Cascading seems to put a lot of obstacles in the way. Might make a good job for making simple…

hadoop cascading

asked Sep 10 '10 at 00:20

Federico

5,438
5
39
47

3 answers

What is the most mature library for building a Data Analytics Pipeline in Java/Scala for Hadoop?

I found many options recently, and I am interested in comparing them, primarily by maturity and stability. Crunch - https://github.com/cloudera/crunch Scrunch - https://github.com/cloudera/crunch/tree/master/scrunch Cascading -…

scala hadoop cascading flume

asked Feb 24 '12 at 08:59

yura

14,489
21
77
126

4 answers

(doctrine2 + symfony2) cascading remove : integrity constraint violation 1451

First, sorry for my poor English... I got four entities : User, Application, Bundle & Entity. Here are their relations (with cascading persist & remove, see code below) : User 1-n Application Application 1-n Bundle Bundle 1-n Entity It's working…

symfony doctrine-orm cascading referential-integrity

asked Jan 10 '13 at 11:33

bgaze

2 answers

Gradle - Could not find method mavenRepo() for arguments

I've cloned the Cascading Pattern project, installed Gradle 2.0, and run the command gradle eclipse --stacktrace, and I get the following. What is the root cause? J-MacBook-Pro:pattern J$ gradle eclipse --stacktrace FAILURE: Build failed with an…

gradle cascading

asked Aug 19 '14 at 11:54

hawkeye

34,745
30
150
304

1 answer

Build custom join logic in Cascading ensuring MAP_SIDE only

I have 3 Cascading pipes (one to join against the other two), described as follows: LHSPipe - (larger size) RHSPipes - (smaller size that could possibly fit in memory). Pseudocode as follows. This example involves two joins IF F1DecidingFactor =…

java hadoop mapreduce cascading

asked Mar 22 '16 at 06:31

Viswa

1,357
3
18
30

2 answers

Hibernate @ManyToOne only works with CascadeType.ALL

I am using Hibernate 3.3.1 and I would like to create a relation between persons and an assigned company. They should be loosely coupled, but I would like to arrange to create a company via cascade and not explicitly calling…

hibernate many-to-one cascading

asked Aug 20 '13 at 21:54

Shannon

4 answers

Make multiple dependent / cascading selection components in JSF

I am trying to make 4 dependent / cascading selection components. In this question, the selection component happens to be a , but this is of course applicable on any other kind of selection component extending from…

jsf hierarchy selectonemenu cascading ajax-update

asked May 04 '13 at 19:24

Shadooo Medo

1 answer

Apache Spark or Cascading framework?

I am confused as to when to use the Cascading framework and when to use Apache Spark. What are suitable use cases for each one? Any help is appreciated.

java apache-spark cascading

asked Aug 11 '14 at 10:04

progrrammer

4,475
2
30
38

3 answers

How to prevent a Hadoop job from failing on a corrupted input file

I'm running a Hadoop job on many input files, but if one of the files is corrupted the whole job fails. How can I make the job ignore the corrupted file, perhaps writing a counter or error log entry instead of failing the whole job?

hadoop mapreduce cascading

asked Feb 17 '13 at 10:41

Julias

5,752
17
59
84

1 answer

Why is this Gradle build script not compiling the Java class?

I am trying to use Cascading in my Hadoop project. I am trying to implement the first example given in the Enterprise Data Workflows with Cascading book. I have written a Java class which contains Cascading-related code and I have another build.gradle file…

gradle cascading

asked Sep 11 '13 at 13:36

Shekhar

11,438
36
130
186

2 answers

Workflow tool comparison: Oozie vs. Cascading

I am looking for a workflow tool to run complex MapReduce jobs. I have Oozie in mind but also want to explore Cascading. Is there any sample code or example that chains existing M/R jobs using the Cascading API? Also, can you provide the comparison…

hadoop mapreduce cascading oozie

asked Jul 03 '12 at 18:36

user1499636

2 answers

Cascading: Java tutorial for the Impatient, Part1 : Gradle build fails

When building part1 of the tutorial on CentOS 6.5, java version "1.7.0_51", I get a Gradle error: You can't change configuration 'providedCompile' because it is already resolved! [localhost part1]$ gradle clean jar FAILURE: Build failed with an…

build.gradle cascading

asked Jul 02 '14 at 10:07

DarqMoth
