Questions tagged [cascading]

Cascading is a Query API, Query Planner, and Process Scheduler used for defining and executing complex, scale-free, and fault tolerant data processing workflows on a Hadoop cluster.

Cascading is a Query API, Query Planner, and Process Scheduler used for defining and executing complex, scale-free, and fault tolerant data processing workflows on a Hadoop cluster.

Cascading is a thin Java library that sits on top of Hadoop's MapReduce layer and is executed from the command line like any other Hadoop application. It is not a new text based query syntax (like Pig) or another complex system that must be installed on a cluster and maintained (like Hive). Though Cascading is both complimentary to and is a valid alternative to either application.

Cascading lets the developer quickly assemble complex distributed data-processing applications without having to "think" in MapReduce. And to efficiently schedule them based on their dependencies. Obviously simple data processing applications are supported as well, as complex applications tend to start simple.

Cascading is Open Source and dual licensed under the GPL and OEM/Commercial Licenses. OEM/Commercial Licenses and Developer Support can be obtained through Concurrent, Inc.

Cascading has a strong community of users and contributors, see our Cascading modules page for related projects and extensions.

Cascading, extensions, and related libraries are also hosted in the Conjars maven repository maintained by Concurrent, Inc. The repository is open to the public.

Cascading application-stack overview: enter image description here

Links:

364 questions
42
votes
5 answers

Cascading examples failed to compile?

In shell I typed gradle cleanJar in the Impatient/part1 directory. The output is below. The error is "class file for org.apache.hadoop.mapred.JobConf not found". Why did it fail to compile? :clean UP-TO-DATE :compileJava Download…
Treper
  • 3,539
  • 2
  • 26
  • 48
41
votes
4 answers

Populating cascading dropdown lists in JSP/Servlet

Suppose I am having three dropdownlist controls named dd1, dd2 and dd3. The value of each dropdownlist comes from database. dd3's value depends upon value of dd2 and dd2's value depends on value of dd1. Can anyone tell me how do I call servlet for…
deven
  • 805
  • 3
  • 10
  • 10
34
votes
3 answers

Introducing FOREIGN KEY constraint may cause cycles or multiple cascade paths

I am getting this error Introducing FOREIGN KEY constraint 'FK_dbo.Regions_dbo.Countries_CountryId' on table 'Regions' may cause cycles or multiple cascade paths. Specify ON DELETE NO ACTION or ON UPDATE NO ACTION, or modify other FOREIGN KEY…
chobo2
  • 83,322
  • 195
  • 530
  • 832
19
votes
8 answers

does anyone find Cascading for Hadoop Map Reduce useful?

I've been trying Cascading, but I cannot see any advantage over the classic map reduce approach for writing jobs. Map Reduce jobs gives me more freedom and Cascading seems to be putting a lot of obstacles. Might make a good job for making simple…
Federico
  • 5,438
  • 5
  • 39
  • 47
18
votes
3 answers

What is the most mature library for building a Data Analytics Pipeline in Java/Scala for Hadoop?

I found many options recently, and interesting in their comparisons primarely by maturity and stability. Crunch - https://github.com/cloudera/crunch Scrunch - https://github.com/cloudera/crunch/tree/master/scrunch Cascading -…
yura
  • 14,489
  • 21
  • 77
  • 126
15
votes
4 answers

(doctrine2 + symfony2) cascading remove : integrity constraint violation 1451

First, sorry for my poor English... I got four entities : User, Application, Bundle & Entity. Here are their relations (with cascading persist & remove, see code below) : User 1-n Application Application 1-n Bundle Bundle 1-n Entity It's working…
bgaze
  • 980
  • 1
  • 8
  • 24
12
votes
2 answers

Gradle - Could not find method mavenRepo() for arguments

I've cloned the Cascading Pattern project, installed Gradle 2.0 - run and the commend gradle eclipse --stacktrace and I get the following - what is the root cause? J-MacBook-Pro:pattern J$ gradle eclipse --stacktrace FAILURE: Build failed with an…
hawkeye
  • 34,745
  • 30
  • 150
  • 304
10
votes
1 answer

Build custom join logic in Cascading ensuring MAP_SIDE only

I have 3 cascading pipes (one to join against other two) described as follows, LHSPipe - (larger size) RHSPipes - (smaller size that could possibly fit to memory) Psuedocode as follows, This example involves two joins IF F1DecidingFactor =…
Viswa
  • 1,357
  • 3
  • 18
  • 30
10
votes
2 answers

Hibernate @ManyToOne only works with CascadeType.ALL

I am using Hibernate 3.3.1 and i would like to create a relation between persons and an assigned company. They should be loosely coupled, but i would like to arrange to create a company via cascade and not explicitly calling…
Shannon
  • 731
  • 3
  • 12
  • 26
10
votes
4 answers

Make multiple dependent / cascading selection components in JSF

I am trying to make 4 dependent / cascading selection components. In this question, the selection component happens to be a , but this is of course applicable on any other kind of selection component extending from…
Shadooo Medo
  • 101
  • 1
  • 4
8
votes
1 answer

Apache Spark or Cascading framework?

I am confused as to when to use the Cascading framework and when to use Apache Spark. What are suitable use cases for each one? Any help is appreciated.
progrrammer
  • 4,475
  • 2
  • 30
  • 38
8
votes
3 answers

how to prevent hadoop job to fail on corrupted input file

I'm running hadoop job on many input files. But if one of the files is corrupted the whole job is fails. How can I make the job to ignore the corrupted file? maybe write for me some counter/error log but not fail the whole job
Julias
  • 5,752
  • 17
  • 59
  • 84
7
votes
1 answer

Why this gradle build script is not compiling java class?

I am trying to use Cascading in my Hadoop project. I am trying to implement first example given in Enterprise Data Workflows with Cascading book. I have written java class which contains Cascading related code and I have another build.graddle file…
Shekhar
  • 11,438
  • 36
  • 130
  • 186
7
votes
2 answers

Workflow tool comaparison: Oozie Vs Cascading

I am looking for a workflow tool to run complex map-reduce jobs. I have Oozie in mind but also want to explore Cascading. Is there any sample code or example that chains existing M/R jobs using cascading API? Also, can you provide the comparison…
6
votes
2 answers

Cascading: Java tutorial for the Impatient, Part1 : Gradle build fails

When building part1 of the tutorial on CentOS 6.5,java version "1.7.0_51" I get Gradle error: You can't change configuration 'providedCompile' because it is already resolved! [localhost part1]$ gradle clean jar FAILURE: Build failed with an…
DarqMoth
  • 603
  • 1
  • 13
  • 31
1
2 3
24 25