Questions tagged [apache-apex]

Apex is a YARN-native platform that unifies stream and batch processing. It processes big data in-motion in a way that is highly scalable, highly performant, fault tolerant, stateful, secure, distributed, and easily operable.

Apache Apex is an open source platform for unified stream and batch processing. It runs natively on YARN. Apache Apex itself is divided into two core modules

Developing applications on Apex

To start with, documentation is available at: http://apex.apache.org/docs.html

51 questions
17 votes, 1 answer

What are the differences between Apache Spark and Apache Apex?

Apache Apex is an open source, enterprise-grade unified stream and batch processing platform. It is used in the GE Predix platform for IoT. What are the key differences between these two platforms? Questions: From a data science perspective, how is it…
9 votes, 2 answers

How is Apache Apex different from Apache Storm?

Apache Apex looks similar to Apache Storm. On both platforms, users build an application/topology as a directed acyclic graph (DAG). Apex uses operators/streams and Storm uses spouts/streams/bolts. They both process data in real time as opposed to…
PradeepKumbhar
7 votes, 1 answer

Apache Apex vs Apache Flink

Both are streaming frameworks that process one event at a time. What are the core architectural differences between these two technologies/streaming frameworks? Also, what are some particular use cases where one is more appropriate than the other?
Biplob Biswas
5 votes, 2 answers

Hadoop tools for moving files from local file system to HDFS

I am doing a POC on ways to import data from a shared network drive to HDFS. Data would be in different folders on the shared drive, and each folder would correspond to a different directory on HDFS. I looked at some popular tools that do this but most…
imgr8
4 votes, 2 answers

Apache Apex - Kafka 0.9 secure Kafka topics

Can someone please help me understand: does Apache Apex have a built-in Kafka input operator to read from secure Kafka 0.9 topics?
Raja
3 votes, 0 answers

CouchbasePOJOSetOperator is causing startup problems when running Apache Apex app

I have a basic Apex stream with a Kafka input operator feeding into a Couchbase set operator. The Kafka side works fine, which I have proven by removing the Couchbase operator. I have tried switching around versions of the Malhar library to see if it…
Mark Kouba
3 votes, 2 answers

How to calculate network latency between operators in Apache Apex

Is there a way to predict the network latency between two different operators in Apache Apex? In a situation where the data that is going to be used in an operator resides on a different machine altogether, this will add to the…
frewper
3 votes, 2 answers

Not able to launch Apache Apex CLI

I am trying to launch the Apache Apex CLI but am not able to do so. The documentation says to just type apex at the command prompt, but that says: No command 'apex' found. Are there some preliminary steps that I need to perform before I start using it? I have all prerequisites…
UserASR
3 votes, 1 answer

DataTorrent/Apache Apex with sbt

I have been working on DataTorrent/Apex for a while. Can sbt be used to create DataTorrent (Java) applications? Running sbt clean assembly in the root directory of the DataTorrent application reported Build Success. But only the .jar…
New Coder
3 votes, 1 answer

How does Apache Apex handle back pressure?

Repost from users@apex.incubator.apache.org: Apex utilizes a buffer server for back pressure. How does the buffer server survive application crashes? What if the buffer server itself dies? Will Apex guarantee that the downstream operator will…
user6147934
3 votes, 1 answer

How to unit test Kafka 0.9 operator with Apache Apex?

Repost from users@apex.incubator.apache.org: I would like to run unit test code using the new Kafka operator that supports the 0.9 version protocol. In this process, I included the Malhar-Kafka library version (3.3.1-incubating) and am using the…
user6147934
3 votes, 1 answer

How do you create a DAG with JSON in Apache Apex?

I've been trying to find the documentation for populating a DAG instance using JSON. Is there formal documentation of the format somewhere?
user6147934
3 votes, 2 answers

Understanding DataTorrent with example

I am supposed to work on DataTorrent and am looking for articles/documentation to go through. I could not find detailed documentation on what operators are, how they are used for processing our data, and about the Malhar library, which is being used in…
Atom
2 votes, 1 answer

Adding an input operator dynamically to a running Apache Apex application

Is it possible to add an input operator for a different source to a running Apex application? For example: in a production environment, I am running an Apex application that reads a text file from an input source, and I want to add a Kafka source with its…
2 votes, 2 answers

Trying to run Apache Apex's Yahoo Finance example on YARN

I've downloaded Apache Apex 3.5.0 along with Malhar 3.5.0. I've successfully started the apex client and submitted the Yahoo Finance demo example to our YARN cluster (running CDH 5.10). The cluster is running and configured properly (many Spark and…
Tony