Questions tagged [stream-processing]
272 questions
226
votes
21 answers
How do I read the contents of a Node.js stream into a string variable?
How do I collect all the data from a Node.js stream into a string?

obrienmd
- 2,575
- 3
- 17
- 8
49
votes
5 answers
Akka Stream Kafka vs Kafka Streams
I am currently working with Akka Stream Kafka to interact with Kafka, and I was wondering how it differs from Kafka Streams.
I know that the Akka based approach implements the reactive specifications and handles back-pressure,…

nsanglar
- 1,632
- 1
- 14
- 24
26
votes
3 answers
What is the difference between mini-batch and real-time streaming in practice (not theory)?
In theory, I understand that mini-batch processing groups data within a given time frame, whereas real-time streaming acts on data as it arrives…

user1870400
- 6,028
- 13
- 54
- 115
23
votes
3 answers
Does the Kafka Python API support stream processing?
I have used Kafka Streams in Java. I could not find a similar API in Python.
Does Apache Kafka support stream processing in Python?

user3126637
- 333
- 1
- 2
- 5
17
votes
1 answer
What are the differences between Apache Spark and Apache Apex?
Apache Apex is an open-source, enterprise-grade unified stream and batch processing platform. It is used in the GE Predix platform for IoT.
What are the key differences between these two platforms?
Questions
From a data science perspective, how is it…

GeorgeOfTheRF
- 8,244
- 23
- 57
- 80
16
votes
2 answers
Real-world analog to TIS-100
The recent game TIS-100 is centered around a rather interesting machine architecture, where the CPU consists of "nodes" which can communicate with their adjacent neighbours. I unfortunately cannot find an official link to the reference manual for…

Matthew Sainsbury
- 1,470
- 3
- 18
- 42
12
votes
2 answers
Apache Beam over Apache Kafka Stream processing
What are the differences between Apache Beam and Apache Kafka with respect to stream processing?
I am trying to grasp the technical and programmatic differences as well.
Please help me understand by sharing your experience.

Stella
- 1,728
- 5
- 41
- 95
12
votes
1 answer
akka stream asyncBoundary vs mapAsync
I am trying to understand the difference between asyncBoundary and mapAsync. At a glance, I assumed they would behave the same. However, when I run the code, asyncBoundary appears to be faster than mapAsync.
Here is the…

Xiaohe Dong
- 4,953
- 6
- 24
- 53
10
votes
2 answers
How to programmatically kill an Apache Storm topology?
I am using a Java class to submit a topology to a Storm cluster and I also plan to use a Java class to kill the topology. But according to the Storm documentation, the following command is used to kill a topology and there is no Java method (and this has…

mbgsuirp
- 618
- 1
- 8
- 20
9
votes
2 answers
How is Apache Apex different from Apache Storm?
Apache Apex looks similar to Apache Storm.
Users build an application/topology as a directed acyclic graph (DAG) on both platforms. Apex uses operators/streams and Storm uses spouts/streams/bolts.
They both process data in real time as opposed to…

PradeepKumbhar
- 3,361
- 1
- 18
- 31
8
votes
1 answer
An ArrowCircuit instance for stream processors which could block
The Control.Arrow.Operations.ArrowCircuit class is for:
An arrow type that can be used to interpret synchronous circuits.
I want to know what synchronous means here. I looked it up on Wikipedia, which discusses it in terms of digital electronics. My…

Zhiltsoff Igor
- 1,812
- 8
- 24
8
votes
2 answers
Server CPU and GPU With LAMP
I am trying to figure out more about the hardware that can be utilized when running a PHP application, or even a PHP app compiled to C++ using HipHop. I would like to set up a microserver and use the GPU to help the CPU process requests...
Anyone?

GregL83
- 615
- 2
- 7
- 15
8
votes
4 answers
Synchronize Data From Multiple Data Sources
Our team is trying to build a predictive maintenance system whose task is to look at a set of events and predict whether these events depict a set of known anomalies or not.
We are at the design phase and the current system design is as…

sgarizvi
- 16,623
- 9
- 64
- 98
8
votes
2 answers
Understanding Kafka stream groupBy and window
I am not able to understand the concepts of groupBy/groupById and windowing in Kafka streaming. My goal is to aggregate stream data over some time period (e.g. 5 seconds). My streaming data looks something…

x64
- 332
- 1
- 4
- 13
8
votes
1 answer
Why does Apache Flink need Watermarks for Event Time Processing?
Can someone explain event timestamps and watermarks properly? I understood them from the docs, but it is not so clear. A real-life example or layman's definition would help. Also, if possible, give an example (along with a code snippet which can…

Kspace
- 225
- 4
- 18