Questions tagged [stream-processing]

272 questions
226
votes
21 answers

How do I read the contents of a Node.js stream into a string variable?

How do I collect all the data from a Node.js stream into a string?
obrienmd
  • 2,575
  • 3
  • 17
  • 8
49
votes
5 answers

Akka Stream Kafka vs Kafka Streams

I am currently working with Akka Stream Kafka to interact with kafka and I was wonderings what were the differences with Kafka Streams. I know that the Akka based approach implements the reactive specifications and handles back-pressure,…
26
votes
3 answers

What is the difference between mini-batch vs real time streaming in practice (not theory)?

What is the difference between mini-batch vs real time streaming in practice (not theory)? In theory, I understand mini batch is something that batches in the given time frame whereas real time streaming is more like do something as the data arrives…
23
votes
3 answers

Does Kafka python API support stream processing?

I have used Kafka Streams in Java. I could not find similar API in python. Do Apache Kafka support stream processing in python?
17
votes
1 answer

What is the differences between Apache Spark and Apache Apex?

Apache Apex - is an open source enterprise grade unified stream and batch processing platform. It is used in GE Predix platform for IOT. What are the key differences between these 2 platforms? Questions From a data science perspective, how is it…
16
votes
2 answers

Real-world analog to TIS-100

The recent game TIS-100 is centered around a rather interesting machine architecture, where the CPU consists of "nodes" which can communicate to their adjacent neighbours. I unfortunately cannot find an official link to the reference manual for…
12
votes
2 answers

Apache Beam over Apache Kafka Stream processing

What are the differences between Apache Beam and Apache Kafka with respect to Stream processing? I am trying to grasp the technical and programmatic differences as well. Please help me understand by reporting from your experience.
Stella
  • 1,728
  • 5
  • 41
  • 95
12
votes
1 answer

akka stream asyncBoundary vs mapAsync

I am trying to understand the difference between asyncBoundary and mapAsync. From the glance, I guess they should be same. However, when I run the code, it looks like that the performance of asyncBoundary is quicker than mapAsync Here is the…
Xiaohe Dong
  • 4,953
  • 6
  • 24
  • 53
10
votes
2 answers

How to programmatically kill a Apache Storm topology?

I am using a Java class to submit a topology to a Storm cluster and I also plan to use a Java class to kill the topology. But as per storm documentation, the following command is used to kill a topology and there is no Java method (and this has…
mbgsuirp
  • 618
  • 1
  • 8
  • 20
9
votes
2 answers

How Apache Apex is different from Apache Storm?

Apache Apex looks similar to Apache Storm. Users build application/topology as Directed Acyclic Graph (DAG) on both platforms. Apex uses operators/streams and Storm uses spouts/streams/bolts. They both process data in real time as opposed to…
PradeepKumbhar
  • 3,361
  • 1
  • 18
  • 31
8
votes
1 answer

An ArrowCircuit instance for stream processors which could block

The Control.Arrow.Operations.ArrowCircuit class is for: An arrow type that can be used to interpret synchronous circuits. I want to know what synchronous means here. I looked it up on Wikipedia, where they are speaking of digital electronics. My…
8
votes
2 answers

Server CPU and GPU With LAMP

I am trying to figure out more about the hardware that can be utilized when running a php application or even a c++ compiled php app using HipHop. I would like to setup a microserver and use the GPU to help the CPU process requests... Anyone?
GregL83
  • 615
  • 2
  • 7
  • 15
8
votes
4 answers

Synchronize Data From Multiple Data Sources

Our team is trying to build a predictive maintenance system whose task is to look at a set of events and predict whether these events depict a set of known anomalies or not. We are at the design phase and the current system design is as…
8
votes
2 answers

Understanding Kafka stream groupBy and window

I am not able to understand the concept of groupBy/groupById and windowing in kafka streaming. My goal is to aggregate stream data over some time period (e.g. 5 seconds). My streaming data looks something…
x64
  • 332
  • 1
  • 4
  • 13
8
votes
1 answer

Why does Apache Flink need Watermarks for Event Time Processing?

Can someone explain Event timestamp and watermark properly. I understood it from docs, but it is not so clear. A real life example or layman definition will help. Also, if it is possible give an example ( Along with some code snippet which can…
Kspace
  • 225
  • 4
  • 18
1
2 3
18 19