Questions tagged [hazelcast-jet]

Hazelcast Jet is a distributed computing platform for low-latency stream and batch processing. It benefits from integration with Hazelcast IMDG providing fast, scalable Big Data processing that’s lightweight and embeddable.

Hazelcast Jet is a distributed computing platform for low-latency stream and batch processing. It benefits from integration with Hazelcast IMDG providing fast, scalable Big Data processing that’s lightweight and embeddable.

188 questions
11
votes
2 answers

What are the differences between Hazelcast Jet and Apache Flink

More specifically, what usecases does Hazelcast Jet solve that Flink does not solve (equally well) and vice versa?
Atle
  • 428
  • 3
  • 11
3
votes
2 answers

wordcount test shows slowness in Flink

i am doing some benchmark comparison between streaming processing frameworks, I selected WordCount such "Hello world" task (with some twists) in this area, and tested Flink and Hazelcast Jet so far, the result is Flink is taking 80+s to complete,…
3
votes
2 answers

Hazelcast external sorting.Sort records in a big csv

I have a use case where I need to sort a huge csv file say 10 million records, and write result to another file. Does hazelcast-jet / hazelcast provide any such external sort capability where I have limited RAM.
Rajesh
  • 153
  • 6
3
votes
1 answer

Hazlecast Jet Cluster Processes duplicates

I have deployed 3 spring boot apps with Hazelcast Jet embedded. The nodes recognize each other and run as a cluster. I have the following code: A simple reading from CSV and write to a file. But Jet writes duplicates to the file sink. To be precise,…
Rajesh
  • 153
  • 6
3
votes
2 answers

Hazelcast Jet connector for Apache Pulsar as source

I could not find a Hazlecast Jet source connector for Apache Pulsar. Have anybody tried this? Appreciate any directions, points, sources, considerations if I have to write a custom stream connector for Pulsar as source for Jet?
vvra
  • 2,832
  • 5
  • 38
  • 82
3
votes
1 answer

Processor with 2 incoming edges - When returning false on one edge, keep re-processing from same edge and never process new items on other edge

I'm asking for a confirmation to my assumption about the tryProcess() logic. In detail how the return value (true/false) influence the DAG workflow on a processor with 2 incoming edges with no priority specified. My assumption is that if the…
Lubbo
  • 1,030
  • 1
  • 10
  • 18
3
votes
0 answers

com.hazelcast.core.HazelcastException: Service with name 'hz:impl:jetService' not found

I am trying to execute a sample code provided for the Hazelcast Jet on https://blog.hazelcast.com/use-hazelcast-jet-stream-data-imap-kafka-topic/ But its failing with an exception for Service with name 'hz:impl:jetService' not found! Here's my…
Ranjan
  • 475
  • 1
  • 5
  • 18
3
votes
1 answer

How do I plug custom partition logic in WriteKafkaP?

I'm using the WriteKafaP vertex in a Jet job. Is there a way to plug in custom partition logic when publishing to the Kafka topic?
Shannon Ma
  • 45
  • 4
3
votes
1 answer

How much overhead is usual while distributing processing?

For impatient readers: this is a work in progress, where I ask for help, during the process. Please do not judge the tools by my temporary data, as they can change while I try to get better results. We are in the middle of the decision process…
3
votes
2 answers

Hazelcast Jet - Use Cases

What are the use-cases of Hazelcast Jet? Has anyone started using it? Our project uses Hazelcast for Distributed Map holding Key-Value pair and Distributed computing on those Keys to run the task at the node holding the Key. We use NearCache…
A.K.Desai
  • 1,274
  • 1
  • 10
  • 16
2
votes
2 answers

Hazelcast SQL interface slow performance HZ 4.2.2 vs HZ 5.0.2

Situation : We have a product with approx 30 attributes (String, Enum, Double) values We have iMap with indexes for all attributes IndexType.HASH for string value and IndexType.SORTED for double values. (900MB together) We have 300k products in…
2
votes
2 answers

Hazelcast Jet change data capture

I'm using Hazelcast Change data capture (CDC) in my application. (Reason I'm using CDC because if use jdbc or other alternative feature to load data into cache its taking to much of time). So CDC will have a data sync between database and…
vivek
  • 115
  • 2
  • 9
2
votes
1 answer

Hazelcast Change Data Capture with Postgres

I'm trying to use CDC for my Postgres Database. and I have created simple project using Hazelcast Docs example. https://jet-start.sh/docs/tutorials/cdc-postgres public static void main(String[] args) { StreamSource source =…
vivek
  • 115
  • 2
  • 9
2
votes
1 answer

How to apply python on BatchStage Hazelcast Jet

What I am trying to do is, I am fetching the data from different source and doing grouping and join on them and after that I am getting result in BatchStage and now I have to perform python code on this function but I am not able to do it. Here is…
user3458271
  • 638
  • 12
  • 31
2
votes
1 answer

Perform Left, Right and Inner join in Hazelcast Jet

I am working on Hazelcast jet application and I trying to join two Sources using Left, Right or Inner Join but I am stuck at below: Here is my code: BatchStage> batch1=…
user3458271
  • 638
  • 12
  • 31
1
2 3
12 13