Questions tagged [gelly]

Gelly is the graph processing library for the Apache Flink distributed streaming engine.

Gelly is a Graph API for Flink. It contains a set of methods and utilities which aim to simplify the development of graph analysis applications in Flink. In Gelly, graphs can be transformed and modified using high-level functions similar to the ones provided by the batch processing API. Gelly provides methods to create, transform and modify graphs, as well as a library of graph algorithms.

20 questions
5 votes, 1 answer

Unable to achieve high CPU utilization with Flink and Gelly

I've been experimenting with Flink streaming for a while, using benchmarks like the Yahoo streaming benchmark (https://github.com/yahoo/streaming-benchmarks), which are supposed to stress the system, but I never achieved satisfactory CPU utilization -…
5 votes, 1 answer

How to import Apache Flink SNAPSHOT artifacts?

I want to add the documentation of Gelly to my project, but it gives me this error: Sources not found for: org.apache.flink:flink-gelly_2.10:1.2-SNAPSHOT. This is in my pom.xml: org.apache.flink
5 votes, 1 answer

Linkage failure when running Apache Flink jobs

I have a job developed in Flink 0.9 that uses the graph module (Gelly). The job runs successfully within the IDE (Eclipse), but after exporting it to a JAR using Maven (mvn clean install) it fails to execute on the local Flink instance with…
3 votes, 1 answer

Apache Flink creates incorrect plan

I created a simple job for Apache Flink that uses the PageRank implementation provided with Gelly. Locally, running inside the IDE, everything is fine. However, I tried to submit a JAR with my job to a Flink instance running on my machine, using the…
Renato Rosa
3 votes, 1 answer

Flink: PageRank type mismatch error

I want to compute PageRank from a CSV file of edges formatted as follows: 12,13,1.0 12,14,1.0 12,15,1.0 12,16,1.0 12,17,1.0 ... My code: var filename = ".csv" val graph = Graph.fromCsvReader[Long,Double,Double]( env =…
lary
2 votes, 1 answer

Flink Gelly Memory ran out

I run a cluster with the following specifications (per task manager): 16 CPU threads, 16 GB RAM, 16 slots. I have two task managers, and when we run a graph algorithm such as connected components, the program fails with the…
2 votes, 1 answer

Flink Gelly - Type mismatch while creating graph

I am new to Apache Flink and Gelly and I use the Scala API. I have a DataSet of vertices and a DataSet of edges and I am trying to create a graph like this: val env = ExecutionEnvironment.getExecutionEnvironment // correct result val edges:…
Al Jenssen
1 vote, 1 answer

Flink Gelly updating graph during computation

I am new to Flink and Gelly. We have a requirement to do graph computation while the graph itself is constantly changing due to frequent incremental updates. Because the graph is so huge, we did some investigation on Spark with TinkerPop. But the…
wding109
1 vote, 0 answers

Flink: Memory ran out exception

I set up a Flink cluster with the following configuration: number of task managers: 2. When I run the Connected Components algorithm (org.apache.flink.graph.library.ConnectedComponents) on a graph with 4 million edges and 1,750,000 vertices…
1 vote, 0 answers

Better performance with Flink Gelly when decreasing available memory

I notice strange behavior in the performance of Flink Gelly (PageRank algorithm on a graph of approximately 17,000,000 vertices and 136,000,000 edges). While setting taskmanager.heap.mb: 125000 results in a runtime of ~110 sec, taskmanager.heap.mb:…
1 vote, 0 answers

Flink Gelly: implementing closeness

I aim to implement closeness centrality in Flink Gelly and am trying to use a map function on each node to calculate SSSP for it. The Graph is not serializable, so I can't send it to each node. Any help would be highly appreciated.
1 vote, 1 answer

Flink Gelly extending edge class and using it in DataSet

In Gelly I'm trying to make a special Edge called a temporal edge. To make this easier I made a class called TemporalEdgev3: public class TemporalEdgev3 extends Edge> { /* Creates new temporaledge with only null…
1 vote, 1 answer

Trying to extend Gelly Graph

I'm trying to extend a Gelly Graph into a Graph whose edges are Tuple5 instead of Tuple3. This is not possible by extending a Gelly Graph since the constructor is private. I went ahead and made my own Edge class that extends Tuple5 instead of Tuple…
1 vote, 1 answer

Flink Serialization Error

I'm trying to run the Label propagation protocol on my Apache Flink Gelly Graph. Here is my code: Graph ugraph = Graph.fromDataSet(vertex, edgeSet, env).getUndirected(); DataSet>…
1 vote, 4 answers

Flink: Connected Components - type mismatch error

I am trying to run the connected components algorithm on a graph using the Scala API as shown in the programming guide and other examples. val graph = Graph.fromDataSet(vertices, edges, env).getUndirected val maxIterations = 10 val components =…
Al Jenssen