GraphX is a component in Apache Spark for graphs and graph-parallel computation
GraphX is a component in Apache Spark for graphs and graph-parallel computation.
At a high level, GraphX extends the Spark RDD by introducing a new Graph abstraction: a directed multigraph with properties attached to each vertex and edge.
To support graph computation, GraphX exposes a set of fundamental operators (e.g., subgraph, joinVertices, and aggregateMessages) as well as an optimized variant of the Pregel API.
In addition, GraphX includes a growing collection of graph algorithms and builders to simplify graph analytics tasks.