6

I am new to graph world. I have been assigned to work on graph processing. Now I know Apache Spark, so thought of using it Graphx to process large graph. Then I came across Gephi provides nice GUI to manipulate graphs.

Does Graphx have such tools or it is mainly parallel graph processing library. Can I import json graph data came from Gephi into graphx?

halfer
  • 19,824
  • 17
  • 99
  • 186
Umesh K
  • 13,436
  • 25
  • 87
  • 129

2 Answers2

1

Adding to that you can as well try Graphlab https://dato.com/products/create/open_source.html

It directly support Spark RDD https://dato.com/learn/userguide/data_formats_and_sources/spark_integration.html

Not much work required after that

from pyspark import SparkContext
import graphlab as gl

sc = SparkContext('yarn-client')

t = sc.textFile("hdfs://some/large/file")
sf = gl.SFrame.from_rdd(t)

# do stuff...

out_rdd = sf.to_rdd(sc)
Abhishek Choudhary
  • 8,255
  • 19
  • 69
  • 128
0

No, Apache Spark Graphx have no visualization, it's just a processing engine but you can import data from gephi to graphx using Gephi's API.

Kshitij Kulshrestha
  • 2,032
  • 1
  • 20
  • 27