Spark uses Murmur3 algorithm to compute the hash value. I tried using this Python-based mmh3 package to produce a hash but it produces a different hash value from Spark.
I've read lots of relavent questions about Spark's Hash algorithm but I still don't know how to get same hash value in pure Python.