I need to understand How can I remove duplicate rows from a Data-frame on the basis of single in Spark SQL using Java.
Like in normal SQL, ROW_NUMBER () OVER (PARTITION BY col ORDER BY Col DESC). How Can I translate this step into Spark SQL in Java?