schema = <Schema of excel file>

df = spark.read.format("com.crealytics.spark.excel") \
    .option("useHeader", "true") \
    .option("mode", "FAILFAST") \
    .schema(schema) \
    .option("dataAddress", "Sheet1") \
    .load("C:\\Users\\ABC\\Downloads\\Input.xlsx")

df.show()

The PySpark snippet above, which reads an Excel file into a DataFrame, does not fail or throw a runtime exception when the action (show()) reads incorrect/corrupt data. option("mode", "FAILFAST") works fine for CSV, but when I use the com.crealytics.spark.excel jar the job does not fail; it returns results with the incorrect/corrupt data silently dropped.
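For comparison, this is roughly how the same option behaves with Spark's built-in CSV reader, where FAILFAST does abort on malformed records (a minimal sketch; the schema, column names, and path below are placeholders, not from my actual job):

from pyspark.sql.types import StructType, StructField, IntegerType, StringType

csv_schema = StructType([
    StructField("id", IntegerType(), True),
    StructField("name", StringType(), True),
])

csv_df = (
    spark.read.format("csv")
    .option("header", "true")
    .option("mode", "FAILFAST")  # malformed rows raise an exception at read time
    .schema(csv_schema)
    .load("C:\\Users\\ABC\\Downloads\\Input.csv")
)

csv_df.show()  # the action triggers the scan; corrupt rows fail the job here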

Has anyone encountered the same issue?

Thanks in advance!

1 Answer


Based on the following documentation, mode is not mentioned anywhere as a supported option:

https://github.com/crealytics/spark-excel
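If you still need fail-fast behaviour with this data source, one possible workaround (not a feature of spark-excel itself, and the column names and types below are placeholders, not from your file) is to validate the data yourself after loading: read every column as a string, cast to the expected types, and raise an error if any cast silently produced a null.

from pyspark.sql import functions as F
from pyspark.sql.types import StructType, StructField, StringType

# Read everything as strings first so nothing is dropped or coerced silently.
raw_schema = StructType([
    StructField("id", StringType(), True),
    StructField("amount", StringType(), True),
])

raw_df = (
    spark.read.format("com.crealytics.spark.excel")
    .option("useHeader", "true")
    .option("dataAddress", "Sheet1")
    .schema(raw_schema)
    .load("C:\\Users\\ABC\\Downloads\\Input.xlsx")
)

# Cast to the expected types; values that cannot be cast become null.
checked_df = (
    raw_df
    .withColumn("id_int", F.col("id").cast("int"))
    .withColumn("amount_dbl", F.col("amount").cast("double"))
)

# A non-null raw value that became null after casting indicates corrupt data.
corrupt_count = checked_df.filter(
    (F.col("id").isNotNull() & F.col("id_int").isNull())
    | (F.col("amount").isNotNull() & F.col("amount_dbl").isNull())
).count()

if corrupt_count > 0:
    raise ValueError(f"Found {corrupt_count} rows that do not match the expected types")

df = checked_df.select(
    F.col("id_int").alias("id"),
    F.col("amount_dbl").alias("amount"),
)
df.show()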

Ranga Reddy