I have a parquet file as source and I loaded that parquet file using PySpark notebook as shown below:
df_Employee = spark.read.parquet(<filename>)
df_Employee .createOrReplaceTempView("employee_data")
here's how the table looks like
Employee Table:
-Name
-Salary
-Company
-Address (datatype=string)
--street.name
--street.number
--postalcode
-JoiningDate
I have the following code but I'm stuck on how to extract street.name & street.number from the above SQL table, here's what I have
df=spark.sql(f'''
select Name, Salary, Company, json_extract(Address,'$."street.name"') as StreetName
from employee_data
''')
but json_extract(Address,'$."street.name"') as StreetName throws an error. How to extract this nested json field?