The input JSON data looks like this:
{
  "CarBrands": [
    {
      "model": "audi",
      "make": " (YEAR == \"2009\" AND CONDITION in (\"Y\") AND RESALE in (\"2015\")) ",
      "service": {
        "first": null,
        "second": [],
        "third": []
      },
      "dealerspot": [
        {
          "dealername": [
            "\"first\"",
            "\"abc\""
          ]
        },
        {
          "dealerlat": [
            "\"45.00\"",
            "\"38.00\""
          ]
        }
      ],
      "type": "ok",
      "plate": true
    },
    {
      "model": "bmw",
      "make": " (YEAR == \"2010\" AND CONDITION OR (\"N\") AND RESALE in (\"2016\")) ",
      "service": {
        "first": null,
        "second": [],
        "third": []
      },
      "dealerspot": [
        {
          "dealername": [
            "\"sports\"",
            "\"abc\""
          ]
        },
        {
          "dealerlat": [
            "\"45.00\"",
            "\"38.00\""
          ]
        }
      ],
      "type": "ok",
      "plate": true
    },
    {
      "model": "toy",
      "make": " (YEAR == \"2013\" AND CONDITION in (\"Y\") AND RESALE in (\"2018\")) ",
      "service": {
        "first": null,
        "second": [],
        "third": []
      },
      "dealerspot": [
        {
          "dealername": [
            "\"nelson\"",
            "\"abc\""
          ]
        },
        {
          "dealerlat": [
            "\"45.00\"",
            "\"38.00\""
          ]
        }
      ],
      "type": "ok",
      "plate": true
    }
  ]
}
I want to process this file in Spark using Scala and get the following output as a DataFrame:
+-------+------+-----------+---------------+--------+-----------------+------------+
| model | year | condition | cond_operator | resale | resale_operator | dealername |
+-------+------+-----------+---------------+--------+-----------------+------------+
| audi  | 2009 | Y         | in            | 2015   | in              | first      |
| bmw   | 2010 | N         | OR            | 2016   | in              | sports     |
| toy   | 2013 | Y         | in            | 2018   | in              | nelson     |
+-------+------+-----------+---------------+--------+-----------------+------------+
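
To make the mapping from the `make` string to the output columns concrete, here is a minimal sketch in plain Scala of how the fields might be extracted with a regular expression. It is only an illustration under assumptions: the names `MakeFields`, `MakeParser`, `parse` and `unquote` are hypothetical, and the pattern assumes every `make` value follows exactly the three shapes shown in the sample (YEAR, then CONDITION, then RESALE, each with a single quoted value).

```scala
import scala.util.matching.Regex

// Hypothetical container for the columns derived from the "make" string.
final case class MakeFields(
  year: String,
  condition: String,
  condOperator: String,
  resale: String,
  resaleOperator: String
)

object MakeParser {
  // Assumes the layout seen in the sample data:
  //   (YEAR == "2009" AND CONDITION in ("Y") AND RESALE in ("2015"))
  // Groups: 1 = year, 2 = condition operator, 3 = condition,
  //         4 = resale operator, 5 = resale.
  private val MakePattern: Regex =
    """\(\s*YEAR\s*==\s*"(\d+)"\s+AND\s+CONDITION\s+(\w+)\s*\("(\w+)"\)\s+AND\s+RESALE\s+(\w+)\s*\("(\d+)"\)\s*\)""".r

  // Returns None when the string does not match the assumed layout.
  def parse(make: String): Option[MakeFields] =
    MakePattern.findFirstMatchIn(make).map { m =>
      MakeFields(
        year = m.group(1),
        condition = m.group(3),
        condOperator = m.group(2),
        resale = m.group(5),
        resaleOperator = m.group(4)
      )
    }

  // The dealername entries carry embedded quotes (e.g. "\"first\"" in the JSON);
  // this strips one leading and one trailing double quote if present.
  def unquote(s: String): String =
    s.stripPrefix("\"").stripSuffix("\"")
}
```

In a Spark job, the file would presumably be read with `spark.read.option("multiLine", true).json(...)` and the `CarBrands` array flattened with `explode`. The same pattern could then be applied per row, either through `regexp_extract` on the `make` column or by wrapping `MakeParser.parse` in a UDF. That wiring is not shown here and has not been verified against this data.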