I would like to normalize the following JSON:
[
{
"studentId": 1,
"studentName": "James",
"schools": [
{
"schoolId": 1,
"classRooms": [
{
"classRoomId": {
"id": 1,
"floor": 2
}
},
{
"classRoomId": {
"id": 3
}
},
],
"teachers": [
{
"teacherId": 1,
"teacherName": "Tom"
},
{
"teacherId": 2,
"teacherName": "Sarah"
}
]
},
{
"schoolId": 2,
"classRooms": [
{
"classRoomId": {
"id": 4
}
}
],
"teachers": [
{
"teacherId": 1,
"teacherName": "Tom"
},
{
"teacherId": 2,
"teacherName": "Sarah"
},
{
"teacherId": 3,
"teacherName": "Tara"
}
]
}
]
}
]
And I would like to get the following table in Python (tabular form):
studentId studentName schoolId classRoomId.id classRoomId.floor teacherId
teacherName
1 James 1 1 2 1 Tom
1 James 1 1 2 2 Sarah
1 James 1 3 1 Tom
1 James 1 3 2 Sarah
1 James 2 4 1 Tom
1 James 2 4 2 Sarah
1 James 2 4 3 Tara
I've tried to use Pandas json_normalize function like this:
df1 = json_normalize(test1, ["schools","teachers"], ["studentId", "studentName",["schools","teachers"]])
df2 = json_normalize(test1, ["schools","classRooms"], ["studentId", "studentName",["schools","classRooms"]])
df = pd.concat([df1,df2],axis=1)
But that doesn't give me the structure I need.
It doesn't have to be in Pandas, any other library or code in Python would do. Any help is appreciate it. Thank you.