I extracted data from an API using Airflow. The data is extracted from the API and saved on cloud storage in JSON format.
The next step is to insert the data into an SQL DB. I have a few questions:
- Should I do it on Airflow or using another ETL like AWS Glue/Azure Data factory?
- How to insert the data into the SQL DB? I google "how to insert data into SQL DB using python"?. I found a solution that loops all over JSON records and inserts the data 1 record at a time. It is not very efficient. Any other way I can do it?
- Any other recommendations and best practices on how to insert the JSON data into the SQL server?
I haven't decided on a specific DB so far, so feel free to pick the one you think fits best.
thank you!