I have hundreds of folders of Spanish short stories, with each folder having dozens of individual .MP3 files for each dialogue spoken. Along with the .MP3 files, lies in the folder, a JSON file of the format ["folder1-dialogue-1", "folder1-dialogue-2",...]
.
I want to generate images, each containing the text values of the JSON file, like folder1-dialogue-1
and generate videos combining that image with its corresponding audio (folder1-dialogue-1.mp3)
, and then merge all the videos in the folder into one - folder1.mp4
How do I go about doing this?
P.S I'm not straight up asking for code, but just advice as to how I go about doing this.