I am currently working on a project in which a user can select a "layout" (for example, 3 or 4 different videos, one in each corner, similar to Acapella by PicPlayPost), put a video in each corner, edit each one individually, merge it with another video, merge the audio tracks, play them together to preview how the videos look, and export a single video from the result. I have found some APIs, but they either required a paid license or did not meet the requirements. Many of you may suggest FFmpeg, but I think it lacks the "making a single video out of it" part. A friend suggested using OpenGL to merge the videos, but I lack the knowledge and experience to do anything with OpenGL. Are there any suggestions that are legal (that is, not reverse engineering the APIs that I found)?
Side note: as an extra, it would be awesome if I could loop a segment of a video a set number of times!