My website needs to get transcripts (captions) from youtube videos. According to the API documentation this is supposed to work by:
- Listing the available captions by videoId
- Downloading the captions by id
Both these methods are supposed to require authorization with one of these scopes:
https://www.googleapis.com/auth/youtube.force-ssl https://www.googleapis.com/auth/youtubepartner
In reality, the list method works just fine with just a regular youtube api key. No OAuth2 needed. The download method returns a 401 Unauthorized.
This seems super strange, since the captions are publicly available, you can use scrapers to get them from video pages (not a viable solution for us), so why can't I just get that info like any other video data? And why can I use the 'list' method freely, and not the download, even though the docs say both require authorization?
Three years ago someone from google answered a similar question promising this feature should be available. There are many other older questions about this topic, all with inconclusive answers, or workarounds. On the support page for the api they say to ask here with the appropriate tags, so here I am, hoping for an answer.