I found a Python script (from 2018) on GitHub for extracting YouTube transcripts.
I fixed line 37, which used a deprecated argument, changing it from:
driver = webdriver.Firefox(firefox_options=options)
to:
driver = webdriver.Firefox(options=options)
I have a file named url.csv.
It has a header row, 'url'.
There is a single URL on line 2 of the CSV for testing.
Lines 2 and 3 of captions.py have been changed from:
filename = 'videolist_zembla_273_2018_05_25-09_17_02.tab'
colname = 'videoId'
To:
filename = 'url.csv'
colname = 'url'
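
To rule out the CSV itself, a check along these lines should confirm that the 'url' column can be read by name. This is only a sketch: `read_column` is a hypothetical helper, not something from captions.py, and I have not confirmed how the script actually parses the file. The `utf-8-sig` encoding is there in case the file was saved with a byte-order mark, which would otherwise end up in the header name.

```python
import csv


def read_column(path, colname):
    """Return the non-empty values of one named column from a CSV file.

    Hypothetical helper for checking url.csv; not part of captions.py.
    """
    # utf-8-sig strips a leading byte-order mark if one is present
    with open(path, newline="", encoding="utf-8-sig") as f:
        reader = csv.DictReader(f)
        header = reader.fieldnames or []
        if colname not in header:
            raise KeyError(f"column {colname!r} not found; header is {header}")
        values = []
        for row in reader:
            value = (row[colname] or "").strip()
            if value:
                values.append(value)
        return values
```

Calling `read_column('url.csv', 'url')` from inside the 'subtitles' folder should return a list containing the one test URL. A `KeyError` would mean the header is not being read as 'url'.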
All files are in a folder named 'subtitles', along with geckodriver.exe.
The script runs until it hits the third exception, 'could not find transcript in options menu', then fails.
I have tried different URLs with no success. I suspect it may be a timeout issue, but I don't know how to confirm or fix that.
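
To test the timeout theory, one option I am considering is to retry the transcript lookup for a few seconds instead of trying it once. The following is a minimal sketch in plain Python. `wait_until` is a made-up name, and the `predicate` would have to wrap whatever call in the script looks for the transcript entry, which I have not identified yet. Selenium also ships its own `WebDriverWait` class in `selenium.webdriver.support.ui`, which may be the better fit.

```python
import time


def wait_until(predicate, timeout=10.0, poll=0.5):
    """Call predicate() until it returns a truthy value or the timeout elapses.

    Hypothetical helper; returns the truthy value, or raises TimeoutError.
    """
    deadline = time.monotonic() + timeout
    while True:
        result = predicate()
        if result:
            return result
        if time.monotonic() >= deadline:
            raise TimeoutError(f"condition not met within {timeout} seconds")
        # Pause briefly so the page has time to finish loading
        time.sleep(poll)
```

If the lookup succeeds with a longer timeout, the problem is timing. If it still fails, the menu entry itself has probably changed since 2018.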
Can anyone help me troubleshoot this further? I'm stumped at this point.
Any help is appreciated.