I need to use a HTML website from R. The website is http://soundoftext.com/. This website allows to download MP3 audio from a word in a given language. I would like to download this file from R by selecting the language and word.
I have experience on parsing XML files but not HTML. So far, I got the tree structure but I don't know how to parse HTML files obtaining the "tags" to POST my values.
require(RCurl)
require(XML)
webpage <- getURL("http://soundoftext.com/")
webpage <- readLines(tc <- textConnection(webpage)); close(tc)
pagetree <- htmlTreeParse(webpage, error=function(...){}, useInternalNodes = TRUE)