R to read xlsx to construct corpus

Asked Mar 27 '18 at 14:33

Active Mar 27 '18 at 14:44

Viewed 169 times

I intend to get Term-Document Matrix from hundreds of documents in .pdf and .xlsx. I could construct corpus with following command to read lots of pdf files.

bb <- Corpus(DirSource(aa),readerControl=list(reader=readPDF))

And searched and failed to find how to read .xlsx to construct corpus in R. I installed package and set library for 'xlsx' and 'rJava'. Is there any way to read hundreds of xlsx files similarly with reading pdf files. (There are so many files, I cannot convert them to .csv one by one)

edited Mar 27 '18 at 14:44

digEmAll

56,430
9
115
140

asked Mar 27 '18 at 14:33

Byung Yun Son

Please, when non-base functions are used, indicate the imported package(s) (e.g. `Corpus` belongs to tm package, right ?) – digEmAll Mar 27 '18 at 15:01
Sorry about that. you are right. Corpus is under tm package – Byung Yun Son Mar 28 '18 at 13:16

R to read xlsx to construct corpus

0 Answers0