The tidytext package provides tools for text mining using tidy data principles in R.
The R tidytext
package, developed by Julia Silge and David Robinson, provides functions and supporting data sets to allow conversion of text to and from tidy formats, and to switch seamlessly between tidy tools and existing text mining packages. When text is in a tidy data structure, tools from the R tidyverse ecosystem like dplyr can be used for effective data handling and analysis.
Repositories
Vignettes
- Introduction to tidytext
- Tidy Term Frequency and Inverse Document Frequency (tf-idf)
- Converting to and from Document-Term Matrices and Corpus objects
- Tidy Topic Modeling