How to create and use a lookup table

Question

I have a character vector of misspelled words and a matching vector of their corrections:

wordswrong <- c("veh", "crrts", "ornges")
wordscorrect <- c("vehicle", "carrots", "oranges")

Here's a dataframe:

words <- data.frame(terms = c("crrts oranges",
                              "car is a veh",
                              "orngs bannas peas"))

How can I go through each word in words$terms and correct it based on my two vectors?

Try `for(i in seq_along(wordswrong)) words$terms <- gsub(wordswrong[i], wordscorrect[i], words$terms)` or `library(qdap); words$terms <- mgsub(wordswrong, wordscorrect, words$terms)` — akrun, Jun 30 '17 at 07:02
Thanks @akrun! I'm sure I have a memory of once seeing code where someone used a lookup table along the lines of df$wrongwords <- lut (lookuptable). Does this sound familiar? Or maybe it's the wrong context for a list? Or perhaps since each cell is not an exact lookup I cannot go this route — Doug Fir, Jun 30 '17 at 07:04

score 1 · Accepted Answer · answered Jun 30 '17 at 07:06

We can use mgsub from the qdap package, which takes the vector of patterns and the vector of replacements together:

library(qdap)
words$terms <- mgsub(wordswrong, wordscorrect, words$terms)
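
For something closer to the lookup table mentioned in the comments, one possible base R sketch is a named character vector, with the misspellings as names and the corrections as values. Because each cell holds several words, the cell is split into tokens first and each token is looked up on its own. The helper name `fix_terms` and the variable `lut` are made up for illustration, and the sketch assumes words are separated by single spaces.

```r
# Data from the question (stringsAsFactors = FALSE keeps terms as character)
wordswrong <- c("veh", "crrts", "ornges")
wordscorrect <- c("vehicle", "carrots", "oranges")
words <- data.frame(terms = c("crrts oranges",
                              "car is a veh",
                              "orngs bannas peas"),
                    stringsAsFactors = FALSE)

# Lookup table: names are the misspellings, values are the corrections
lut <- setNames(wordscorrect, wordswrong)

# Hypothetical helper: split each cell on spaces, replace tokens that
# appear in the lookup table, and paste the tokens back together
fix_terms <- function(x, lut) {
  tokens <- strsplit(x, " ", fixed = TRUE)
  vapply(tokens, function(tok) {
    hit <- tok %in% names(lut)   # which tokens are known misspellings
    tok[hit] <- lut[tok[hit]]    # look them up by name
    paste(tok, collapse = " ")
  }, character(1))
}

words$terms <- fix_terms(words$terms, lut)
words$terms
# "carrots oranges" "car is a vehicle" "orngs bannas peas"
```

Only whole tokens are replaced here, so a word that merely contains a misspelling as a substring is left alone. Note that "orngs" and "bannas" stay unchanged because neither is in wordswrong (which has "ornges").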

akrun