I have a dataframe with a column that has many misspelled words in it.
I would like to simply (a) identify all misspelled words in each cell on the next column and (b) produce a list of all unique misspelled words found (not duplicates).
For example I have,
Column 1
I worked fertl for a long time.
I worked at fhe desk job.
I am seeing a prw of it.
cia and nba are both cool places to work
Desired output:
Column 1 | Column 2 |
---|---|
I worked fertl for a long time. | fertl |
I worked at fhe desk job. | fhe |
I am seeing a prw of it. | prw |
cia and nba are both cool places to work | cia, nba |
and also i want to get a list of all of these like:
{fertl, fhe, prw, cia, nba}