I have a data frame with a city column of 25,000 rows, and many of the city names are misspelled. A sample looks like this:
Vishakapatnam, a.p
Vishakapatnam URBAN
Vishakapatnam Distt.
Vishakapatnam
Vishakapatnam
vghjfg"
vgfsgsvsw
Vellore
Vellore
VELLORE
VELLORE
New deklhi
New Dehli
new dehli
NEW DEHI
xxxx
zz
a
1234
5644
3
The data contains city names with varying spellings, along with numeric values, blank entries, and random strings of letters. I want to map each misspelled city to a single name and remove the blanks, the meaningless letter strings, and the numbers. I have been trying to do this with grep, as suggested in some of the answers here, but it is very tedious. I also tried the tm package but could not get it to work. Could someone please share a more efficient way to do this?
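To make the goal concrete, here is a minimal sketch of the behaviour I am after, written in Python with only the standard library purely for illustration. It assumes a reference list of correct names is available. The list `CANONICAL`, the suffix pattern `NOISE`, the helper `clean_city`, and the 0.75 similarity cutoff are all hypothetical choices, not part of the data above, and the cutoff would likely need tuning on the real column.

```python
import re
from difflib import get_close_matches

# Hypothetical reference list of accepted spellings (an assumption for this sketch).
CANONICAL = ["Vishakapatnam", "Vellore", "New Delhi"]

# Assumed administrative suffixes, based only on the sample rows.
NOISE = re.compile(r"\b(urban|rural|distt?|district)\b", re.IGNORECASE)


def clean_city(raw, cutoff=0.75):
    """Return the closest canonical city name, or None if nothing is close enough."""
    text = raw.split(",")[0]                  # drop anything after a comma, e.g. ", a.p"
    text = NOISE.sub(" ", text)               # drop suffixes such as "URBAN" or "Distt"
    text = re.sub(r"[^A-Za-z ]", " ", text)   # drop digits and punctuation
    text = " ".join(text.split()).lower()     # collapse whitespace, ignore case
    if not text:
        return None                           # purely numeric or blank entry
    lookup = {name.lower(): name for name in CANONICAL}
    match = get_close_matches(text, list(lookup), n=1, cutoff=cutoff)
    return lookup[match[0]] if match else None


sample = ["Vishakapatnam, a.p", "New deklhi", "VELLORE", "vgfsgsvsw", "1234"]
cleaned = [clean_city(value) for value in sample]
```

Entries that come back as `None` would be the rows to drop. The same idea of normalising first and then fuzzy-matching against a known list should carry over to other tools; the open question is whether there is a cleaner way to do it on a whole data frame column.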