I have a dataset with a column containing strings in multiple languages. I am hoping to remove rows where one column contains a string in any language other than English. I can't seem to find any way to go about this. Does anyone have suggestions for a library or code that might be useful for this purpose?
Asked
Active
Viewed 35 times
0
-
British-English or US-English or SMS-English? – user3435121 Dec 30 '22 at 22:18
-
US-English is what I am hoping the keep – Niamh Dec 30 '22 at 22:19
-
Your answer do not seem to be in US-English. – user3435121 Dec 30 '22 at 22:28
1 Answers
1
This seems like a repeat of this, as the root of this question is how to detect non-English languages rather than how to filter a dataset.

bkleiboeker
- 21
- 3