Being unsure whether a huge dataframe contains Chinese or Arabic characters, I would like to find out (a) whether there are indeed such values in a given column, and (b) if so, I would like to subset the respective rows.
Would that be possible in R? If so, how?
Here is an exemplrary dataframe:
> DF <- data.frame(Var = c("Test1", "Another test", "Oranges"), Names = c("汉字", "Lioba", "الْأَبْجَدِيَّة الْعَرَبِيَّة"))
> dput(DF)
structure(list(Var = c("Test1", "Another test", "Oranges"), Names = c("<U+6C49><U+5B57>",
"Lioba", "<U+0627><U+0644><U+0652><U+0623><U+064E><U+0628><U+0652><U+062C><U+064E><U+062F><U+0650><U+064A><U+064E><U+0651><U+0629> <U+0627><U+0644><U+0652><U+0639><U+064E><U+0631><U+064E><U+0628><U+0650><U+064A><U+064E><U+0651><U+0629>"
)), class = "data.frame", row.names = c(NA, -3L))