Trouble with a specific character (’) in R

Question

Problem

I have been working on merging and standardizing several survey datasets. One problem that I'm running across is that there is inconsistent use of punctuation. Sometimes, the research is coded with a standard ', and other times is coded with ’.

For example, the names of the Ivory Coast in French is Côte d'Ivoire. Unfortunately, the data are not uniformly coded across time. As a result, when I run a crosstab, I get this:

country         2008      2009
-------         ----      ----
Cote d'Ivoire    498        0
Cote d’Ivoire     0        502

What I want to get is this:

country         2008      2009
-------         ----      ----
Cote d'Ivoire    498       502

When I try to standardize these to use the ' rather than the ’, I have absolutely no luck. It just doesn't seem to do anything. Here is the code I would use:

data$country[data$country == "Cote d’Ivoire"] <- Cote d'Ivoire

For some reason, I can't seem to figure this out, and it's driving me nuts. Does anyone know what I'm doing wrong?

Thank you!

firt what does `sum(data$country == "Cote d’Ivoire")` return? — Onyambu, Jan 31 '18 at 04:21
Well, I think I figure it out! I used `trimws()` to see whether there was perhaps some extra blank space in there, and it seems to have fixed the issue :) — Yasha, Feb 25 '18 at 15:08

Ajay Ohri · Answer 1 · 2018-01-31T04:29:06.743

2

you can replace a value with another value using gsub

data$country=gsub("’","'",data$country)

In case it doesnt work you may need to escape the special character using a double backslash

data$country=gsub("\\’","'",data$country)

See

Remove pattern from string with gsub

edited Jan 31 '18 at 04:29

answered Jan 31 '18 at 04:21

Ajay Ohri

3,382
3
30
60

2

It should be the other way round. He/She is trying to replace with ' – Onyambu Jan 31 '18 at 04:23
Thank you both very much. Unfortunately, this doesn't seem to work. I'll keep plugging away. – Yasha Jan 31 '18 at 04:41
You need to determine if indeed `’` exists. use `grep` and if it does exist then this code provided here might work – Onyambu Jan 31 '18 at 04:43
Dear @Onyambu - Thank you! I will figure out how to use grep. – Yasha Jan 31 '18 at 05:02

Trouble with a specific character (’) in R

Problem

1 Answers1