How do I remove an interpunct (aka interpoint, middle dot, middot) from a string? I am looking for something like trimws, but trimws doesn't work on the interpunct. Cheers
Asked
Active
Viewed 452 times
-4
-
2Can you give a [minimal, reproducible example](https://stackoverflow.com/questions/5963269/how-to-make-a-great-r-reproducible-example) and the code that is not working for you? – markus Sep 25 '18 at 09:03
1 Answers
1
I believe this is what you're looking for.
string <- c("· interpunct", "interpunct · interpunct", "interpunct · ")
#[1] "· interpunct" "interpunct · interpunct" "interpunct · "
sub("(?:\\s?)+·(?:\\s?)+", "", string)
#[1] "interpunct" "interpunctinterpunct" "interpunct"

Andre Elrico
- 10,956
- 6
- 50
- 69
-
Thanks, that works for what you have written, but unfortunately not for my string. In my string, the interpunct looks like a space (not the symbol you write), but isn't removed with trimws. When I copy and paste the string into word, it gives me the interpunct symbol. The string came from a confidential document, so unfortunately I can't upload it here. Any idea how to identify and remove it? – nm200 Sep 25 '18 at 10:02
-
Change the content of your confidential string to make it non-confidential by keeping the same "structure". Otherwise, we have to play the guessing game. – Andre Elrico Sep 25 '18 at 10:06
-
You probably just want to copy "that strange interpunct" character of yours and replace it with my interpunct character in the regex. – Andre Elrico Sep 25 '18 at 10:10
-
Hi Andre, thanks for the help. When I copy and paste the character in R, it comes out as a white space. That means that I can't paste here an example, and copy and paste into the regex doesn't work either. The only time I have been able to see a difference, is when I copy and paste into word. Then I see the intepunct character. I don't know how I can copy the symbol here so that it appears correctly. – nm200 Sep 25 '18 at 10:22