How to get Ruby 1.9 regexp supports \p{Nonspacing_Mark}?

Question

Isn't the diacritical mark above "a" should be removed by the Regex?

 "hǎo".gsub(/\p{Nonspacing_Mark}/, '')
 => "hǎo" 

 "hǎo".gsub(/\p{Mn}/, '')
 => "hǎo"

Update:

I kind of get it from how it works in Java.

Normalizer.normalize("hǎo", Form.NFD).replaceAll("\\p{Mn}+", "")

I need to normalizer it first to split the "ǎ" into "a" and the diacritical mark.

take a look at this http://stackoverflow.com/questions/3571480/converting-chinese-to-pinyin — AabinGunz, Apr 19 '11 at 11:29
Are you wanting to this wickedness because you don’t know how to compare two strings in an “accent-insensitive” fashion? — tchrist, Apr 23 '11 at 22:42

score 0 · Accepted Answer · edited May 23 '17 at 11:47

0

puts UnicodeUtils.nfkd("ﻺ (hǎo)").gsub(/[\p{Nonspacing_Mark}]/, '')

edited May 23 '17 at 11:47

Community

answered Apr 19 '11 at 15:15

Cheng

1 Answers1