Is there any Collator
implementation which has the same characteristics as MySQL's utf8_general_ci? I need a collator which is case insensitive and does not distinguish german umlauts like ä
with the vowel a
.
Background:
We recently encountered a bug which was caused by a wrong collation in our table. The used collation was utf8_general_ci
where utf8_bin
would be the correct one. The particular column had a unique index. The utf8_general_ci
collation does not distinguish between words like pöker
and poker
, so the rows were merged, which was not desired.
I now need a way to implement a module for our Java application, which repairs the wrong rows.