OK, so I checked and it doesn't seem that anyone has asked this question yet.
So I have two words:
thiep cuoi
thiệp cưới
The problem is that when I COUNT() these, MySQL treats the two words as the same value. For instance, this SQL:
# Let's assume these two words have ids 1 and 2, and that the column name
# in the table is `word`.
SELECT `word`, COUNT(`word`)
FROM table_name
WHERE `id` IN(1,2)
GROUP BY `word`;
This returns the two words as a single row with a count of 2. These are not the same word in UTF-8, so how can I get around this behavior in MySQL? Doesn't MySQL group by the UTF-8 value rather than converting to ASCII? :/
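
One possible workaround, as a minimal sketch: the merging described above is consistent with the column using an accent-insensitive collation (for example, one of the `_ci` collations), so forcing a binary collation in the grouping expression should keep the two spellings apart. This assumes the column's character set is `utf8mb4`; for a `utf8` column the collation name would be `utf8_bin` instead. The aliases `word_exact` and `occurrences` are made up for illustration.

```sql
-- Sketch only: assumes `word` is stored as utf8mb4.
-- For a utf8 column, use utf8_bin instead of utf8mb4_bin.
SELECT `word` COLLATE utf8mb4_bin AS word_exact,  -- compare by exact code points
       COUNT(`word`) AS occurrences
FROM table_name
WHERE `id` IN (1, 2)
GROUP BY word_exact;  -- group on the binary-collated value, not the column default
```

Under those assumptions, the two words should come back as separate rows, each with a count of 1. If this comparison is wanted everywhere rather than in a single query, changing the column's collation may be the more lasting option.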