The character B can have:
HEX code = 0x42 (real letter) or 0x412 (fake letter, the Cyrillic В)
DEC code = 66 (real letter) or 1042 (fake letter)
HTML numeric character reference = &#66; (real letter) or &#1042; (fake letter)
Java string = B (real letter) or \u0412 (fake letter)
When I fetch content from a remote URL with cURL, both variants are displayed as the letter B on macOS, but the character might not actually be the real letter B. I use this online tool to check whether a letter is real or not.
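The same check can probably be done locally in PHP instead of with an online tool. This is a minimal sketch, assuming the content is valid UTF-8 and PHP 7.2+ with the mbstring extension is available; `find_non_ascii` is a hypothetical helper name, not a built-in function:

```php
<?php
// Hypothetical helper: list every distinct non-ASCII character in a UTF-8
// string together with its code point, so look-alike letters become visible.
function find_non_ascii(string $text): array
{
    $found = [];
    // The /u modifier makes PCRE treat pattern and subject as UTF-8.
    if (preg_match_all('/[^\x00-\x7F]/u', $text, $matches)) {
        foreach (array_unique($matches[0]) as $char) {
            // mb_ord() returns the Unicode code point of the character.
            $found[$char] = sprintf('U+%04X', mb_ord($char, 'UTF-8'));
        }
    }
    return $found;
}

// "A", then the Cyrillic look-alike of B (U+0412), then "C".
print_r(find_non_ascii("A\u{0412}C"));
```

For the sample string this reports only the middle character, with code point U+0412. Legitimate non-ASCII text (accents, real Cyrillic words) would be reported too, so this only shows where to look.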
This code works for a single letter:
$content = str_replace("В", "B", $content);
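The single replacement could be extended to several letters with a lookup table passed to `strtr()`. This is only a rough sketch, assuming PHP 7+ and UTF-8 content; the map is a hand-picked, incomplete list of Cyrillic capitals that look like Latin ones, and `replace_lookalikes` is a hypothetical name:

```php
<?php
// Assumed, deliberately incomplete map: Cyrillic capital => Latin look-alike.
$homoglyphs = [
    "\u{0410}" => 'A',
    "\u{0412}" => 'B',
    "\u{0415}" => 'E',
    "\u{041A}" => 'K',
    "\u{041C}" => 'M',
    "\u{041D}" => 'H',
    "\u{041E}" => 'O',
    "\u{0420}" => 'P',
    "\u{0421}" => 'C',
    "\u{0422}" => 'T',
    "\u{0425}" => 'X',
];

// Hypothetical helper: replace every mapped look-alike in one pass.
function replace_lookalikes(string $content, array $map): string
{
    // strtr() with an array replaces all keys in a single pass over the string.
    return strtr($content, $map);
}

// Cyrillic U+0412 and U+041E followed by a real Latin "B".
echo replace_lookalikes("\u{0412}\u{041E}B", $homoglyphs), "\n";
```

This would also rewrite genuine Cyrillic text, so it only makes sense where the content is expected to be Latin.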
But how can I do the same in PHP for all other illegal characters?