I am stuck on a problem, I am using a very basic RTE to get user input, and trimming the garbage from the string, when that is posted using the functions provided with RTE. I am using http://premiumsoftware.net/cleditor
After user submits the data, I parse it with PHP and remove the unwanted content. Most of the users are Linux / Mac users, and they usually copy content from emails/word documents and paste that in RTE, causing lots of garbage.
We also need to allow all the UTF8 chars from any language.
Saying all this, please check this image
As you can see, in the color notes there is not special char visible, and if I copy this from MYSQL and paste it any where, there will be no garbage. But if I turn the values to HEX you can see, a strange char is there. Highlighted with yellow.
Is there any way to filter these kind of issues. It is causing my PDF generation script to stop working