I convert Word Documents to html for display. I use DocTo to convert from .doc to .html.
If I open the resulting html file in a browser it is displayed correctly without any ? or other unicode problems.
However for my website I have to retrieve the HTML file via
file_get_contents('file/url to my file.html');
The resulting html when displayed has ? and [] squares in place of some characters in the document.
Is there a simple workaround I'm missing?
I know I could link directly to the html file, however this does not suit my needs as I need to output the content inside a div.