Unicode is awesome. There aren't too many people who disagree with this.
Apart from Python 3 (which did it wrong), what would be the negative impact (if any) of the next major version of all programming languages defaulting to using Unicode/UTF-8 strings?
I'm talking specifically about the many cases which require workarounds to get UTF-8. For example, running a Java program:
java ... -Dfile.encoding=UTF-8
Or working with strings in Python 2:
# -*- coding: utf8 -*-
...
unicode_string = u"This is Unicode Text"
Certain MySQL databases default to a different character encoding by default:
[server]
collation_server=utf8_unicode_ci
character_set_server=utf8
etc. etc.
Why don't we all just default to using Unicode/UTF-8 and allow users to use the workarounds if they need support for other character encodings? What would be the problems with doing this?