Questions tagged [iso-8859-15]

A single-byte character set in the ISO/IEC 8859 series of ASCII-based standard character encodings, which adds the Euro currency symbol and a few other replacement characters.

ISO/IEC 8859-15:1999, Information technology — 8-bit single-byte coded graphic character sets — Part 15: Latin alphabet No. 9, is part of the ISO/IEC 8859 series of ASCII-based standard character encodings; its first edition was published in 1999. It is informally referred to as Latin-9 (and was for a while called Latin-0).

It is similar to ISO-8859-1, and thus generally intended for “Western European” languages, but replaces some less common symbols with the euro sign and some letters that were deemed missing from part 1 for the target use.

ISO-8859-15 is the IANA preferred charset name for this standard when supplemented with the C0 and C1 control codes from ISO-6429. Microsoft has assigned code page 28605 aka Windows-28605 to ISO-8859-15.

There were attempts to make ISO-8859-15 the default character set for 8-bit communication, but it was never able to supplant the popular ISO-8859-1. It did see some use as a character set for terminal and text-mode programs under Linux when the Euro sign was needed but full Unicode (UTF-8) was not practical. All the printable characters from both ISO/IEC 8859-1 and ISO/IEC 8859-15 are also found in Windows-1252.
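The statement about printable characters can be checked with a short Python sketch. It assumes that Python's built-in `cp1252` codec is an acceptable stand-in for Windows-1252, and the helper name `printable_chars` is made up for illustration.

```python
# Printable positions shared by the ISO/IEC 8859 parts:
# 0x20-0x7E (ASCII) and 0xA0-0xFF (the upper half).
PRINTABLE_BYTES = list(range(0x20, 0x7F)) + list(range(0xA0, 0x100))


def printable_chars(codec):
    """Return the set of characters a codec assigns to the printable positions."""
    return {bytes([value]).decode(codec) for value in PRINTABLE_BYTES}


# Union of the printable repertoires of Latin-1 and Latin-9.
repertoire = printable_chars("iso-8859-1") | printable_chars("iso-8859-15")

# Collect every character that cp1252 cannot represent.
unencodable = []
for ch in sorted(repertoire):
    try:
        ch.encode("cp1252")
    except UnicodeEncodeError:
        unencodable.append(ch)

print(unencodable)
```

An empty list means every character from both sets has a code point in `cp1252`, although not necessarily at the same byte value (the Euro sign, for example, sits at a different position).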

Changes from ISO-8859-1

Position  0xA4  0xA6  0xA8  0xB4  0xB8  0xBC  0xBD  0xBE
8859-1       ¤     ¦     ¨     ´     ¸     ¼     ½     ¾
8859-15      €     Š     š     Ž     ž     Œ     œ     Ÿ
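The table above can be reproduced from Python's standard codecs. This is a minimal sketch rather than a reference implementation, and the function name `differing_positions` is hypothetical.

```python
def differing_positions():
    """Return (byte value, Latin-1 char, Latin-9 char) for every byte
    in the upper half that the two encodings decode differently."""
    diffs = []
    for value in range(0xA0, 0x100):
        raw = bytes([value])
        latin1 = raw.decode("iso-8859-1")
        latin9 = raw.decode("iso-8859-15")
        if latin1 != latin9:
            diffs.append((value, latin1, latin9))
    return diffs


for value, latin1, latin9 in differing_positions():
    print(f"0x{value:02X}  {latin1}  {latin9}")
```

Because only these eight positions differ, text that was encoded as ISO-8859-15 but decoded as ISO-8859-1 typically shows up with `¤` where a `€` was intended.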
26 questions
4
votes
3 answers

Replace accented character with html entity

I'm trying to automate a series of queries, but I need to replace characters with accents with the corresponding html entity. It needs to be in Python3. Example: vèlit [needs to become] v&egrave;lit. The thing is, whenever I try to do a…
Jordi
  • 101
  • 1
  • 6
4
votes
1 answer

FreeMarker special character output as question mark

I am trying to submit a form with fields containing special characters, such as €ŠšŽžŒœŸ. As far as I can see from the ISO-8859-15 wikipedia page, these characters are included in the standard. Even though the encoding for both request and response…
Alex
  • 312
  • 1
  • 3
  • 9
4
votes
3 answers

How to normalize unicode encoding for iso-8859-15 conversion in python?

I want to convert unicode string into iso-8859-15. These strings include the u"\u2019" (RIGHT SINGLE QUOTATION MARK see http://www.fileformat.info/info/unicode/char/2019/index.htm) character which is not part of the iso-8859-15 characters set. In…
luc
  • 41,928
  • 25
  • 127
  • 172
3
votes
1 answer

Encode Unicode to iso8859-15 with Python

I am using Python 2.7.6 and I am trying to convert a basic unicode string to iso8859-15. I get an error when trying to convert a string with non-ASCII chars. This would be OK if those chars did not exist in the iso8859-15 encoding, but in this…
lutz
  • 123
  • 1
  • 10
2
votes
0 answers

Is ISO8859-15 (Latin-9) a full subset of UTF-8?

I might need to convert an Oracle DB from ISO8859-15 (Latin-9) to UTF-8. So my question is: Is ISO8859-15 (Latin-9) a full subset of UTF-8? Because then there should be a lower risk of things going wrong. Any experience here?
stan
  • 41
  • 5
2
votes
0 answers

ISO-8859-15 encoding with nodejs

We are experiencing a very annoying encoding problem which started with loopback but seems to be nodejs related. Basically, we just finished developing an API with Loopback based upon an existing SQL_ASCII encoded postgresql database. Since the API…
Airzebeth
  • 41
  • 1
  • 6
2
votes
2 answers

antlr 4 iso-8859-15 encoded file matching string containing \u0161 š

I have this grammar: KEY : [a-zA-Z\u0160\u0161\u00C0-\u00FF][a-zA-Z_0-9\-\''\u0160\u0161\u00C0-\u00FF]* ; Reading an ISO-8859-15 encoded text file new ANTLRFileStream(fileName, "ISO-8859-15") with the string Milešovka. Why is š giving a token…
simsulla
  • 23
  • 2
2
votes
1 answer

Using re2c with ISO-8859-x

We have some text in ISO-8859-15 which we want to tokenize. (ISO-8859-15 is ISO-8859-1 with the Euro sign and other common accented characters; for more details see ISO-8859-15.) I am trying to get the parser to recognize all the characters. The…
Yimin Rong
  • 1,890
  • 4
  • 31
  • 48
2
votes
1 answer

Character encoding issue for french characters

In a web application, there is an HTML page with the below encoding:
Infotechie
  • 1,653
  • 6
  • 23
  • 35
1
vote
1 answer

Convert from UTF-8 to ISO8859-15 in C++

I would like to do a conversion from UTF-8 to ISO 8859-15 in C/C++, without including an additional library. How can I achieve this? I have found the following piece of code that works for ISO 8859-1 but I'm not sure about how to handle the…
Kamchatka
  • 3,597
  • 4
  • 38
  • 69
1
vote
2 answers

Which double quote characters are automatically replaced when converting from UTF-8 to ISO-8859-15?

I have an input file that is UTF-8 encoded. I need to use some of its content and create an ISO-8859-15 encoded CSV file from it. The problem is that UTF-8 seems to have several characters for double quotes that are automatically replaced to the…
Krisztián Balla
  • 19,223
  • 13
  • 68
  • 84
0
votes
0 answers

'–'.encode('utf-8').decode('iso8859_15') , different output in python2.7 and python3.7

I am migrating a software product, and eventually I came to this problem. s = '–' # https://www.fileformat.info/info/unicode/char/0096/index.htm in python2 s.encode('iso8859_15').decode('iso8859_15') # u'-' s.encode('utf-8').decode('iso8859_15') #…
0
votes
0 answers

Vite server encoding charset problems (iso-8859-15)

When running a Vite server in vanilla js for development or build, charset encoding changes at index.html. For specific reasons, I need the encoding to be iso-8859-15 (I know utf-8 is a more standard one, but I must use iso-8859-15) for write simple…
omiref
  • 1
  • 1
0
votes
0 answers

Is there any NoSQL database that allows for 8-bit single-byte encodings, such as ISO-8859-1?

I'm trying to choose the best NoSQL database service for my data. Most NoSQL databases out there only support UTF-8, and there's no way to enforce an encoding, unlike relational dbs. And the problem is that UTF-8 uses one byte only for the first 127…
0
votes
0 answers

Convert byte[] to ISO-8859-15 or utf-8

I need help to convert a byte array into a format which could read the € symbol. Now, the program I'm working on is reading the file like this : new String(bytes, "ISO-8859-1") I find ISO-8859-15 and utf-8 can make € readable. But I can't find a…
ANAFLY
  • 1
  • 2