Questions tagged [non-unicode]

Unicode is intended to be a universal character set for describing all the characters required for written text incorporating all writing systems, technical symbols and punctuation. But unicode isn't supported on every system, and many other character sets exist.

This tag concerns encoding questions dealing with non-unicode character sets. It can be about conversion from/to unicode, or dealing with special characters on systems not supporting unicode.

Some commons character sets:

  • ASCII. 7-bits. Only non-accented latin characters.
  • ISO-8859-1 (a.k.a. Latin-1). 8-bits. Occidental latin characters.
  • ISO-8859-15 (a.k.a Latin-9). Like ISO-8859-1, but with some additional characters like euro sign.
  • CP-1252. 8-bits. Occidental latin characters used by Windows.
  • CP-850. 8-bits. Occidental latin characters used by DOS.
63 questions
89
votes
13 answers

SSIS Convert Between Unicode and Non-Unicode Error

I have an ssis package where I am using an OLEDB source linking to SQL Server 2005 table. All columns except a date column are NVARCHAR(255). I am using an Excel destination and using a SQL statement to create the sheet in the Excel workbook, the…
Andrew
  • 1,728
  • 8
  • 28
  • 39
64
votes
16 answers

Import Package Error - Cannot Convert between Unicode and Non Unicode String Data Type

I have made a dtsx package on my computer using SQL Server 2008. It imports data from a semicolon delimited csv file into a table where all of the field types are NVARCHAR MAX. It works on my computer, but it needs to run on the clients server. …
Mike Thomas
8
votes
1 answer

libc++ vs VC++: Can non-UTF conversions be done with wstring_convert?

The C++11's std::wstring_convert works great* for the standard UTF-8 <-> UTF-16/UCS2/UCS4 conversions. However, when I attempted to instantiate a wstring_convert or wbuffer_convert with a facet not from , it didn't work as expected: //…
Cubbi
  • 46,567
  • 13
  • 103
  • 169
8
votes
4 answers

How do I fix invalid HTML characters in pages served with different encoding?

I have a number of websites that are rendering invalid characters. The pages' meta tags specify UTF-8 encoding. However, a number of pages contain characters that can't be interpreted by UTF-8, probably because the files were saved with another…
Andy
  • 856
  • 9
  • 26
6
votes
2 answers

What is the default VB6 charset?

we have an application written in Java which reads some text generated by a VB6 application. The problem is: this VB6 application generate this output using some special characters, like ç,ã,á which we don't know in what charset. So the question is:…
Kico Lobo
  • 4,374
  • 4
  • 35
  • 48
5
votes
2 answers

How does Windows identify non-Unicode applications?

I am building an MFC C++ application with "Use Unicode Character Set" selected in Visual Studio. I have UNICODE defined, my CStrings are 16-bit, I handle filenames with Japanese characters in them, etc. But, when I put Unicode strings containing…
Mark Gilbert
  • 73
  • 1
  • 5
4
votes
4 answers

How to fix ANSII character in SQL Server table to UTF-8

I have a data import process to import data from csv file into a table in SQL server. I have noticed that some columns contain some accented characters. For example I have noticed the following text in the database table CAFÉ I open a new file…
akd
  • 6,538
  • 16
  • 70
  • 112
4
votes
1 answer

Given a TTF font file, how do detect if it is an unicode or non-unicode?

I am trying to convert a txt file that is non-unicode to unicode. TTF font files are the inputs along with the txt file, to learn about the source and target. How do i get information from the TTF font files about the conversion that needs to take…
John
  • 693
  • 1
  • 12
  • 37
4
votes
2 answers

Why Non-Unicode apps system locale makes Unicode fonts with symbol charset displayed incorrectly?

I'm trying to display Unicode chars from Wingdings font (it's Unicode TrueType font supporting symbol charset only). It's displayed correctly on my Win7/64 system using corresponding regional OS settings: Formats: Russian Location: Russia System…
Rost
  • 8,779
  • 28
  • 50
3
votes
3 answers

When to use Unicode (aside with non-unicode!)

I haven't found much (concise) info about when exactly to use Unicode. I understand that many say best practice is to always use Unicode. But Unicode strings DO have more memory footprint. Am I correct to say that Unicode must be used only when …
Kharski
  • 43
  • 7
3
votes
1 answer

Map Unicode characters to similar EBCDIC 1047 characters automatically

I'm trying to encode a string that has characters not supported by the target encoding (CP 1047). Is there a standard/common/easy way of mapping those characters to a cp1047 equivalent? For example, the text has a fancy double quote character (”)…
tom
  • 694
  • 4
  • 10
3
votes
2 answers

Converting non-unicode binary strings to byte array in Javascript

I read a binary file using ajax from a web server. The HTTP response is of contentType: 'application/octet-stream' and contains a binary string that is just a string of bytes (not unicode), for example (in hex): 0x00 0x08 0x17 0xA1 0x01 Note: in C…
Kane
  • 549
  • 4
  • 8
3
votes
1 answer

SetThreadLocale for the entire Delphi application

My Delphi (XE 10) program uses nonunicode (single byte) legacy Firebird database (character set NONE) and therefore its data (that are read from or saved into database) depends on the windows Locale for the nonunicode programs. Depending on this…
TomR
  • 2,696
  • 6
  • 34
  • 87
3
votes
2 answers

How to use ORDER BY, LOWER in SQL SERVER 2008 with non-unicode data

The question is about Armenian. I'm using sql server 2005, collation SQL_Latin1_General_CP1_CI_AS, data mostly is in Armenian and we can't use unicode. I tested on ms sql 2008 with a windows collation for armenian language ( Cyrillic_General_100_…
hgulyan
  • 8,099
  • 8
  • 50
  • 75
3
votes
2 answers

Delphi - Get Windows' default non-unicode character set

I have a Delphi 7 application. I need to be able to get the default Windows character set for non-unicode programs. I know DEFAULT_CHARSET sets it, but I need to know exactly which charset it is, so that I could compare it to other character sets.…
jedivader
  • 828
  • 10
  • 23
1
2 3 4 5