Questions tagged [codepages]

Code page is another term for character encoding, popular with some vendors (IBM, Microsoft, Oracle, etc.). It consists of a table of values that describes the character set for a particular language.

See also http://en.wikipedia.org/wiki/Code_page
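
As a rough illustration of the "table of values" idea, the Python sketch below decodes the same bytes under several code pages. The helper name `decode_table` and the sample bytes are hypothetical, chosen only for this example; the codec names (`cp437`, `cp1252`, `latin-1`) are standard Python codecs. Results are printed as Unicode code points so the output does not depend on the console's own code page.

```python
def decode_table(raw, code_pages):
    """Decode the same bytes under each code page.

    Returns a dict mapping code page name to a list of code points
    written as 'U+XXXX'. The function name is illustrative only.
    """
    table = {}
    for name in code_pages:
        text = raw.decode(name)  # look each byte up in that code page's table
        table[name] = [f"U+{ord(ch):04X}" for ch in text]
    return table


# Three byte values above the ASCII range, where code pages differ.
sample = bytes([0x80, 0xE9, 0xF1])

for name, points in decode_table(sample, ["cp437", "cp1252", "latin-1"]).items():
    print(name, points)
```

Under `cp1252` the byte 0x80 decodes to U+20AC (the euro sign), while under `cp437` the same byte decodes to U+00C7; the bytes are identical and only the lookup table changes.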

458 questions
273 votes, 10 answers

What is ANSI format?

What is ANSI encoding format? Is it a system default format? In what way does it differ from ASCII?
web dunia • 9,381 • 18 • 52 • 64

80 votes, 4 answers

How do you properly use WideCharToMultiByte

I've read the documentation on WideCharToMultiByte, but I'm stuck on this parameter: lpMultiByteStr [out] Pointer to a buffer that receives the converted string. I'm not quite sure how to properly initialize the variable and feed it into the…
Obediah Stane • 15,471 • 14 • 39 • 28

66 votes, 7 answers

Text was truncated or one or more characters had no match in the target code page When importing from Excel file

I have an Excel file with four text columns: one of them is called ShortDescription which has the longest value. I created a table in SQL Server 2008 database, with four columns and the ShortDescription column type is set to NvarChar(Max), but when…
Nour • 5,252 • 3 • 41 • 66

63 votes, 4 answers

Is codepage 65001 and utf-8 the same thing?

<%@LANGUAGE="VBSCRIPT" CODEPAGE="65001"%>
Mask • 33,129 • 48 • 101 • 125

62 votes, 11 answers

How do I correct the character encoding of a file?

I have an ANSI encoded text file that should not have been encoded as ANSI as there were accented characters that ANSI does not support. I would rather work with UTF-8. Can the data be decoded correctly or is it lost in transcoding? What tools could…
Liam • 19,819 • 24 • 83 • 123

42 votes, 5 answers

What's the difference between an "encoding," a "character set," and a "code page"?

I'm really trying to get better with this stuff. I'm pretty functional with internationalization concepts like this, but I need to get a better background on the theory behind it. I've read Spolsky's article, but I'm still unclear because these…
Deane • 8,269 • 12 • 58 • 108

30 votes, 3 answers

The encoding that Notepad++ just calls "ANSI", does anyone know what to call it for Ruby?

I have a bunch of .txt's that Notepad++ says (in its drop-down "Encoding" menu) are "ANSI". They have German characters in them, [äöüß], which display fine in Notepad++. But they don't show up right in irb when I File.read 'this is a German text…
Owen_AR • 2,867 • 5 • 20 • 23

26 votes, 7 answers

Running BAT/CMD file with accented characters in it

I have a Windows batch file which has an instruction to execute an EXE file in a location whose path contains accented characters. Following are the contents of the batch file. @echo off C:\español\jre\bin\java.exe…
Kryptic Coder • 612 • 2 • 8 • 20

23 votes, 2 answers

What characters do not directly map from Cp1252 to UTF-8?

I've read in several stackoverflow answers that some characters do not directly map (or are even "unmappable") when converting from Cp1252 (aka Windows-1252; they're the same, aren't they?) to UTF-8, e.g. here:…
Christian • 6,070 • 11 • 53 • 103

21 votes, 3 answers

Can isdigit legitimately be locale dependent in C

In the section covering setlocale, the ANSI C standard states in a footnote that the only ctype.h functions whose behaviour is not affected by the current locale are isdigit and isxdigit. The Microsoft implementation of isdigit is locale dependent…
cdev • 787 • 5 • 10

19 votes, 6 answers

Change CodePage in CMD permanently?

My Windows cmd code page is currently set to 65001. I changed it while doing some Android work that needed the console log, but I have forgotten how I did it. Using chcp 437 to change the code page back works, but if I start a new cmd window, it goes…
Lyn • 699 • 1 • 7 • 17

19 votes, 4 answers

Why isn't UTF-8 allowed as the "ANSI" code page?

The Windows _setmbcp function allows any valid code page... (except UTF-7 and UTF-8, which are not supported) OK, not supporting UTF-7 makes sense: Characters have non-unique representations and that introduces complexity and security risks. But…
dan04 • 87,747 • 23 • 163 • 198

18 votes, 4 answers

Can .NET convert Unicode to ASCII to remove "smart quotes", etc?

Some of our users use e-mail clients that can't cope with Unicode, even when the encoding, etc. are properly set in the mail headers. I'd like to 'normalise' the content they're receiving. The biggest problem we have is users copy'n'pasting content…
Dylan Beattie • 53,688 • 35 • 128 • 197

16 votes, 2 answers

Unicode filenames on FAT-32?

As far as I understand, NTFS supports Unicode filenames (UTF-16, as Microsoft claims?). But the official MSDN documentation is very vague regarding what codepage(s) are used to store filenames (filepaths) on FAT-32. Here it says that OEM code page (CP437…
jake.libber • 163 • 1 • 1 • 5

16 votes, 7 answers

psql shell uses code page 850, windows uses 1252. How to solve change console code page?

I have installed PostgreSQL and wanted to set up a new database. Everything was going fine until I tried to use the € sign. Then I realized that the warning I get at the beginning is there for a reason. The warning I am getting at starting up my psql shell…
mtjiran • 292 • 1 • 2 • 12