Mention ENC_UCS_2 and ENC_UTF_16.

svn path=/trunk/; revision=42602
Guy Harris 2012-05-12 20:10:18 +00:00
parent 3896fea6c0
commit 1c7269a6d1
1 changed file with 11 additions and 5 deletions

@@ -2377,15 +2377,21 @@ order.
 For string fields, the encoding specifies the character set used for the
 string and the way individual code points in that character set are
 encoded.  For FT_UINT_STRING fields, the byte order of the count must be
-specified; when support for UTF-16 encoding is added, the byte order of
-the encoding will also have to be specified.  In other cases, ENC_NA
-should be used.  The character encodings that are currently
-supported are:
+specified; for UCS-2 and UTF-16, the byte order of the encoding must be
+specified (for counted UCS-2 and UTF-16 strings, the byte order of the
+count and the 16-bit values in the string must be the same).  In other
+cases, ENC_NA should be used.  The character encodings that are
+currently supported are:
 
-    ENC_UTF_8 - UTF-8
     ENC_ASCII - ASCII (currently treated as UTF-8; in the future,
         all bytes with the 8th bit set will be treated as
         errors)
+    ENC_UTF_8 - UTF-8
+    ENC_UCS_2 - UCS-2
+    ENC_UTF_16 - UTF-16 (currently treated as UCS-2; in the future,
+        surrogate pairs will be handled, and non-valid 16-bit
+        code points and surrogate pairs will be treated as
+        errors)
     ENC_EBCDIC - EBCDIC
 
 Other encodings will be added in the future.
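
The byte-order rule in the new text (one byte order for both the count and the 16-bit values of a counted UCS-2 string) can be illustrated with a small standalone sketch. This is not Wireshark code: the sketch_* names and the SKETCH_* selector are hypothetical, and the sketch assumes a 16-bit count that gives the string length in bytes.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical byte-order selector for this sketch; not a Wireshark ENC_ flag. */
typedef enum { SKETCH_BIG_ENDIAN, SKETCH_LITTLE_ENDIAN } sketch_byte_order;

/* Assemble one 16-bit value from two bytes in the requested byte order. */
static uint16_t sketch_get_u16(const uint8_t *p, sketch_byte_order order)
{
    assert(p != NULL);
    if (order == SKETCH_BIG_ENDIAN)
        return (uint16_t)((p[0] << 8) | p[1]);
    return (uint16_t)((p[1] << 8) | p[0]);
}

/*
 * Decode a counted UCS-2 string: a 16-bit count (assumed here to be a
 * length in bytes) followed by that many bytes of 16-bit code units.
 * The same byte order is applied to the count and to every code unit.
 * Returns the number of code units written to out.
 */
static size_t sketch_decode_counted_ucs2(const uint8_t *buf, size_t buf_len,
                                         sketch_byte_order order,
                                         uint16_t *out, size_t out_cap)
{
    size_t byte_count, units, i;

    assert(buf != NULL && out != NULL);
    if (buf_len < 2)
        return 0;                      /* no room for the count itself */

    byte_count = sketch_get_u16(buf, order);
    if (byte_count > buf_len - 2)
        byte_count = buf_len - 2;      /* clamp to the data actually present */

    units = byte_count / 2;
    if (units > out_cap)
        units = out_cap;               /* clamp to the output capacity */

    for (i = 0; i < units; i++)
        out[i] = sketch_get_u16(buf + 2 + 2 * i, order);
    return units;
}
```

Decoding the bytes 04 00 48 00 69 00 as little-endian gives a count of 4 and the code units 0x0048 and 0x0069 ("Hi"). Reading the same bytes as big-endian misreads both the count and the characters, which is why the byte order has to be given explicitly for these encodings.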