9780201616330: The Unicode Standard, Version 3.0

Synopsis

Detailed specifications for Unicode: structure, conformance encoding forms, character properties, semantics, equivalence, combining characters, logical ordering, conversion, allocation, big/little endian usage, Korean syllable formation, control characters, case mappings, numeric values, mathematical properties, writing directions (Arabic, Japanese, English, and so on), character shaping (Arabic, Devanagari, Tamil, and so on).
Expanded implementation guidelines by experts in global software design: normalization, sorting and searching, case mapping, compression, language tagging, boundaries (characters, words, lines, and sentences), rendering of non-spacing marks, transcoding to other character sets, handling unknown characters, surrogate pairs, numbers, editing and selection, keyboard input, and more.

"synopsis" may belong to another edition of this title.

About the Author

The Unicode Consortium is a non-profit organization founded to develop, extend, and promote the use of the Unicode Standard. The membership of the Consortium represents a broad spectrum of corporations and organizations in the computer and information processing industry. The Unicode Consortium actively cooperates with many of the leading standards development organizations, including ISO/IEC JTC1, W3C, IETF, and ECMA.



0201616335AB07232003

From the Back Cover

Unicode

  • Characters for all the languages of the world
  • The standard for the new millennium
  • Required for XML and the Internet
  • The basis for modern software standards and products
  • The official way to implement ISO/IEC 10646
  • The key to global interoperability
The Unicode Standard, Version 3.0

The authoritative, technical guide to the creation of software for worldwide use.

Detailed specifications for Unicode:

  • Structure, conformance, encoding forms, character properties, semantics, equivalence, combining characters, logical ordering, conversion, allocation, big/little endian usage, Korean syllable formation, control characters, case mappings, numeric values, mathematical properties, writing directions (Arabic, Japanese, English, and so on), character shaping (Arabic, Devanagari, Tamil, and so on)

Expanded implementation guidelines by experts in global software design:

  • Normalization, sorting and searching, case mapping, compression, language tagging, boundaries (characters, word, lines, and sentences), rendering of non-spacing marks, transcoding to other character sets, handling unknown characters, surrogate pairs, numbers, editing and selection, keyboard input, and more

Comprehensive charts, references, glossary, and indexes:

  • Codes, names, appearances, aliases, cross-references, equivalences, radical-stroke ideographic index, Shift-JIS index, and more

CD-ROM

The comprehensive Unicode Character Database for:

  • Character codes, names, properties, decompositions, upper- ,lower-, and title cases, normalizations, shaping

International, national, and vendor character mappings for:

  • Western European, Japanese, Chinese, Korean, Greek, Russian, and others
  • Windows, Macintosh, Unix, and Linux

Unicode Technical Reportsthat extend the standard for:

  • Sorting, displaying, normalizing, linebreaking, compression, serialization, regular expressions, CR/LF, XML, case mappings, and more


0201616335B04062001

Excerpt. © Reprinted by permission. All rights reserved.

Preface This book, The Unicode Standard, Version 3.0, is the authoritative source of information on the Unicode character encoding standard, the international character code for information processing that includes all major scripts of the world and is the foundation for development of software for worldwide use. As well as encoding characters used for written communication in a simple and consistent manner, the Unicode Standard defines character properties and algorithms for use in implementations.

Version 3.0 expands on material from Versions 2.0 and 2.1 and supersedes all other previous versions. The previous versions of the Unicode Standard are:

The Unicode Standard, Version 1.0, Volume 1 (1991) The Unicode Standard, Version 1.0, Volume 2 (1992) The Unicode Standard, Version 1.1, Unicode Technical Report #4 (1993) The Unicode Standard, Version 2.0 (1996) The Unicode Standard, Version 2.1, Unicode Technical Report #8 (1998)

Major additions to Version 3.0 include:

conformance rules for transformation formats new scripts including Ethiopic, Khmer, Mongolian, Myanmar, and Sinhala restructured and enhanced character block descriptions clarified bidirectional algorithm updated implementation guidelines a Shift-JIS index

The Unicode Standard maintains consistency with the international standard ISO/IEC 10646. Version 3.0 of the Unicode Standard corresponds to ISO/IEC 10646-1:2000.

"About this title" may belong to another edition of this title.