Skip to main content

Software  > Globalization > 

Globalize your On Demand Business

The Common XML Locale Repository is a step towards solving problems such as collation and date/time formatting that may be incorrect or inconsistent between systems.
Introduction

Unicode has provided a foundation for communicating textual data. However, the locale-dependant data used to drive features such as collation and date/time formatting may be incorrect or inconsistent between systems. This may not only present an irritating user experience, but prevent accurate data transfer.

The Common Locale Data Repository (CDLR) is a step towards solving these problems, by providing an interchange format for locale data and developing a repository of such data available. The first version of the CLDR, Version 1.0, was released on January 16, 2004 and is available for use. The data files are currently hosted by IBM (links to references are at the end of this article).

Traditionally, the data associated with locales provides support for formatting and parsing of dates, times, numbers, and currencies; for the default units of currency; for measurement units, for collation (sorting), plus translated names for time zones, languages, countries, and scripts. CLDR supplies locale data for a wide variety of types of information. In the future it should also supply data for text boundaries (character, word, line, and sentence), text transformations (including transliterations), and support for other services.

Examples of platforms with their own locale data are ICU, OpenOffice.org, and POSIX and POSIX-like operating systems such as Linux, Solaris, and AIX.


gray line

Continue to "Common XML Locale Repository Group"

Further reading
Items marked with a PDF icon require Adobe Acrobat Reader.
outside link icon ICU -  International Components for Unicode
outside link icon OpenI18N site
outside link icon Free Standards Group Web site
outside link icon W3C Web Services Scenarios
outside link icon Locale Data Markup Language (This paper refers to version 1.0 of this specification.)
outside link icon ISO-639 codes (names of languages)
outside link icon ISO-3166 code lists English country names and code elements
outside link icon OpenI18N Locale Naming Guide
outside link icon RFC-3066 Tags for the identification of languages (text file)
outside link icon Unicode Character Database (FTP)
outside link icon UTS #10 - Unicode Collation Algorithm
outside link icon XML attribute value normalization

E-mail us
Easy ways to get the answers you need.
E-mail us