 |
|
|
 | Using ICU
IBM International Components for Unicode (ICU) is a set of versatile components and mature Open-Source implementations of various Globalization operations. ICU is widely used and is regarded as the de-facto standard for globalization and Unicode support in the software industry.One of the services offered by ICU is the conversion service, which converts data between many non-Unicode encodings or code-pages and Unicode.
If your source encoding is listed in ICU Converter Explorer, or a mapping table is available in ICU’s character-set repository, then they can be readily used.Otherwise, you must create the character map to use it. Following is the procedure, in accordance with the steps above, which you should use to write converters using ICU:
-
-
Integrate the ‘.ucm’ files ICU and rebuild.
-
Migrating data from some non-Unicode encodings to Unicode may not be as straightforward as described above. You may need to perform some pre-processing and post-processing of data in conjunction with transliteration to achieve data migration. |
|



Continue to Conclusion
|
|
|
|
|  | |