XJIPC OpenIR  > 多语种信息技术研究室
A compromise Arabic-Kazakh coded character processing method based on the OpenType font format
Dong, J (Dong, Jun); Jiang, TH (Jiang, Tonghai); Cheng, L (Cheng, Li); Anwar, A (Anwar, Azmat); Yang, Y (Yang, Yong)
2018
Source PublicationCOMPUTER STANDARDS & INTERFACES
ISSN0920-5489
Volume55Issue:1Pages:1-7
Abstract

Information systems for Arabic-Kazakh processing must handle the editing and display problems caused by four special vowels: (sic), (sic), (sic) and (sic) The current solution uses combinations of four alternative vowels ((sic), (sic), (sic), and (sic)) with the character (sic) to represent these four special vowels. However, this approach relies on deliberate spelling errors and can cause computer programs to be unable to semantically distinguish the alternative vowels from the original vowels. Moreover, this causes problems in Arabic-Kazakh text-processing applications such as text sorting, script conversion and speech synthesis. We propose a compromise method in which the four special vowels are represented by combinations of themselves with the character (sic) and the related editing and display problems are handled using an OpenType font. The relevant glyph layout features in the OpenType font format are compatible with the proposed compromise method. Results from the sorting and classification of 10,000 randomly selected common Arabic-Kazakh words demonstrate that the new method successfully avoids problems caused by letter replacement, including text sorting errors in 2843 of the tested words and ambiguities with the characters (sic), (sic), (sic), and (sic) in 3960 of the words.

KeywordKazakh Coded Character Unicode Opentype
DOI10.1016/j.csi.2017.02.005
Indexed BySCI
WOS IDWOS:000419411300001
Citation statistics
Document Type期刊论文
Identifierhttp://ir.xjipc.cas.cn/handle/365002/5113
Collection多语种信息技术研究室
Corresponding AuthorJiang, TH (Jiang, Tonghai)
Affiliation1.Chinese Acad Sci, Xinjiang Tech Inst Phys & Chem, Urumqi 830011, Peoples R China
2.Univ Chinese Acad Sci, Beijing 100049, Peoples R China
3.Xinjiang Lab Minor Speech & Language Informat Pro, Urumqi 830011, Peoples R China
4.Xinjiang Normal Univ, Coll Comp Sci, Urumqi 830054, Peoples R China
Recommended Citation
GB/T 7714
Dong, J ,Jiang, TH ,Cheng, L ,et al. A compromise Arabic-Kazakh coded character processing method based on the OpenType font format[J]. COMPUTER STANDARDS & INTERFACES,2018,55(1):1-7.
APA Dong, J ,Jiang, TH ,Cheng, L ,Anwar, A ,&Yang, Y .(2018).A compromise Arabic-Kazakh coded character processing method based on the OpenType font format.COMPUTER STANDARDS & INTERFACES,55(1),1-7.
MLA Dong, J ,et al."A compromise Arabic-Kazakh coded character processing method based on the OpenType font format".COMPUTER STANDARDS & INTERFACES 55.1(2018):1-7.
Files in This Item:
File Name/Size DocType Version Access License
A compromise Arabic-(7528KB)期刊论文作者接受稿开放获取CC BY-NC-SAView Application Full Text
Related Services
Recommend this item
Bookmark
Usage statistics
Export to Endnote
Google Scholar
Similar articles in Google Scholar
[Dong, J (Dong, Jun)]'s Articles
[Jiang, TH (Jiang, Tonghai)]'s Articles
[Cheng, L (Cheng, Li)]'s Articles
Baidu academic
Similar articles in Baidu academic
[Dong, J (Dong, Jun)]'s Articles
[Jiang, TH (Jiang, Tonghai)]'s Articles
[Cheng, L (Cheng, Li)]'s Articles
Bing Scholar
Similar articles in Bing Scholar
[Dong, J (Dong, Jun)]'s Articles
[Jiang, TH (Jiang, Tonghai)]'s Articles
[Cheng, L (Cheng, Li)]'s Articles
Terms of Use
No data!
Social Bookmark/Share
File name: A compromise Arabic-Kazakh coded character processing method based on the OpenType font format.pdf
Format: Adobe PDF
All comments (0)
No comment.
 

Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.