A compromise Arabic-Kazakh coded character processing method based on the OpenType font format
Dong, J (Dong, Jun); Jiang, TH (Jiang, Tonghai); Cheng, L (Cheng, Li); Anwar, A (Anwar, Azmat); Yang, Y (Yang, Yong); Jiang, TH
2018
Journal: COMPUTER STANDARDS & INTERFACES
Volume: 55, Issue: 1, Pages: 1-7
Abstract

Information systems for Arabic-Kazakh processing must handle the editing and display problems caused by four special vowels: (sic), (sic), (sic), and (sic). The current solution represents these four special vowels by combining four alternative vowels ((sic), (sic), (sic), and (sic)) with the character (sic). However, this approach relies on deliberate spelling errors and can leave computer programs unable to semantically distinguish the alternative vowels from the original vowels. It also causes problems in Arabic-Kazakh text-processing applications such as text sorting, script conversion, and speech synthesis. We propose a compromise method in which the four special vowels are represented by combinations of themselves with the character (sic), and the related editing and display problems are handled using an OpenType font. The relevant glyph layout features in the OpenType font format are compatible with the proposed compromise method. Results from the sorting and classification of 10,000 randomly selected common Arabic-Kazakh words demonstrate that the new method successfully avoids problems caused by letter replacement, including text sorting errors in 2,843 of the tested words and ambiguities with the characters (sic), (sic), (sic), and (sic) in 3,960 of the words.
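
The sorting problem described in the abstract can be illustrated with a minimal sketch. Everything in it is a hypothetical stand-in, not the paper's data or implementation: the actual Arabic-Kazakh characters are not recoverable from this record, so Latin placeholders are used, the collation order in `ALPHABET` is invented, the marker is assumed to sit directly before the vowel, and the words are not real words. The sketch only shows why storing an alternative vowel in place of a special vowel can change sort order, while storing the special vowel itself does not.

```python
# All symbols below are hypothetical placeholders, not real Arabic-Kazakh characters.
MARK = "'"  # stands in for the marker character combined with the vowels

# Assumed mapping: special vowel -> alternative vowel used by the replacement scheme.
SPECIAL_TO_ALT = {"E": "a", "O": "o", "U": "u", "I": "i"}

# Assumed collation order in which each special vowel sorts right after its alternative.
ALPHABET = "aEbdiIkloOtuU"
RANK = {ch: i for i, ch in enumerate(ALPHABET)}


def encode_replacement(word):
    """Replacement scheme: store MARK + alternative vowel instead of the special vowel."""
    return "".join(MARK + SPECIAL_TO_ALT[ch] if ch in SPECIAL_TO_ALT else ch for ch in word)


def encode_compromise(word):
    """Compromise scheme: store MARK + the special vowel itself."""
    return "".join(MARK + ch if ch in SPECIAL_TO_ALT else ch for ch in word)


def sort_key(encoded):
    """Sort by letter rank, skipping the marker (a simplifying assumption)."""
    return tuple(RANK[ch] for ch in encoded if ch != MARK)


words = ["bEb", "bat"]  # made-up words; "E" is a special vowel, "a" its alternative

intended = sorted(words, key=lambda w: tuple(RANK[c] for c in w))
by_replacement = sorted(words, key=lambda w: sort_key(encode_replacement(w)))
by_compromise = sorted(words, key=lambda w: sort_key(encode_compromise(w)))

print(intended)        # ['bat', 'bEb']
print(by_replacement)  # ['bEb', 'bat']  (the stored text no longer carries "E")
print(by_compromise)   # ['bat', 'bEb']
```

Under these assumptions the replacement encoding of "bEb" is indistinguishable, letter by letter, from a word spelled with the ordinary vowel, which is the kind of ambiguity the abstract reports. In the compromise scheme the stored text keeps the special vowel, and presentation is left to the font.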

Keywords: Kazakh, Coded Character, Unicode, OpenType
DOI: 10.1016/j.csi.2017.02.005
Indexed by: SCI
WOS accession number: WOS:000419411300001
Document type: Journal article
Item identifier: http://ir.xjipc.cas.cn/handle/365002/5113
Collection: Multilingual Information Technology Research Laboratory
Corresponding author: Jiang, TH
Author affiliations:
1. Chinese Acad Sci, Xinjiang Tech Inst Phys & Chem, Urumqi 830011, Peoples R China
2. Univ Chinese Acad Sci, Beijing 100049, Peoples R China
3. Xinjiang Lab Minor Speech & Language Informat Pro, Urumqi 830011, Peoples R China
4. Xinjiang Normal Univ, Coll Comp Sci, Urumqi 830054, Peoples R China
Recommended citation
GB/T 7714: Dong, J, Jiang, TH, Cheng, L, et al. A compromise Arabic-Kazakh coded character processing method based on the OpenType font format[J]. COMPUTER STANDARDS & INTERFACES, 2018, 55(1): 1-7.
APA: Dong, J, Jiang, TH, Cheng, L, Anwar, A, Yang, Y, & Jiang, TH. (2018). A compromise Arabic-Kazakh coded character processing method based on the OpenType font format. COMPUTER STANDARDS & INTERFACES, 55(1), 1-7.
MLA: Dong, J, et al. "A compromise Arabic-Kazakh coded character processing method based on the OpenType font format". COMPUTER STANDARDS & INTERFACES 55.1 (2018): 1-7.
Files in this item
File name/size: A compromise Arabic-(7528KB)
Document type: Journal article
Version: Author accepted manuscript
Access type: Open access
License: CC BY-NC-SA
File name: A compromise Arabic-Kazakh coded character processing method based on the OpenType font format.pdf
Format: Adobe PDF