Co-occurrence degree based word alignment: A case study on Uyghur-Chinese
Mi, Chenggang [1]; Yang, Yating [1]; Zhou, Xi [1]; Li, Xiao [1]; Osman, Turghun [1]
2014
Journal: Lecture Notes in Computer Science
Volume: 8801, Issue: 1, Pages: 259-268
Abstract: Most widely used word alignment models are based on word co-occurrence counts in a parallel corpus. However, because of data sparseness during training, co-occurrence counts from a Uyghur-Chinese parallel corpus cannot effectively indicate associations between source and target words. In this paper, we propose a Uyghur-Chinese word alignment method based on word co-occurrence degree to alleviate the data sparseness problem. Our approach combines co-occurrence counts and fuzzy co-occurrence weights into a word co-occurrence degree. The fuzzy co-occurrence weights are obtained by searching for fuzzy co-occurrence word pairs and computing the length differences between the current Uyghur word and the other Uyghur words in those pairs. Experiments show that the co-occurrence degree based word alignment model outperforms the baseline word alignment model and that the quality of Uyghur-Chinese machine translation also improves.
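The abstract names the ingredients of the co-occurrence degree but does not give its formula, so the following is only a minimal sketch under stated assumptions, not the authors' method. It assumes that two Uyghur words form a "fuzzy" pair when they share a prefix of at least `min_prefix` characters, and that each fuzzy pair contributes its count scaled by `1 / (1 + length difference)`. The function names, the prefix test, the weighting, and the toy romanized tokens are all hypothetical.

```python
from collections import Counter


def cooccurrence_counts(parallel_corpus):
    """Count how often each (source, target) word pair shares a sentence pair."""
    counts = Counter()
    for src_sentence, tgt_sentence in parallel_corpus:
        for s in set(src_sentence):
            for t in set(tgt_sentence):
                counts[(s, t)] += 1
    return counts


def is_fuzzy_match(word, other, min_prefix=4):
    """Assumed test: two distinct words share their first min_prefix characters."""
    return (
        word != other
        and len(word) >= min_prefix
        and len(other) >= min_prefix
        and word[:min_prefix] == other[:min_prefix]
    )


def cooccurrence_degree(word, target, counts, min_prefix=4):
    """Exact count plus assumed length-difference-weighted fuzzy counts."""
    degree = float(counts[(word, target)])
    for (other, t), c in counts.items():
        if t == target and is_fuzzy_match(word, other, min_prefix):
            # Assumed weighting: closer word lengths contribute more.
            degree += c / (1 + abs(len(word) - len(other)))
    return degree


# Toy data for illustration only (not from the paper).
corpus = [
    (["kitab"], ["书"]),
    (["kitablar"], ["书"]),
    (["kitablar"], ["书"]),
]
counts = cooccurrence_counts(corpus)
print(cooccurrence_degree("kitab", "书", counts))
```

In this toy corpus the rarer form borrows evidence from its more frequent inflected variant, which is the kind of sparseness relief the abstract describes for an agglutinative language.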
Keywords: Uyghur-Chinese Word Alignment; Co-occurrence Degree; Co-occurrence Count; Agglutinative Language; Data Sparseness
Indexed by: EI
Document type: Journal article
Item identifier: http://ir.xjipc.cas.cn/handle/365002/4915
Collection: Multilingual Information Technology Laboratory (多语种信息技术研究室)
Affiliations: 1. Xinjiang Technical Institute of Physics & Chemistry, Chinese Academy of Sciences, Urumqi, Xinjiang, China
2. University of Chinese Academy of Sciences, Beijing, China
Recommended citation
GB/T 7714: Mi, Chenggang, Yang, Yating, Zhou, Xi, et al. Co-occurrence degree based word alignment: A case study on Uyghur-Chinese[J]. Lecture Notes in Computer Science, 2014, 8801(1): 259-268.
APA: Mi, Chenggang, Yang, Yating, Zhou, Xi, Li, Xiao, & Osman, Turghun. (2014). Co-occurrence degree based word alignment: A case study on Uyghur-Chinese. Lecture Notes in Computer Science, 8801(1), 259-268.
MLA: Mi, Chenggang, et al. "Co-occurrence degree based word alignment: A case study on Uyghur-Chinese." Lecture Notes in Computer Science 8801.1 (2014): 259-268.
Files in this item
File name/size: Co-occurrence degree (444KB)
Document type: Journal article
Version: Author accepted manuscript
Access: Open access
License: CC BY-NC-SA
File name: Co-occurrence degree based word alignment A case study on Uyghur-Chinese.pdf
Format: Adobe PDF
Unless otherwise stated, all content in this system is protected by copyright, with all rights reserved.