A Phrase Table Filtering Model Based on Binary Classification for Uyghur-Chinese Machine Translation
Chenggang Mi; Yating Yang; Xi Zhou; Lei Wang; Xiao Li; Tursun, E.
2014
Journal: Journal of Computers
Volume: 9, Issue: 12, Pages: 2780-2786
Abstract

In statistical machine translation, a large number of unreasonable phrase pairs in the phrase table can reduce decoding efficiency and overall translation quality, especially in Uyghur-Chinese machine translation. In this paper, we present a novel phrase table filtering model based on binary classification, which takes the differences between Uyghur and Chinese into account and draws on binary classification from machine learning. The model uses four features: 1) the difference in length between the source and target phrases; 2) the proportion of translated words in a phrase pair; 3) the proportion of symbol words; and 4) the average number of co-occurring words in the training corpus. We use this model to generate a filtered phrase table. Experimental results show that the new filtering model improves both the performance and the efficiency of our current Uyghur-Chinese machine translation system.
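
The four features and the keep/drop decision named in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not define the features precisely, so the exact formulas below, the `lexicon` and `cooc` lookup tables, the function names, and the hand-set `weights` and `bias` are all assumptions made for illustration. In practice the decision function would be learned from labeled phrase pairs rather than set by hand.

```python
from typing import Dict, List, Set, Tuple

PhrasePair = Tuple[List[str], List[str]]  # (source tokens, target tokens)


def is_symbol(token: str) -> bool:
    """Treat a token as a symbol word when it has no letter or digit."""
    return not any(ch.isalnum() for ch in token)


def extract_features(
    src: List[str],
    tgt: List[str],
    lexicon: Dict[str, Set[str]],  # assumed: source word -> known translations
    cooc: Dict[str, Set[str]],     # assumed: source word -> co-occurring target words
) -> List[float]:
    """Return one possible reading of the four features listed in the abstract."""
    tgt_set = set(tgt)
    tokens = src + tgt
    # 1) Difference in length between source and target phrase.
    length_diff = float(abs(len(src) - len(tgt)))
    # 2) Proportion of source words with a lexicon translation in the target.
    translated = sum(1 for w in src if lexicon.get(w, set()) & tgt_set)
    translated_ratio = translated / len(src) if src else 0.0
    # 3) Proportion of symbol words among all tokens of the pair.
    symbol_ratio = sum(is_symbol(t) for t in tokens) / len(tokens) if tokens else 0.0
    # 4) Average number of target words that co-occur with each source word.
    cooc_total = sum(len(cooc.get(w, set()) & tgt_set) for w in src)
    cooc_avg = cooc_total / len(src) if src else 0.0
    return [length_diff, translated_ratio, symbol_ratio, cooc_avg]


def keep_pair(features: List[float], weights: List[float], bias: float) -> bool:
    """Linear binary decision: keep the pair when the score is positive."""
    return bias + sum(w * f for w, f in zip(weights, features)) > 0.0


def filter_phrase_table(
    pairs: List[PhrasePair],
    lexicon: Dict[str, Set[str]],
    cooc: Dict[str, Set[str]],
    weights: List[float],
    bias: float,
) -> List[PhrasePair]:
    """Keep only the phrase pairs that the classifier accepts."""
    return [
        (s, t)
        for s, t in pairs
        if keep_pair(extract_features(s, t, lexicon, cooc), weights, bias)
    ]


# Toy data with placeholder tokens; the weights are hypothetical, not learned.
lexicon = {"s1": {"t1"}, "s2": {"t2"}}
cooc = {"s1": {"t1", "t2"}, "s2": {"t2"}}
good = (["s1", "s2"], ["t1", "t2"])
bad = (["s1", ","], ["t2", "!", "?", "."])
weights, bias = [-0.5, 2.0, -2.0, 0.5], -0.5
filtered = filter_phrase_table([good, bad], lexicon, cooc, weights, bias)
```

With these toy values, `filtered` retains only `good`: the second pair is penalized for its length mismatch, its lack of lexicon matches, and its high share of symbol tokens.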

Document type: Journal article
Item identifier: http://ir.xjipc.cas.cn/handle/365002/5193
Collection: Multilingual Information Technology Research Laboratory
Affiliations: Xinjiang Technical Institute of Physics and Chemistry of Chinese Academy of Sciences, Urumqi 830011, China; University of Chinese Academy of Sciences, Beijing 100049, China
Recommended citation
GB/T 7714
Chenggang Mi, Yating Yang, Xi Zhou, et al. A Phrase Table Filtering Model Based on Binary Classification for Uyghur-Chinese Machine Translation[J]. Journal of Computers, 2014, 9(12): 2780-2786.
APA Chenggang Mi, Yating Yang, Xi Zhou, Lei Wang, Xiao Li, & Tursun, E. (2014). A Phrase Table Filtering Model Based on Binary Classification for Uyghur-Chinese Machine Translation. Journal of Computers, 9(12), 2780-2786.
MLA Chenggang Mi, et al. "A Phrase Table Filtering Model Based on Binary Classification for Uyghur-Chinese Machine Translation". Journal of Computers 9.12 (2014): 2780-2786.
Files in this item
File name/size: A Phrase Table Filte (513KB)
Document type: Journal article
Version: Author accepted manuscript
Access: Open access
License: CC BY-NC-SA
File name: A Phrase Table Filtering Model Based on Binary Classification for Uyghur-Chinese Machine Translation.pdf
Format: Adobe PDF