XJIPC OpenIR  > 多语种信息技术研究室
A Phrase Table Filtering Model Based on Binary Classification for Uyghur-Chinese Machine Translation
Chenggang Mi; Yating Yang; Xi Zhou; Lei Wang; Xiao Li; Tursun, E.
2014
Source PublicationJournal of Computers
Volume9Issue:12Pages:2780-2786
Abstract

In statistical machine translation, large amount of unreasonable phrase pairs in a phrase table can affect the decoding efficiency and the overall translation performance, especially in Uyghur-Chinese machine translation. In this paper, we present a novel phrase table filtering model based on binary classification, which consider differences between Uyghur and Chinese, and draw lessons from binary classification in machine learning. In our model, four features are considered: 1) Difference in length between source and target phrase; 2) Proportion of translated words in phrase pairs; 3) Proportion of symbol words; 4) Average number of co-occurrence words in training corpus. We use this model to generate a filtered phrase table. Experimental results show that this new filtering model can improve the performance and efficiency of our current Uygur-Chinese machine translation system.

Document Type期刊论文
Identifierhttp://ir.xjipc.cas.cn/handle/365002/5193
Collection多语种信息技术研究室
AffiliationXinjiang Technical Institute of Physics and Chemistry of Chinese Academy of Sciences,Urumqi 830011, China;University of Chinese Academy of Sciences, Beijing 100049, China
Recommended Citation
GB/T 7714
Chenggang Mi,Yating Yang,Xi Zhou,et al. A Phrase Table Filtering Model Based on Binary Classification for Uyghur-Chinese Machine Translation[J]. Journal of Computers,2014,9(12):2780-2786.
APA Chenggang Mi,Yating Yang,Xi Zhou,Lei Wang,Xiao Li,&Tursun, E..(2014).A Phrase Table Filtering Model Based on Binary Classification for Uyghur-Chinese Machine Translation.Journal of Computers,9(12),2780-2786.
MLA Chenggang Mi,et al."A Phrase Table Filtering Model Based on Binary Classification for Uyghur-Chinese Machine Translation".Journal of Computers 9.12(2014):2780-2786.
Files in This Item:
File Name/Size DocType Version Access License
A Phrase Table Filte(513KB)期刊论文作者接受稿开放获取CC BY-NC-SAView Application Full Text
Related Services
Recommend this item
Bookmark
Usage statistics
Export to Endnote
Google Scholar
Similar articles in Google Scholar
[Chenggang Mi]'s Articles
[Yating Yang]'s Articles
[Xi Zhou]'s Articles
Baidu academic
Similar articles in Baidu academic
[Chenggang Mi]'s Articles
[Yating Yang]'s Articles
[Xi Zhou]'s Articles
Bing Scholar
Similar articles in Bing Scholar
[Chenggang Mi]'s Articles
[Yating Yang]'s Articles
[Xi Zhou]'s Articles
Terms of Use
No data!
Social Bookmark/Share
File name: A Phrase Table Filtering Model Based on Binary Classification for Uyghur-Chinese Machine Translation.pdf
Format: Adobe PDF
All comments (0)
No comment.
 

Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.