Research on Out-of-Vocabulary Word Recognition in Uyghur-Chinese Machine Translation (维汉机器翻译未登录词识别研究)
Mi Chenggang (米成刚); Wang Lei (王磊); Yang Yating (杨雅婷); Chen Kehai (陈科海)
2013
Source Publication: 计算机应用研究 (Application Research of Computers)
ISSN: 1001-3695
Volume: 30, Issue: 4, Pages: 239-241
Abstract

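To address the large number of out-of-vocabulary words in Uyghur-Chinese statistical machine translation and the scarcity of Uyghur language resources, this paper proposes a string-similarity-based model for recognizing out-of-vocabulary words in Uyghur-Chinese machine translation, drawing on Uyghur word-formation features and corresponding string similarity algorithms. The model computes the similarity between each untranslated Uyghur word and the entries of the phrase table and an external dictionary, then takes the Chinese translation of the most similar phrase as the final translation of the out-of-vocabulary word. Experiments show that, compared with an out-of-vocabulary word recognition method based on stem segmentation, the model better preserves Uyghur word information and improves translation quality.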

Other Abstract

Uyghur-Chinese machine translation produces many out-of-vocabulary (OOV) words, and Uyghur language resources are scarce. Combining the features of Uyghur with string similarity algorithms, the paper presents an OOV word recognition model for Uyghur-Chinese machine translation based on string similarity. Using the phrase table of a phrase-based model and an external dictionary, the model computes the string similarity between each OOV word and the Uyghur words in the phrase table and dictionary, and takes the translation of the most similar Uyghur word. Experiments show that, compared with an OOV word recognition method based on word segmentation, this model better retains word information and improves translation quality.
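The lookup described in the abstract can be sketched as follows. This is a minimal illustration and not the paper's implementation: the abstract does not give the exact similarity formula, so the sketch assumes a length-normalized Levenshtein (edit) distance. The names `edit_distance`, `similarity`, `translate_oov`, the `min_similarity` cutoff, and the toy lexicon entries are all hypothetical and chosen only for illustration.

```python
from typing import Dict, Optional


def edit_distance(a: str, b: str) -> int:
    """Levenshtein distance computed with a two-row dynamic program."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # delete ca
                            curr[j - 1] + 1,      # insert cb
                            prev[j - 1] + cost))  # substitute or match
        prev = curr
    return prev[-1]


def similarity(a: str, b: str) -> float:
    """Assumed similarity: 1 - distance / length of the longer string."""
    longest = max(len(a), len(b))
    return 1.0 if longest == 0 else 1.0 - edit_distance(a, b) / longest


def translate_oov(oov: str, lexicon: Dict[str, str],
                  min_similarity: float = 0.5) -> Optional[str]:
    """Return the translation of the most similar source entry.

    `lexicon` stands in for the merged phrase table and external
    dictionary. `min_similarity` is an assumed cutoff (not from the
    paper) that rejects matches too weak to trust.
    """
    best_source, best_score = None, 0.0
    for source in lexicon:
        score = similarity(oov, source)
        if score > best_score:
            best_source, best_score = source, score
    if best_source is None or best_score < min_similarity:
        return None
    return lexicon[best_source]


# Toy entries in Latin-script Uyghur, for illustration only.
TOY_LEXICON = {"kitab": "书", "mektep": "学校", "oqughuchi": "学生"}

if __name__ == "__main__":
    # "kitabim" (my book) is absent from the lexicon; "kitab" is closest.
    print(translate_oov("kitabim", TOY_LEXICON))
```

Matching the whole inflected form against known entries, rather than first cutting it down to a stem, is what lets this kind of lookup keep the surface word intact, which is the advantage the abstract claims over segmentation-based recognition.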

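Keyword: Uyghur-Chinese machine translation; phrase table; string similarity algorithm; out-of-vocabulary word; word segmentation; edit distance
Indexed By: CSCD
CSCD ID: CSCD:4802617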
Citation statistics
Cited Times: 5 [CSCD]
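Document Type: Journal article
Identifier: http://ir.xjipc.cas.cn/handle/365002/2449
Collection: 多语种信息技术研究室 (Multilingual Information Technology Research Laboratory)
Affiliation: Xinjiang Technical Institute of Physics and Chemistry, Chinese Academy of Sciences; University of Chinese Academy of Sciences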
Recommended Citation
GB/T 7714
米成刚,王磊,杨雅婷,等. 维汉机器翻译未登录词识别研究[J]. 计算机应用研究,2013,30(4):239-241.
APA 米成刚,王磊,杨雅婷,&陈科海.(2013).维汉机器翻译未登录词识别研究.计算机应用研究,30(4),239-241.
MLA 米成刚,et al."维汉机器翻译未登录词识别研究".计算机应用研究 30.4(2013):239-241.
Files in This Item:
File Name/Size | DocType | Version | Access | License
维汉机器翻译未登录词识别研究.pdf (766KB) | Journal article | Author's accepted manuscript | Open access | CC BY-NC-SA

Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.