XJIPC OpenIR  > 多语种信息技术研究室
Alternative TitleRecognition of Chinese Loan Words in Uyghur Based on String Similarity
米成刚; 杨雅婷; 周喜; 李晓; 杨明忠
Source Publication中文信息学报


Other Abstract

There are many Out-Of-Vocabulary words in Uyghur-Chinese machine translation, a large part of them are loan words (including person names, place names, et.al). This paper presents a novel method that recognition the Chinese loan words in Uyghur according to the feature that one loan word pronounce similar with its original word. This method training the existing corpus first, and getting the Uyghur Latin rules that use to recognize Chinese loan word in Uyghur; this paper Latin the Uyghur words according to the rules, Romanization of Chinese words, these transform the sounds similarity to strings similarity which is easy to quantification; proposed three models: Position-related Minimum Edit Distance model, Weighted Common Subsequence model and the fusion model that fused above two with parameters. The experimental results show that the fusion model considering strings' global similarity and local similarity, so it gets the best recognition results.

Keyword借词 未登录词 发音相似度 字符串相似度
Indexed ByCSCD
Citation statistics
Cited Times:1[CSCD]   [CSCD Record]
Document Type期刊论文
Recommended Citation
GB/T 7714
米成刚,杨雅婷,周喜,等. 基于字符串相似度的维吾尔语中汉语借词识别[J]. 中文信息学报,2013,27(5):173-178+190.
APA 米成刚,杨雅婷,周喜,李晓,&杨明忠.(2013).基于字符串相似度的维吾尔语中汉语借词识别.中文信息学报,27(5),173-178+190.
MLA 米成刚,et al."基于字符串相似度的维吾尔语中汉语借词识别".中文信息学报 27.5(2013):173-178+190.
Files in This Item:
File Name/Size DocType Version Access License
基于字符串相似度的维吾尔语中汉语借词识别(715KB)期刊论文作者接受稿开放获取CC BY-NC-SAView Application Full Text
Related Services
Recommend this item
Usage statistics
Export to Endnote
Google Scholar
Similar articles in Google Scholar
[米成刚]'s Articles
[杨雅婷]'s Articles
[周喜]'s Articles
Baidu academic
Similar articles in Baidu academic
[米成刚]'s Articles
[杨雅婷]'s Articles
[周喜]'s Articles
Bing Scholar
Similar articles in Bing Scholar
[米成刚]'s Articles
[杨雅婷]'s Articles
[周喜]'s Articles
Terms of Use
No data!
Social Bookmark/Share
File name: 基于字符串相似度的维吾尔语中汉语借词识别.pdf
Format: Adobe PDF
All comments (0)
No comment.

Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.