XJIPC OpenIR > 多语种信息技术研究室 (Multilingual Information Technology Research Laboratory)
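Thesis Advisor: 李晓
Degree Grantor: 中国科学院大学 (University of Chinese Academy of Sciences)
Place of Conferral: 北京 (Beijing)
Degree Discipline: Computer Applications
Keywords: speech recognition; keyword recognition; audio segmentation; spoken word segmentation; acoustic features; speech corpus; keyword retrieval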
Other Abstract

Sensitive-word spotting in continuous broadcast news speech is an active research topic. Broadcast news data exhibit a wide range of acoustic variability, including speaking styles, dialects and accents, channel variation, and acoustic environments, which makes broadcasts a well-suited object for practical speech technology research. Addressing several key problems of broadcast news sensitive-word spotting, I present recent progress on improving the performance of sensitive-word spotting in Uyghur broadcast news continuous speech. First, an acoustic analysis of the 32 Uyghur phonemes is presented. To obtain phoneme-level information, a novel voiced/unvoiced speech classification is proposed. The core of the approach is a multivariate Gaussian distribution over five parameters that can be extracted by short-time analysis methods: the short-time log energy of the signal, the short-time zero-crossing rate of the signal per 10 ms interval, the short-time autocorrelation coefficient at unit sample delay, the first predictor coefficient of a pth-order linear predictor, and the normalized energy of the prediction error of a pth-order linear predictor. Second, two novel algorithms for Uyghur sensitive-word segmentation are proposed and implemented: one is mono-spaced word segmentation and the other is based on a Bayesian approach. A comparative analysis of the two word segmentation approaches and of their sensitive-word spotting results under the same conditions is then given. Third, we create a small sensitive-word speech corpus based on Uyghur broadcast news continuous speech and then design, in Matlab, a Uyghur broadcast news sensitive-word spotting system based on this corpus. At the same time, some useful tools for speech analysis and for updating the speech corpus are implemented. Optimization techniques are used to improve system performance in the Matlab code.
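
The five short-time parameters named in the abstract can be illustrated with a minimal sketch. This is not the thesis's Matlab implementation: the function names, the predictor order of 12, the use of a diagonal covariance in place of a full multivariate Gaussian, and the ratio form of the normalized prediction error are all assumptions made here to keep the example short and dependency-free. Each frame is assumed to hold 10 ms of samples.

```python
import math

def autocorr(frame, lag):
    """Raw (unnormalized) autocorrelation of one frame at a given lag."""
    return sum(frame[n] * frame[n - lag] for n in range(lag, len(frame)))

def levinson_durbin(r, order):
    """Autocorrelation-method LPC. Returns (a[1..order], residual energy),
    where the predictor is x_hat[n] = sum_j a[j] * x[n - j]."""
    a = [0.0] * (order + 1)
    err = r[0]
    for i in range(1, order + 1):
        if err <= 0.0:  # degenerate frame (e.g. pure silence): stop early
            break
        k = (r[i] - sum(a[j] * r[i - j] for j in range(1, i))) / err
        prev = a[:]
        a[i] = k
        for j in range(1, i):
            a[j] = prev[j] - k * prev[i - j]
        err *= 1.0 - k * k
    return a[1:], err

def vuv_features(frame, order=12, eps=1e-12):
    """Five parameters for one 10 ms frame (order=12 is an assumption)."""
    n = len(frame)
    r = [autocorr(frame, lag) for lag in range(order + 1)]
    log_energy = 10.0 * math.log10(eps + r[0] / n)           # short-time log energy (dB)
    zcr = sum(1 for i in range(1, n)
              if (frame[i] >= 0) != (frame[i - 1] >= 0))     # zero crossings per frame
    c1 = r[1] / (r[0] + eps)                                 # autocorrelation at unit delay
    a, err = levinson_durbin(r, order)
    norm_err = err / (r[0] + eps)                            # normalized prediction error
    return [log_energy, zcr, c1, a[0], norm_err]

def fit_diag_gaussian(vectors, var_floor=1e-3):
    """Per-class mean and variance (diagonal covariance: a simplification)."""
    m, d = len(vectors), len(vectors[0])
    mean = [sum(v[i] for v in vectors) / m for i in range(d)]
    var = [sum((v[i] - mean[i]) ** 2 for v in vectors) / m + var_floor
           for i in range(d)]
    return mean, var

def log_likelihood(x, model):
    """Log density of x under a diagonal Gaussian (mean, var)."""
    mean, var = model
    return -0.5 * sum(math.log(2 * math.pi * v) + (xi - mu) ** 2 / v
                      for xi, mu, v in zip(x, mean, var))

def classify(x, models):
    """Pick the class label whose Gaussian gives x the highest likelihood."""
    return max(models, key=lambda label: log_likelihood(x, models[label]))
```

In use, one would fit one model per class from labeled training frames, for example `models = {"voiced": fit_diag_gaussian(voiced_vectors), "unvoiced": fit_diag_gaussian(unvoiced_vectors)}`, and then call `classify(vuv_features(frame), models)` on each new frame. A full-covariance Gaussian would additionally capture correlations between the five parameters, at the cost of a matrix inverse per class.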

Document Type: Thesis
Recommended Citation
GB/T 7714
木合塔尔·沙地克. 维吾尔语广播新闻敏感词检索系统的研究[D]. 北京. 中国科学院大学,2013. (Title in English: Research on a Sensitive-Word Retrieval System for Uyghur Broadcast News)
Files in This Item:
File Name/Size: 维吾尔语广播新闻敏感词检索系统的研究.p (11823KB)
DocType: Thesis
Access: Open Access
License: CC BY-NC-SA
File name: 维吾尔语广播新闻敏感词检索系统的研究.pdf
Format: Adobe PDF

Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.