Article Details

International Journal of Computational Linguistics And Chinese Language Processing THCI

Title: Automatic Pronunciation Assessment for Mandarin Chinese: Approaches and System Overview
Volume/Issue: 12:4
Authors: Chen, Jiang-chun; Jang, Jyh-shing Roger; Tsai, Te-lu
Pages: 443-458
Keywords: CAPT, CALL; Speech Recognition; Tone Recognition; Speech Assessment; Phoneme; Downhill Simplex Method; Mandarin Chinese; GMM; Intensity; Rhythm; Forced Alignment; THCI Core
Publication date: 2007-12

English Abstract

This paper presents the algorithms used in a prototype software system for automatic pronunciation assessment of Mandarin Chinese. The system uses HMM (hidden Markov model) forced alignment to locate each syllable and obtain its log probability, which is converted into a phoneme score through a ranking-based confidence measure. The pitch vector of each syllable is then sent to a GMM (Gaussian mixture model) for tone recognition and assessment. We also compute similarity scores for intensity and rhythm between the target and test utterances. All four scores (phoneme, tone, intensity, and rhythm) are parametric functions with free parameters, and the overall scoring function is formulated as a linear combination of them. Since the overall scoring function involves both linear and nonlinear parameters, we employ downhill simplex search to fine-tune these parameters so that the system's scores approximate those given by a human expert. The experimental results demonstrate that the system gives consistent scores that are close to a human's subjective evaluation.
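The parameter-tuning step described in the abstract can be illustrated with a minimal Python sketch. Everything specific here is an assumption made for illustration, not the paper's actual formulation: the component score form `100 / (1 + |a| * d)`, the distance inputs, the toy data, and the simplified downhill simplex (Nelder-Mead) routine. The sketch only shows the general idea of fitting linear weights and nonlinear shape parameters together against human scores.

```python
# Hypothetical sketch: tune a linear combination of four parametric scores
# with a simplified downhill simplex (Nelder-Mead) search.
# The score form and the data are illustrative assumptions, not from the paper.

# Each row: (phoneme, tone, intensity, rhythm) distances, then a human score.
# Synthetic toy values, invented for this example only.
DATA = [
    ((0.2, 0.1, 0.3, 0.2), 88.0),
    ((0.9, 0.5, 0.4, 0.6), 62.0),
    ((1.5, 1.2, 0.8, 1.0), 45.0),
    ((0.4, 0.8, 0.2, 0.3), 74.0),
    ((2.0, 1.5, 1.1, 1.4), 35.0),
    ((0.1, 0.2, 0.1, 0.1), 93.0),
]

def overall_score(params, distances):
    """Linear combination of four assumed parametric component scores."""
    shapes, weights = params[:4], params[4:]
    # abs() keeps each denominator >= 1, so the score is always defined.
    parts = [100.0 / (1.0 + abs(a) * d) for a, d in zip(shapes, distances)]
    return sum(w * s for w, s in zip(weights, parts))

def loss(params):
    """Mean squared error between system scores and human scores."""
    errors = [(overall_score(params, d) - h) ** 2 for d, h in DATA]
    return sum(errors) / len(errors)

def nelder_mead(f, x0, step=0.1, max_iter=2000, tol=1e-10):
    """Simplified downhill simplex: reflect, expand, contract, shrink."""
    n = len(x0)
    simplex = [list(x0)]
    for i in range(n):  # offset x0 along each axis to build the simplex
        p = list(x0)
        p[i] += step
        simplex.append(p)
    values = [f(p) for p in simplex]
    for _ in range(max_iter):
        order = sorted(range(n + 1), key=lambda i: values[i])
        simplex = [simplex[i] for i in order]
        values = [values[i] for i in order]
        if abs(values[-1] - values[0]) < tol:
            break
        worst = simplex[-1]
        centroid = [sum(p[j] for p in simplex[:-1]) / n for j in range(n)]
        reflect = [c + (c - w) for c, w in zip(centroid, worst)]
        f_r = f(reflect)
        if f_r < values[0]:
            # Reflection beat the best vertex: try going further.
            expand = [c + 2.0 * (c - w) for c, w in zip(centroid, worst)]
            f_e = f(expand)
            if f_e < f_r:
                simplex[-1], values[-1] = expand, f_e
            else:
                simplex[-1], values[-1] = reflect, f_r
        elif f_r < values[-2]:
            simplex[-1], values[-1] = reflect, f_r
        else:
            # Contract the worst vertex toward the centroid.
            contract = [c + 0.5 * (w - c) for c, w in zip(centroid, worst)]
            f_c = f(contract)
            if f_c < values[-1]:
                simplex[-1], values[-1] = contract, f_c
            else:
                # Shrink every vertex toward the best one.
                best = simplex[0]
                simplex = [best] + [
                    [b + 0.5 * (x - b) for b, x in zip(best, p)]
                    for p in simplex[1:]
                ]
                values = [values[0]] + [f(p) for p in simplex[1:]]
    best_i = min(range(n + 1), key=lambda i: values[i])
    return simplex[best_i], values[best_i]

# Four nonlinear shape parameters followed by four linear weights.
x0 = [1.0, 1.0, 1.0, 1.0, 0.25, 0.25, 0.25, 0.25]
initial_loss = loss(x0)
best_params, final_loss = nelder_mead(loss, x0)
print("MSE before tuning:", initial_loss)
print("MSE after tuning: ", final_loss)
```

A derivative-free search is a natural fit for this kind of objective because the shape parameters enter the score nonlinearly and no gradient is needed. In practice a library routine such as SciPy's `minimize` with `method="Nelder-Mead"` could replace the hand-written search.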
