文章詳目資料

International Journal of Computational Linguistics And Chinese Language Processing THCI

  • 加入收藏
  • 下載文章
篇名 Modeling Pronunciation Variation for Bi-Lingual Mandarin/Taiwanese Speech Recognition
卷期 10:3
作者 Lyu, Dau-chengLyu, Ren-yuanChiang, Yuang-chinHsu, Chun-nan
頁次 363-380
關鍵字 Bi-lingualOne-pass ASRPronunciation ModelingTHCI Core
出刊日期 200509

中文摘要

英文摘要

In this paper, a bi-lingual large vocaburary speech recognition experiment based on the idea of modeling pronunciation variations is described. The two languages under study are Mandarin Chinese and Taiwanese (Min-nan). These two languages are basically mutually unintelligible, and they have many words with the same Chinese characters and the same meanings, although they are pronounced differently. Observing the bi-lingual corpus, we found five types of pronunciation
variations for Chinese characters. A one-pass, three-layer recognizer was
developed that includes a combination of bi-lingual acoustic models, an integrated pronunciation model, and a tree-structure based searching net. The recognizer’s performance was evaluated under three different pronunciation models. The results showed that the character error rate with integrated pronunciation models was better than that with pronunciation models, using either the knowledge-based or the data-driven approach. The relative frequency ratio was also used as a measure to
choose the best number of pronunciation variations for each Chinese character. Finally, the best character error rates in Mandarin and Taiwanese testing sets were found to be 16.2% and 15.0%, respectively, when the average number of pronunciations for one Chinese character was 3.9.

相關文獻