文章詳目資料

電腦與通訊

  • 加入收藏
  • 下載文章
篇名 基於權重式有限狀態機之中文大詞彙連續語音辨識介紹與中文語音辨識之挑戰
卷期 145
並列篇名 Introduction to Weighted Finite-state Transducer Based Mandarin Large VocabularyContinuous Speech Recognition and Unique Challenges From Mandarin Chinese
作者 林奇嶽邱祺添涂家章
頁次 21-28
關鍵字 大詞彙連續語音辨識權重式有限狀態機梅爾頻率倒頻譜係數感知線性預測改良式聶氏平滑化中文Large Vocabulary Continuous Speech RecognitionLVCSRWeighted Finite-state TransducerWFSTMel-frequency Cepstral CoefficientsMFCCPerceptual Linear PredictionPLPModified Kneser-Ney SmoothingMandarin
出刊日期 201206

中文摘要

大詞彙連續語音辨識為語音辨識技術的極致,以其為基礎搭配語言理解與對話管理,能夠發展出許多功能性和趣味性的智慧型應用和服務,提升人機互動的便利性。本文將介紹權重式有限狀態機中文大詞彙連續語音辨識系統幾項必要的功能模組,內容涵蓋聲學特徵參數萃取、聲學模型訓練演算法、語言模型訓練演算法、以及解譯器架構。除此之外,尚會提及GALE計劃裡中文大詞彙連續語音辨識系統的效能,以及中文辨識所要面對的特殊問題和挑戰。

英文摘要

Large vocabulary continuous speech recognition (LVCSR) is the ultimate goal of speech recognition.When incorporated with other techniques such as language understanding and spoken dialogue management,a variety of speech-based intelligent applications and services are realized to enrich interaction betweenhuman and machine. This article gives readers a brief introduction to modules in weighted finite-statetransducer (WFST) based LVCSR system, including acoustic feature extraction, acoustic model trainingalgorithms, language model training algorithms, and decoder architecture. In addition, recent performance ofMandarin LVCSR systems in GALE project and some issues specific to Mandarin are addressed.

相關文獻