文章詳目資料

電腦與通訊

  • 加入收藏
  • 下載文章
篇名 室內長距離語音辨識技術挑戰與初探
卷期 164
並列篇名 Challenges and Preliminary Study on Indoor Distant Speech Recognition
作者 廖憲正郭志忠林政賢
頁次 042-054
關鍵字 自動語音辨識深度神經網路語音人機介面
出刊日期 201512

中文摘要

長距離語音辨識受到收音裝置、室內空間響應、語者說話位置與方位、以及環境噪音等因素 影響,本文針對各個因素進行解析,並嘗試提出解決之方法以及進行初步的實驗。實驗結果顯示 長距離影響了如鼻音與塞擦音等子音語音訊號,使得該類型語音之辨識與驗證正確率大幅下降。 在加入長距離語音語料進行調適後,可提升語音辨識正確率約10%。而以深度神經網路為基礎之 語音模型在加入長距離語料後, 更可以得到約60%的音節辨識正確增加率。

英文摘要

Distant speech recognition accuracy is highly correlated with types of recording devices, room acoustics, speakers’ location and orientation, and environmental noises. This article analyzed causes which decrease distant speech recognition accuracy and tried to propose possible solutions with preliminary experiments. The results showed that the recognition and verification accuracy of consonants, like nasal and affricate, decreased significantly as distance increased. After model adaptation using our distant speech corpus, the recognition accuracy was improved by 10%. There was even about 60% accuracy improvement rate when we used deep neural network as acoustic models trained with the distant speech corpus.

相關文獻