文章詳目資料

International Journal of Computational Linguistics And Chinese Language Processing THCI

  • 加入收藏
  • 下載文章
篇名 The NTNU Taiwanese ASR System for Formosa Speech Recognition Challenge 2020
卷期 26:1
作者 Fu-An ChaoTien-Hong LoShi-Yan WengShih-Hsuan ChiuYao-Ting SungBerlin Chen
頁次 001-016
關鍵字 Formosa Speech Recognition ChallengeDeep LearningTransfer LearningSemi-supervised TrainingTHCI Core
出刊日期 202106

中文摘要

英文摘要

This paper describes the NTNU ASR system participating in the Formosa Speech Recognition Challenge 2020 (FSR-2020) supported by the Formosa Speech in the Wild project (FSW). FSR-2020 aims at fostering the development of Taiwanese speech recognition. Apart from the issues on tonal and dialectical variations of the Taiwanese language, speech artificially contaminated with different types of real-world noise also has to be dealt with in the final test stage; all of these make FSR-2020 much more challenging than before. To work around the under-resourced issue, the main technical aspects of our ASR system include various deep learning techniques, such as transfer learning, semi-supervised learning, front-end speech enhancement and model ensemble, as well as data cleansing and data augmentation conducted on the training data. With the best configuration, our system obtains 13.1 % syllable error rate (SER) on the final-test set, achieving the first place among all participating systems on Track 3.

相關文獻