文章詳目資料

International Journal of Computational Linguistics And Chinese Language Processing THCI

  • 加入收藏
  • 下載文章
篇名 Automatic Segmentation and Labeling for Mandarin Chinese Speech Corpora for Concatenation-based TTS
卷期 10:2
作者 Lin, Cheng-yuanJang, Jyh-shing RogerChen, Kuan-ting
頁次 145-166
關鍵字 speech assessment methods phonetic alphabetspeech corpussequential forward selectionleave-one-outk-nearest neighbor rulespeaker-adapted modelcontext-dependent hidden Markov model THCI Core
出刊日期 200506

中文摘要

英文摘要

Precise phone/syllable boundary labeling of the utterances in a speech corpus plays an important role in constructing a corpus-based TTS (text-to-speech) system. However, automatic labeling based on Viterbi forced alignment does not always produce satisfactory results. Moreover, a suitable labeling method for one language does not necessarily produce desirable results for another language. Hence in this paper, we propose a new procedure for refining the boundaries of utterances in a Mandarin speech corpus. This procedure employs different sets of acoustic features
for four different phonetic categories. In addition, a new scheme is proposed to deal with the “periodic voiced + periodic voiced” case, which produced most of the segmentation errors in our experiment. Several experiments were conducted to demonstrate the feasibility of the proposed approach.

相關文獻