篇名 | 中英文語碼轉換語音合成系統開發 |
---|---|
卷期 | 27:2 |
並列篇名 | Development of Mandarin-English Code-switching Speech Synthesis System |
作者 | 練欣柔 、 黃立宇 、 陳嘉平 |
頁次 | 047-056 |
關鍵字 | 語音合成 、 語碼轉換 、 資料前處理 、 Speech Synthesize 、 Codeswitching 、 Text Preprocessing 、 THCI Core |
出刊日期 | 202212 |
本論文提出中英文語碼轉換語音合成系統。為了使系統可專注於學習不同語言間的內容,利用已統一語者風格的多語言人工資料集進行訓練。之後在合成器中加入語言向量,以增加系統對多語言的掌握。此外對輸入的中、英文分別進行不同的前處理,將中文進行斷詞且轉為漢語拼音,藉此增加語音的自然度,且減輕學習時的複雜度,也透過數字正規化判斷句子中的阿拉伯數字,是否需要加上數字單位。英文部份則對複雜的頭字語進行讀音判斷與轉換。
In this paper, the Mandarin-English codeswitching speech synthesis system has been proposed. To focus on learning the content information between two languages, the training dataset is multilingual artificial dataset whose speaker style is unified. Adding language embedding into the system helps it be more adaptive to multilingual dataset. Besides, text preprocessing is applied and be used in different way which depends on the languages. Word segmentation and text-to-pinyin are the text preprocessing for Mandarin, which not only improves the fluency but also reduces the learning complexity. Number normalization decides whether the arabic numerals in sentence needs to add the digits. The preprocessing for English is acronym conversion which decides the pronunciation of acronym