篇名 | MATBN: A Mandarin Chinese Broadcast News Corpus |
---|---|
卷期 | 10:2 |
作者 | Wang, Hsin-min 、 Chen, Berlin 、 Kuo, Jen-wei 、 Cheng, Shih-sian |
頁次 | 219-235 |
關鍵字 | broadcast news 、 Mandarin Chinese 、 speech recognition 、 corpus 、 transcription 、 annotation 、 THCI Core |
出刊日期 | 200506 |
The MATBN Mandarin Chinese broadcast news corpus contains a total of 198
hours of broadcast news from the Public Television Service Foundation (Taiwan) with corresponding transcripts. The primary purpose of this collection is to provide training and testing data for continuous speech recognition evaluation in the broadcast news domain. In this paper, we briefly introduce the speech corpus and report on some preliminary statistical analysis and speech recognition evaluation results.