篇名 | Robust Target Speaker Tracking in Broadcast TV Streams |
---|---|
卷期 | 11:1 |
作者 | Bai, Junmei 、 Jiang, Hongchen 、 Zhang, Shilei 、 Zhang, Shuwu 、 Xu, Bo |
頁次 | 057-072 |
關鍵字 | Speaker Tracking 、 Audio Segmentation 、 Entropy 、 GMM 、 THCI Core |
出刊日期 | 200603 |
This paper addresses the problem of audio change detection and speaker tracking in broadcast TV streams. A two-pass audio change detection algorithm, which includes detection of the potential change boundaries and refinement, is proposed. Speaker tracking is performed based on the results of speaker change detection. In speaker tracking, Wiener filtering, endpoint detection of pitch, and segmental cepstral feature normalization are applied to obtain a more reliable result. The algorithm has low complexity. Our experiments show that the algorithm achieves
very satisfactory results.