Article Details

International Journal of Computational Linguistics And Chinese Language Processing THCI

Title Robust Target Speaker Tracking in Broadcast TV Streams
Volume/Issue 11:1
Authors Bai, Junmei; Jiang, Hongchen; Zhang, Shilei; Zhang, Shuwu; Xu, Bo
Pages 057-072
Keywords Speaker Tracking; Audio Segmentation; Entropy; GMM; THCI Core
Publication Date 2006-03

Chinese Abstract

English Abstract

This paper addresses the problem of audio change detection and speaker tracking in broadcast TV streams. A two-pass audio change detection algorithm, which includes detection of the potential change boundaries and refinement, is proposed. Speaker tracking is performed based on the results of speaker change detection. In speaker tracking, Wiener filtering, endpoint detection of pitch, and segmental cepstral feature normalization are applied to obtain a more reliable result. The algorithm has low complexity. Our experiments show that the algorithm achieves very satisfactory results.

Related Literature