文章詳目資料

International Journal of Computational Linguistics And Chinese Language Processing THCI

  • 加入收藏
  • 下載文章
篇名 Speaker Identification Method Using Earth Mover’s Distance for CCC Speaker Recognition Evaluation 2006
卷期 12:3
作者 Shingo KuroiwaTsuge, SatoruKita, MasahikoRen, Fuji
頁次 239-254
關鍵字 Speaker IdentificationNon-ParametricEarth Mover’s DistanceVector QuantizationChinese Speech CorpusTHCI Core
出刊日期 200709

中文摘要

英文摘要

In this paper, we present a non-parametric speaker identification method using Earth Mover’s Distance (EMD) designed for text-indepedent speaker identification and its evaluation results for CCC Speaker Recognition Evaluation 2006, organized by the Chinese Corpus Consortium (CCC) for the th International Symposium on Chinese Spoken Language Processing (ISCSLP 2006). EMD based speaker identification (EMD-IR) was originally designed to be applied to a distributed speaker identification system, in which the feature vectors are compressed by vector quantization at a terminal and sent to a server that executes a pattern matching process. In this structure, we had to train speaker models using quantized data, then we utilized a non-parametric speaker model and EMD. From the experimental results on a Japanese speech corpus, EMD-IR showed higher robustness to the quantized data than the conventional GMM technique. Moreover, it achieved higher accuracy than GMM even if the data was not quantized. Hence, we have taken the challenge of CCC Speaker Recognition Evaluation 2006 using EMD-IR. Since the identification tasks defined in the evaluation were on an open-set basis, we introduce a new speaker verification module. Evaluation results show that EMD-IR achieves 99.3 % Identification Correctness Rate in a closed-channel speaker identification task.

相關文獻