HyRead Journal Taiwan Full-Text Database

Article Details

Journal of Computers (EI, MEDLINE, Scopus)

Natural Sciences / Information / Technology

Title	Robust Speaker Verification Based on Max Pooling of Sparse Representation
Volume/Issue	24:4
Authors	Wang, Wei; Han, Jiqing; Zheng, Tieran; Zheng, Guibin
Pages	056-065
Keywords	speaker verification, sparse representation, robust feature extraction, EI, MEDLINE, Scopus
Publication Date	2014-01

In the human nervous system, sensory inputs are coded sparsely: only a small number of neurons are active at any given time. Sparse coding is therefore a plausible model of the auditory cortex. In this paper, we propose a biologically inspired feature extraction method for speaker verification based on sparse coding. When speech data are encoded with a sparse coding model, the learned dictionary has characteristics similar to the simple-cell receptive fields of auditory neurons, and the sparse coding coefficients simulate the responses of auditory cortex neurons. Moreover, a dictionary is learned from each speaker's training samples, so it carries more speaker-specific information and helps discriminate between speakers with fewer dictionary atoms. Motivated by the human auditory masking effect, a neuron that performs a max pooling operation on its pooled inputs responds to the strongest input and inhibits the weaker ones. This strategy for representing natural sounds improves the robustness of the proposed method. Experimental results show that the proposed method outperforms the baseline system on two typical corpora.
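
The pipeline outlined in the abstract (sparse-code each speech frame against a dictionary, then max-pool the coefficients so that only the strongest response per atom survives) can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not say which sparse coder or pooling window is used, so the sketch assumes a plain matching-pursuit coder, a hand-written toy dictionary in place of a learned per-speaker one, and pooling over all frames of an utterance. The names `matching_pursuit`, `max_pool`, `atoms`, and `frames` are hypothetical.

```python
import math

def normalize(v):
    """Scale a vector to unit Euclidean norm (atoms are assumed unit-norm)."""
    n = math.sqrt(sum(x * x for x in v))
    return [x / n for x in v]

def matching_pursuit(frame, atoms, n_nonzero):
    """Greedy sparse code: select at most n_nonzero atoms to explain the frame.

    Assumption: matching pursuit stands in for whatever sparse coder the
    paper actually uses.
    """
    residual = list(frame)
    coeffs = [0.0] * len(atoms)
    for _ in range(n_nonzero):
        # Correlation of the current residual with every atom.
        corr = [sum(r * a for r, a in zip(residual, atom)) for atom in atoms]
        k = max(range(len(atoms)), key=lambda i: abs(corr[i]))
        if abs(corr[k]) < 1e-12:
            break  # nothing left to explain
        coeffs[k] += corr[k]
        residual = [r - corr[k] * a for r, a in zip(residual, atoms[k])]
    return coeffs

def max_pool(codes):
    """Per atom, keep the strongest absolute response over all frames."""
    return [max(abs(c[j]) for c in codes) for j in range(len(codes[0]))]

# Toy stand-in for a learned dictionary: 3 unit-norm atoms in 4 dimensions.
atoms = [normalize(a) for a in ([1, 0, 0, 0], [0, 1, 1, 0], [0, 0, 1, 1])]
# Toy stand-in for per-frame speech features.
frames = [[2.0, 0.0, 0.0, 0.0], [0.0, 3.0, 3.0, 0.0], [0.5, 0.0, 0.0, 0.0]]

codes = [matching_pursuit(f, atoms, n_nonzero=1) for f in frames]
feature = max_pool(codes)  # one fixed-length vector per utterance
print(feature)
```

In this toy run the weak third frame activates the same atom as the first frame, and pooling keeps only the stronger of the two responses, which mirrors the masking behaviour the abstract describes.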
