学会発表(口頭発表・ポスター)

基本情報

氏名 畔津 忠博
氏名(カナ) アゼツ タダヒロ
氏名(英語) AZETSU Tadahiro

タイトル

Speaker identification in noisy environment with use of the precise model of the human auditory system

会議名

Proc. of the International MultiConference of Engineers and Computer Scientists (IMECS2012), pp.92-95

主催者(学会名等)

IMECS2012

開催場所

Hong Kong

開催年月日

2012/03/16

単独・共同の区分

共同

発表者

T. Azetsu, M. Abuku, N. Suetake, E. Uchino

記述言語

英語

会議種別

口頭発表(一般)

概要

This paper discusses an approach for speaker identification in noisy environment using the multi-dimensional
pulse signals generated from the model of a human peripheral auditory system. The peripheral auditory model employed here consists of a basilar membrane, hair cells, and auditory nerves. The input to this model is a speech signal divided into frames, and the outputs of which are the multi-dimensional pulse signals for each framed signal. The feature vectors based on the poststimulus time histogram (PSTH) of the pulse signals are used for the speaker identification. In this paper, we propose to set adaptively the threshold of the action potential for pulse generation in the auditory nerve model. In order to verify the performance of noise immunity for the speaker identification, the experiments were conducted for each Japanese vowel spoken by 12 speakers (9 males and 3 females). The effectiveness of using the peripheral auditory model has been verified
by comparing with the methods using the conventional LPC spectrum and using the excitation patterns.