学会発表(口頭発表・ポスター)

基本情報

氏名 畔津 忠博
氏名(カナ) アゼツ タダヒロ
氏名(英語) AZETSU Tadahiro

タイトル

Application of peripheral auditory model to speaker identification

会議名

Proc. of World Congress on Nature and Biologically Inspired Computing (NABIC2010), pp.673-678

主催者(学会名等)

NABIC2010

開催場所

Kitakyushu

開催年月日

2010/12/17

単独・共同の区分

共同

発表者

M. Abuku, T. Azetsu, E. Uchino, N. Suetake

記述言語

英語

会議種別

口頭発表(一般)

概要

This paper discusses an approach for speaker identification using the multi-dimensional pulse signals generated from a model of a peripheral auditory system. The model of the peripheral auditory system employed here consists of a basilar membrane, hair cells, and auditory nerves. The input to this model is a speech signal divided into frames, and the outputs from which are the multi-dimensional pulse signals for each framed signal. The feature vectors based on the post-stimulus time histogram (PSTH) of the pulse signals are used for the speaker identification. Also, in order to improve the accuracy of the speaker identification, the feature vector conversion, using the mean and the diagonal matrix of standard deviations, is performed. The experiments were conducted for each Japanese vowel spoken by 12 speakers (9 males and 3
females), and the speaker identification accuracy is evaluated by 5 hold leave 2 out cross-validation for each vowel. The effectiveness of the proposed method has been verified by comparing with the conventional LPC analysis.