< Terug naar vorige pagina

Publicatie

Speaker age estimation using i-vectors

Tijdschriftbijdrage - Tijdschriftartikel

In this paper, a new approach for age estimation from speech signals based on i-vectors is proposed. In this method, each utterance is modeled by its corresponding i-vector. Then, a Within-Class Covariance Normalization technique is used for session variability compensation. Finally, a least squares support vector regression (LSSVR) is applied to estimate the age of speakers. The proposed method is trained and tested on telephone conversations of the National Institute for Standard and Technology (NIST) 2010 and 2008 speaker recognition evaluation databases. Evaluation results show that the proposed method yields significantly lower mean absolute error and higher Pearson correlation coefficient between chronological speaker age and estimated speaker age compared to different conventional schemes. The obtained relative improvements of mean absolute error and correlation coefficient compared to our best baseline system are around 5% and 2% respectively. Finally, the effect of some major factors influencing the proposed age estimation system, namely utterance length and spoken language are analyzed. © 2014 Elsevier Ltd.
Tijdschrift: Engineering Applications of Artificial Intelligence
ISSN: 0952-1976
Volume: 34
Pagina's: 99 - 108
Jaar van publicatie:2014
BOF-keylabel:ja
IOF-keylabel:ja
BOF-publication weight:2
CSS-citation score:2
Auteurs:International
Authors from:Higher Education
Toegankelijkheid:Open