Volume 12, Number 3 (September 2016)                   IJEEE 2016, 12(3): 197-205 | Back to browse issues page



DOI: 10.22068/IJEEE.12.3.197

XML Print


Download citation:
BibTeX | RIS | EndNote | Medlars | ProCite | Reference Manager | RefWorks
Send citation to:

Bashirpour M, Geravanchizadeh M. Speech Emotion Recognition Based on Power Normalized Cepstral Coefficients in Noisy Conditions. IJEEE. 2016; 12 (3) :197-205
URL: http://ijeee.iust.ac.ir/article-1-962-en.html

Abstract:   (1126 Views)

Automatic recognition of speech emotional states in noisy conditions has become an important research topic in the emotional speech recognition area, in recent years. This paper considers the recognition of emotional states via speech in real environments. For this task, we employ the power normalized cepstral coefficients (PNCC) in a speech emotion recognition system. We investigate its performance in emotion recognition using clean and noisy speech materials and compare it with the performances of the well-known MFCC, LPCC, RASTA-PLP, and also TEMFCC features. Speech samples are extracted from the Berlin emotional speech database (Emo DB) and Persian emotional speech database (Persian ESD) which are corrupted with 4 different noise types under various SNR levels. The experiments are conducted in clean train/noisy test scenarios to simulate practical conditions with noise sources. Simulation results show that higher recognition rates are achieved for PNCC as compared with the conventional features under noisy conditions.

Full-Text [PDF 344 kb]   (665 Downloads)    
Type of Study: Research Paper | Subject: Speech Processing
Received: 2016/06/21 | Accepted: 2016/10/15 | Published: 2016/10/15

Add your comments about this article : Your username or email:
Write the security code in the box