جلد 12، شماره 3 - ( 6-1395 )                   جلد 12 شماره 3 صفحات 197-205 | برگشت به فهرست نسخه ها



DOI: 10.22068/IJEEE.12.3.197

XML Print


Download citation:
BibTeX | RIS | EndNote | Medlars | ProCite | Reference Manager | RefWorks
Send citation to:

Bashirpour M, Geravanchizadeh M. Speech Emotion Recognition Based on Power Normalized Cepstral Coefficients in Noisy Conditions. IJEEE. 2016; 12 (3) :197-205
URL: http://ijeee.iust.ac.ir/article-1-962-fa.html
Speech Emotion Recognition Based on Power Normalized Cepstral Coefficients in Noisy Conditions. . 1395; 12 (3) :197-205

URL: http://ijeee.iust.ac.ir/article-1-962-fa.html


چکیده:   (1133 مشاهده)

Automatic recognition of speech emotional states in noisy conditions has become an important research topic in the emotional speech recognition area, in recent years. This paper considers the recognition of emotional states via speech in real environments. For this task, we employ the power normalized cepstral coefficients (PNCC) in a speech emotion recognition system. We investigate its performance in emotion recognition using clean and noisy speech materials and compare it with the performances of the well-known MFCC, LPCC, RASTA-PLP, and also TEMFCC features. Speech samples are extracted from the Berlin emotional speech database (Emo DB) and Persian emotional speech database (Persian ESD) which are corrupted with 4 different noise types under various SNR levels. The experiments are conducted in clean train/noisy test scenarios to simulate practical conditions with noise sources. Simulation results show that higher recognition rates are achieved for PNCC as compared with the conventional features under noisy conditions.

متن کامل [PDF 344 kb]   (671 دریافت)    
نوع مطالعه: Research Paper | موضوع مقاله: 5-Speech Processing
دریافت: ۱۳۹۵/۴/۱ | پذیرش: ۱۳۹۵/۷/۲۴ | انتشار: ۱۳۹۵/۷/۲۴

ارسال نظر درباره این مقاله : نام کاربری یا پست الکترونیک شما:
کد امنیتی را در کادر بنویسید