Deep Neural Networks for Vocal Emotion Analysis Listener Response Prediction

Yang, Haiwang

doi:10.65102/is2026082

Research article

Ingegneria Sismica

Volume 43 Issue 1
Pages: 1-20

Author(s): Haiwang Yang¹

¹School of Educational Sciences, Zhaotong University, Zhaotong 657000, Yunnan Province, China

Published: 30/04/2026

Cite

Yang, Haiwang. “Deep Neural Networks for Vocal Emotion Analysis Listener Response Prediction.” Ingegneria Sismica Volume 43 Issue 1: 1-20, doi:10.65102/is2026082.

https://doi.org/10.65102/is2026082

Abstract

This paper addresses listener response prediction in vocal emotion analysis, targeting digital music dissemination and intelligent audio analysis scenarios, and constructs a deep neural network model for the task. After time-frequency feature extraction and preprocessing of vocal audio, the model uses a fully convolutional network to extract spatial information in the spectral domain, a bidirectional long short-term memory network to capture the temporal dependence of emotion across phrase progression, and a context attention fusion mechanism to adaptively weight key frequency bands, key frames, and cross-segment associations. This establishes a computational mapping between vocal expression and listener feedback. Experimental results show that the model reaches an accuracy of 91.8% and a macro-average F1 of 91.1% on the vocal emotion recognition task. In the listener response prediction task, the mean absolute errors of preference prediction and arousal prediction are reduced to 0.356 and 0.339, respectively. These results indicate that the proposed model consistently improves both the accuracy of emotion analysis and the prediction of listener feedback on complex vocal clips.

Keywords
Deep neural network; Vocal emotion analysis; Listener response prediction; Spatio-temporal feature fusion
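
The abstract reports three evaluation metrics: accuracy, macro-average F1, and mean absolute error (MAE). As a minimal sketch of how such figures are conventionally computed, the following Python uses the standard textbook definitions. The emotion labels and rating values are hypothetical toy data, not taken from the paper, and the paper's exact evaluation protocol may differ.

```python
def accuracy(y_true, y_pred):
    """Fraction of clips whose predicted emotion label matches the reference."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


def macro_f1(y_true, y_pred):
    """Unweighted mean of per-class F1 over every label seen in either list."""
    labels = sorted(set(y_true) | set(y_pred))
    scores = []
    for c in labels:
        tp = sum(t == c and p == c for t, p in zip(y_true, y_pred))
        fp = sum(t != c and p == c for t, p in zip(y_true, y_pred))
        fn = sum(t == c and p != c for t, p in zip(y_true, y_pred))
        denom = 2 * tp + fp + fn
        # F1 = 2TP / (2TP + FP + FN); defined as 0 when the class never occurs.
        scores.append(2 * tp / denom if denom else 0.0)
    return sum(scores) / len(scores)


def mean_absolute_error(y_true, y_pred):
    """Average absolute gap between reference and predicted listener ratings."""
    return sum(abs(t - p) for t, p in zip(y_true, y_pred)) / len(y_true)


# Hypothetical toy data (assumption): four clips with emotion labels,
# and two clips with listener ratings on an arbitrary numeric scale.
emotion_true = ["calm", "calm", "sad", "joy"]
emotion_pred = ["calm", "sad", "sad", "joy"]
rating_true = [0.5, 0.2]
rating_pred = [0.3, 0.6]

acc = accuracy(emotion_true, emotion_pred)
f1 = macro_f1(emotion_true, emotion_pred)
mae = mean_absolute_error(rating_true, rating_pred)
```

Macro averaging gives every emotion class equal weight regardless of how many clips it contains, so on an imbalanced dataset it can diverge from accuracy.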

Open Access