LSTM Networks Optimize English Speech Recognition Accuracy

Liu, Jingwen; Li, Yannan

doi:10.65102/is2026491

Research article

Ingegneria Sismica

Volume 43 Issue 1
Pages: 1
-21

LSTM Networks Optimize English Speech Recognition Accuracy

Author(s): ^¹, ^¹

¹School of Humanities and Education, Jinan Preschool Education College, Jinan 250000, Shandong, China

Published: 30/04/2026

Cite

Liu, Jingwen. and Li, Yannan. “LSTM Networks Optimize English Speech Recognition Accuracy.” Ingegneria Sismica Volume 43 Issue 1: 1-21, doi:10.65102/is2026491.

https://doi.org/10.65102/is2026491

Abstract

In order to solve the problems of insufficient context dependence modeling, high recognition error of long sentences and limited decoding stability in English continuous speech recognition, this paper proposes a recognition accuracy optimization method based on LSTM network. From the perspective of computer implementation, this study constructs an integrated recognition process of “speech preprocessing-feature representation-time series modeling-sequence decoding”. After completing pre-emphasis, framing and window-adding, log Mel spectrum extraction and feature normalization, bidirectional LSTM is used to jointly model the context of speech sequence. CTC beam search and language model re-ranking are combined to improve the consistency and readability of the output text. At the same time, joint optimization is carried out around the number of hidden units, the number of network layers, the learning rate, the Dropout rate and the decoding parameters to enhance the adaptability of the model under different speaking rates and sentence lengths. Experimental results show that the word error rate of the optimized LSTM model is reduced to 5.7%, the character error rate is 3.1%, and the sentence recognition accuracy is 84.6% on the English speech test set. The overall performance of the optimized LSTM model is better than DNN, RNN and unoptimized LSTM model. The results show that the LSTM network has strong temporal expression advantages in English speech recognition tasks, and can effectively improve the recognition accuracy and operation stability of the system after combining reasonable parameter adjustment and decoding strategy.

Keywords
LSTM network; English speech recognition; Temporal modeling; Recognition accuracy Optimization

Research article
https://doi.org/10.65102/is20261300

Research on high-quality image super-resolution re...

Volume 43 Issue 3
Pages: 1
-21
08/07/2026

^¹,², ^¹,², ^¹

¹Hainan Vocational University of Science and Technology, Haikou 571126, China

²Institute for Mathematical Research, Universiti Putra Malaysia, Serdang 43400, Malaysia

Research article
https://doi.org/10.65102/is20261301

Multi-scale Dual Transformer based Multi long-term...

Volume 43 Issue 3
Pages: 1
-18
08/07/2026

^¹,², ^¹,², ^¹

¹Hainan Vocational University of Science and Technology, Haikou 571126, China

²Institute for Mathematical Research, Universiti Putra Malaysia, Serdang 43400, Malaysia

Research article
https://doi.org/10.65102/is20261299

Ultra-Short-Term Wind Power Forecasting Based on V...

Volume 43 Issue 3
Pages: 1
-15
08/07/2026

^¹, ^², ^¹, ^¹, ^¹

¹Electric Power Research Institute, State Grid Shanxi Electric Power Co., Ltd., Taiyuan, 030001, Shanxi, China

²Jincheng Power Supply Branch, State Grid Shanxi Electric Power Co., Ltd., Jincheng, 048000, Shanxi, China

Research article
https://doi.org/10.65102/is20261298

Integration of Traditional Culture Elements and Co...

Volume 43 Issue 3
Pages: 1
-12
01/07/2026

^¹,²

¹China Academy of Cultural Heritage, Chaoyang District, 100029, Beijing, China

²Beijing University of Civil Engineering and Architecture, Xicheng District, 100044, Beijing, China

Research article
https://doi.org/10.65102/is20261297

Quantitative Evaluation Model of the Policy Effect...

Volume 43 Issue 3
Pages: 1
-15
01/07/2026

^¹, ^², ^¹

¹School of Digital Media, Shenzhen Polytechnic University, Shenzhen 518055, Guangdong, China

²Postdoctoral Mobile Station of Journalism and communication, Fudan University, Shanghai 200433, Shanghai, China

Outline

Ingegneria Sismica

LSTM Networks Optimize English Speech Recognition Accuracy

Abstract

Related Articles

Research on high-quality image super-resolution re...

Multi-scale Dual Transformer based Multi long-term...

Ultra-Short-Term Wind Power Forecasting Based on V...

Integration of Traditional Culture Elements and Co...

Quantitative Evaluation Model of the Policy Effect...

Open Access