Outline

Ingegneria Sismica

Ingegneria Sismica

An Intelligent Comparative Study of Multilingual Corpora of English in ASEAN Driven by Neural Machine Translation

Author(s): Jianghong Kuang1, Haopeng Song2, Yanfu Zhang1
1School of Languages, Cultures and International Education, Liuzhou Institute of Technology Liuzhou 545000, Guangxi, China
2School of General Education and Pedagogy, Guangzhou Huashang Vocational College, Guangzhou 511300, Guangdong, China
Kuang, Jianghong ., Song, Haopeng., and Zhang, Yanfu. “An Intelligent Comparative Study of Multilingual Corpora of English in ASEAN Driven by Neural Machine Translation.” Ingegneria Sismica Volume 43 Issue 1: 1-21, doi:10.65102/is2026087.

Abstract

In the context of the continuous growth in cross-border digital communication and multilingual information processing demands, the modeling and difference identification of ASEAN English multilingual corpora have become an important topic in intelligent translation research. This paper constructs a technical framework that integrates corpus collection, language identification, sentence-level alignment, subword segmentation, Transformer modeling, and intelligent comparative analysis. It introduces multi-head self-attention, difference scoring function, vector representation learning, and feature fusion mechanisms, and uses joint loss to achieve collaborative optimization of translation generation and difference discrimination. Experiments show that the model in this paper achieves a BLEU score of 39.63, a TER of 0.347, a semantic similarity of 0.879, and a difference identification accuracy of 92.3%; compared with Transformer, BLEU increases by 7.3% and the accuracy improves by 3.6 percentage points. This method can effectively reveal the translation differences between ASEAN English and Thai, Vietnamese, Indonesian, and Malay, and has practical significance for regional English variant computing research and intelligent translation analysis.

Povzetek: This study focuses on the intelligent comparison of multilingual corpora of ASEAN English, and constructs a neural machine translation framework that integrates Transformer semantic modeling, difference scoring, feature fusion, and joint optimization. Based on experiments with four types of corpora, the model’s BLEU score reaches 39.63, TER drops to 0.347, and the accuracy of difference identification reaches 92.3%, which can well reveal the differences in cross-language vocabulary, structure, and semantics.

Keywords
Neural Machine Translation; ASEAN English;Multilingual Corpus; Intelligent Comparison;Corpus-based Translation Studies

Related Articles

Huiqiao Liu1
1Yinchuan University of Energy, Ningxia, 750000, China
Xin Zhao1, Yan Li1, Xiangyang Cao1, Qiushuang Li1, Jianing Zhang1
1State Grid Shandong Electric Power Company Economic and Technological Research Institute ShanDong JiNan 250001, China
Dan Yang1
1School of Marxism, Suzhou Polytechnic University, Suzhou, 215104, China
Liuhang Shen1, Xiangwen Sun1
1Ulster college at Shaanxi University of Science &Technology, Xi’an,710021, Shaanxi, China