Analysis of Differences in National Identity and International Outlook Among English, Vietnamese, and Thai Target-Language Students in Chinese Higher Education Based on Data Mining Techniques

Yi, Chunyan; Huang, Shan

doi:10.65102/is2026862

Research article

Ingegneria Sismica

Volume 43 Issue 2
Pages: 1
-16

Analysis of Differences in National Identity and International Outlook Among English, Vietnamese, and Thai Target-Language Students in Chinese Higher Education Based on Data Mining Techniques

Author(s): ^¹, ^²

¹School of Foreign Languages ,Guangxi Minzu Normal University, Chongzuo 532200, Guangxi, China

²Department of Development Planning, Guangxi Minzu Normal University, Chongzuo 532200, Guangxi, China

Published: 30/04/2026

Cite

Yi, Chunyan. and Huang, Shan. “Analysis of Differences in National Identity and International Outlook Among English, Vietnamese, and Thai Target-Language Students in Chinese Higher Education Based on Data Mining Techniques.” Ingegneria Sismica Volume 43 Issue 2: 1-16, doi:10.65102/is2026862.

https://doi.org/10.65102/is2026862

Abstract

This study constructs a target language difference mining model that integrates principal component analysis, XGBoost cluster identification, and multi-output regression. Based on six-dimensional feature encoding, the model performs structural characterization, category discrimination, and effect estimation for English, Vietnamese, and Thai learner groups.The model uses national identity and international outlook as joint outputs. Comparative results show that the Vietnamese group’s overall mean scores for national identity and international outlook were 4.97 and 4.86, respectively, higher than those of the English group (4.91 and 4.69) and the Thai group (4.90 and 4.71);significant differences were found in the cultural identity dimension (F=5.696, p=.004), and similarly in the dimension of attention to international/regional issues (F=3.778, p=.024).

Keywords
Data mining; Target language; National identity; International perspective; Group identification