Outline

Ingegneria Sismica

Ingegneria Sismica

R language was used to analyze the blood metabolites and hypertension of textile workers in NHANES

Author(s): Gefan Lyu1, Sufen Duan2, Zhixiang Gao1
1School of Public Health, Inner Mongolia Medical University; Hohhot, Inner Mongolia, 010110, China
2Department of Mathematics, Zibo Normal College, Zibo, 255130, Shandong, China
Lyu, Gefan., Duan, Sufen., and Gao, Zhixiang. “R language was used to analyze the blood metabolites and hypertension of textile workers in NHANES.” Ingegneria Sismica Volume 43 Issue 2: 1-22, doi:10.65102/is2026673.

Abstract

Hypertension risk identification has strong practical significance for occupational health monitoring and cardiovascular early warning. Based on the NHANES database, this paper screens the samples of textile practitioners, and constructs an R language computational analysis framework around the association between blood metabolites and hypertension. In the data processing stage, occupational information matching, missing value imputation, outlier correction, variable standardization, hypertension label coding and class imbalance correction were completed. In the feature recognition stage, a three-stage process of “correlation constraint-sparse projection-stable screening” is used to realize high-dimensional metabolite compression and recombination. In the classification stage, a double hidden layer discriminative model is constructed, and logistic regression, random forest and gradient boosting models are set as controls. The results showed that 1234 samples were included, of which 456 cases were hypertension group, accounting for 36.9%. After feature optimization, the accuracy of the optimal model is 86.2%, the AUC is 91.4%, and the F1 value is 83.6%, which is better than that of the unoptimized model. The results show that the proposed method can enhance the classification and discrimination ability of hypertension while maintaining computational efficiency, and provide a reusable data modeling path for occupational health risk identification of textile practitioners. At the same time, it can provide computational support for occupational sample screening stratification and risk early warning.

Povzetek: Na podlagi baze NHANES je bila izbrana skupina tekstilnih delavcev za vzpostavitev postopka prepoznavanja hipertenzije s krvnimi metaboliti v jeziku R. Postopek je vključeval imputacijo manjkajočih vrednosti, kodiranje oznak, uravnoteženje razredov, izbiro značilk in klasifikacijsko modeliranje. V analizo je bilo vključenih 1234 primerov; delež hipertenzivne skupine je bil 36,9 %, nehipertenzivne pa 63,1 %. Najboljši model je dosegel natančnost 86,2 %, AUC 91,4 % in F1 83,6 %, kar je bilo boljše od neoptimiziranega modela. Metoda kaže dobro klasifikacijsko sposobnost in podpira prepoznavanje tveganja za hipertenzijo pri tekstilnih delavcih。

Keywords
NHANES; Textile practitioners; Blood metabolites; Hypertension recognition; The R Language

Related Articles

Liying Wang1
1Jilin Animation Institute Jai School of Comics, Jilin, Changchun 130012, China
Yingbo Wu1, Lijuan Chen1, Shan Yang2, Jian Zhang3, Yafei Mao1, Lian Peng1
1Big Data Business Center, Hubei Engineering Research Center for Intelligent Digital Technology in New Power System (Hubei Central China Technology Development of Electric Power Co., Ltd), Wuhan 430070, Hubei, China
2Digital Work Department, State Grid Hubei Electric Power Co., Ltd., Wuhan 430077, Hubei, China
3Big Data Center, Information and Communication Branch of State Grid Hubei Electric Power Co., Ltd., Wuhan 430077, Hubei, China
Xihan Gong1, Haonan Cui2
1School of Marxism, Northeastern University, Shenyang, 110169, Liaoning, China
2School of Information Engineering, Shenyang Institute of Science and Technology, Shenyang, 110167, Liaoning, China
Lu Zhong1
1Yantai Nanshan University, Yantai, Shandong, China, 265713
Huanyong Zhang1, Ruzhu Jiang1
1School of business, Jiangnan University, Wuxi214122, Shaanxi, China