R language was used to analyze the blood metabolites and hypertension of textile workers in NHANES

Lyu, Gefan; Duan, Sufen; Gao, Zhixiang

doi:10.65102/is2026673

Research article

Ingegneria Sismica

Volume 43 Issue 2
Pages: 1
-22

R language was used to analyze the blood metabolites and hypertension of textile workers in NHANES

Author(s): ^¹, ^², ^¹

¹School of Public Health, Inner Mongolia Medical University; Hohhot, Inner Mongolia, 010110, China

²Department of Mathematics, Zibo Normal College, Zibo, 255130, Shandong, China

Published: 30/04/2026

Cite

Lyu, Gefan., Duan, Sufen., and Gao, Zhixiang. “R language was used to analyze the blood metabolites and hypertension of textile workers in NHANES.” Ingegneria Sismica Volume 43 Issue 2: 1-22, doi:10.65102/is2026673.

https://doi.org/10.65102/is2026673

Abstract

Hypertension risk identification has strong practical significance for occupational health monitoring and cardiovascular early warning. Based on the NHANES database, this paper screens the samples of textile practitioners, and constructs an R language computational analysis framework around the association between blood metabolites and hypertension. In the data processing stage, occupational information matching, missing value imputation, outlier correction, variable standardization, hypertension label coding and class imbalance correction were completed. In the feature recognition stage, a three-stage process of “correlation constraint-sparse projection-stable screening” is used to realize high-dimensional metabolite compression and recombination. In the classification stage, a double hidden layer discriminative model is constructed, and logistic regression, random forest and gradient boosting models are set as controls. The results showed that 1234 samples were included, of which 456 cases were hypertension group, accounting for 36.9%. After feature optimization, the accuracy of the optimal model is 86.2%, the AUC is 91.4%, and the F1 value is 83.6%, which is better than that of the unoptimized model. The results show that the proposed method can enhance the classification and discrimination ability of hypertension while maintaining computational efficiency, and provide a reusable data modeling path for occupational health risk identification of textile practitioners. At the same time, it can provide computational support for occupational sample screening stratification and risk early warning.

Povzetek: Na podlagi baze NHANES je bila izbrana skupina tekstilnih delavcev za vzpostavitev postopka prepoznavanja hipertenzije s krvnimi metaboliti v jeziku R. Postopek je vključeval imputacijo manjkajočih vrednosti, kodiranje oznak, uravnoteženje razredov, izbiro značilk in klasifikacijsko modeliranje. V analizo je bilo vključenih 1234 primerov; delež hipertenzivne skupine je bil 36,9 %, nehipertenzivne pa 63,1 %. Najboljši model je dosegel natančnost 86,2 %, AUC 91,4 % in F1 83,6 %, kar je bilo boljše od neoptimiziranega modela. Metoda kaže dobro klasifikacijsko sposobnost in podpira prepoznavanje tveganja za hipertenzijo pri tekstilnih delavcih｡

Keywords
NHANES; Textile practitioners; Blood metabolites; Hypertension recognition; The R Language

Research article
https://doi.org/10.65102/is2026791

Northeast revitalization strategy background below...

Volume 43 Issue 2
Pages: 1
-22
30/04/2026

^¹

¹Jilin Animation Institute Jai School of Comics, Jilin, Changchun 130012, China

Research article
https://doi.org/10.65102/is2026790

Research on Visualization method of Data Asset Val...

Volume 43 Issue 2
Pages: 1
-23
30/04/2026

^¹, ^¹, ^², ^³, ^¹, ^¹

¹Big Data Business Center, Hubei Engineering Research Center for Intelligent Digital Technology in New Power System (Hubei Central China Technology Development of Electric Power Co., Ltd), Wuhan 430070, Hubei, China

²Digital Work Department, State Grid Hubei Electric Power Co., Ltd., Wuhan 430077, Hubei, China

³Big Data Center, Information and Communication Branch of State Grid Hubei Electric Power Co., Ltd., Wuhan 430077, Hubei, China