Research on dynamic regulation strategy optimization of supply chain greenwashing behavior based on reinforcement learning

Peng, Qintao; Chen, Fan

doi:10.65102/is20261054

Research article

Ingegneria Sismica

Volume 43 Issue 3
Pages: 1
-23

Research on dynamic regulation strategy optimization of supply chain greenwashing behavior based on reinforcement learning

Author(s): ^¹,², ^³

¹College of Economics and Management, China Three Gorges University, Yichang 443002, Hubei, China

²College of Economics and Management, Jingchu University of Technology, Jingmen 448000, Hubei, China

³School of Artificial Intelligence, Jingchu University of Technology, Jingmen 448000, Hubei, China

Published: 10/06/2026

Cite

Peng, Qintao. and Chen, Fan . “Research on dynamic regulation strategy optimization of supply chain greenwashing behavior based on reinforcement learning.” Ingegneria Sismica Volume 43 Issue 3: 1-23, doi:10.65102/is20261054.

https://doi.org/10.65102/is20261054

Abstract

In order to solve the problems of static identification lag, insufficient matching of regulatory actions and insufficient utilization of feedback in the regulation of supply chain greenwashing behavior, this paper constructs a dynamic regulation strategy optimization model based on reinforcement learning. The model takes the consistency of green declaration, performance deviation, certification change, text anomaly and historical feedback as the status input, sets up supervision actions such as prompt description, data review, key spot check, credit constraint and continuous tracking, and comprehensively restricts risk reduction, resource consumption and misjudgment loss through the reward function. The experiment was carried out based on 1260 supply chain subjects, 85420 structured records and 18670 text disclosure samples. The model was trained for 500 rounds, and compared with Logistic regression, SVM, random forest, XGBoost and static DQN. The results show that the Accuracy of the model in this paper reaches 93.6%, Macro-F1 reaches 91.8%, the high-risk recall rate reaches 92.4%, the invalid resource consumption rate is reduced to 13.8%, and the average response cycle is shortened to 2.4 working days. The research results show that the proposed model can improve the identification accuracy of greenwashing risk and the adaptation ability of dynamic supervision actions, and provide a computable optimization path for the intelligent supervision of supply chain greenwashing behavior.

Keywords
Reinforcement learning; Supply chain management; Greenwashing behavior; Dynamic supervision strategy

Research article
https://doi.org/10.65102/is20261302

Visual analysis of related hotspots affecting Diab...

Volume 43 Issue 3
Pages: 1
-18
08/07/2026

^¹, ^¹

¹Guangzhou University of Chinese Medicine, School of Pharmaceutical Medicine, Guangzhou,Guangdong,China,510006

Research article
https://doi.org/10.65102/is20261300

Research on high-quality image super-resolution re...

Volume 43 Issue 3
Pages: 1
-21
08/07/2026

^¹,², ^¹,², ^¹

¹Hainan Vocational University of Science and Technology, Haikou 571126, China

²Institute for Mathematical Research, Universiti Putra Malaysia, Serdang 43400, Malaysia

Research article
https://doi.org/10.65102/is20261301

Multi-scale Dual Transformer based Multi long-term...

Volume 43 Issue 3
Pages: 1
-18
08/07/2026

^¹,², ^¹,², ^¹

¹Hainan Vocational University of Science and Technology, Haikou 571126, China

²Institute for Mathematical Research, Universiti Putra Malaysia, Serdang 43400, Malaysia

Research article
https://doi.org/10.65102/is20261299

Ultra-Short-Term Wind Power Forecasting Based on V...

Volume 43 Issue 3
Pages: 1
-15
08/07/2026

^¹, ^², ^¹, ^¹, ^¹

¹Electric Power Research Institute, State Grid Shanxi Electric Power Co., Ltd., Taiyuan, 030001, Shanxi, China

²Jincheng Power Supply Branch, State Grid Shanxi Electric Power Co., Ltd., Jincheng, 048000, Shanxi, China

Research article
https://doi.org/10.65102/is20261298

Integration of Traditional Culture Elements and Co...

Volume 43 Issue 3
Pages: 1
-12
01/07/2026

^¹,²

¹China Academy of Cultural Heritage, Chaoyang District, 100029, Beijing, China

²Beijing University of Civil Engineering and Architecture, Xicheng District, 100044, Beijing, China

Outline

Ingegneria Sismica

Research on dynamic regulation strategy optimization of supply chain greenwashing behavior based on reinforcement learning

Abstract

Related Articles

Visual analysis of related hotspots affecting Diab...

Research on high-quality image super-resolution re...

Multi-scale Dual Transformer based Multi long-term...

Ultra-Short-Term Wind Power Forecasting Based on V...

Integration of Traditional Culture Elements and Co...

Open Access