Ingegneria Sismica

Research on dynamic regulation strategy optimization of supply chain greenwashing behavior based on reinforcement learning

Author(s): Qintao Peng1,2, Fan Chen3
1College of Economics and Management, China Three Gorges University, Yichang 443002, Hubei, China
2College of Economics and Management, Jingchu University of Technology, Jingmen 448000, Hubei, China
3School of Artificial Intelligence, Jingchu University of Technology, Jingmen 448000, Hubei, China
Peng, Qintao, and Chen, Fan. “Research on dynamic regulation strategy optimization of supply chain greenwashing behavior based on reinforcement learning.” Ingegneria Sismica, Volume 43, Issue 3: 1-23, doi:10.65102/is20261054.

Abstract

To address three problems in regulating supply chain greenwashing (lagging static identification, poorly matched regulatory actions, and underused feedback), this paper constructs a dynamic regulation strategy optimization model based on reinforcement learning. The model takes green declaration consistency, performance deviation, certification change, text anomaly, and historical feedback as its state input; defines supervision actions such as prompt description, data review, key spot check, credit constraint, and continuous tracking; and uses the reward function to jointly weigh risk reduction, resource consumption, and misjudgment loss. Experiments were carried out on 1,260 supply chain subjects, 85,420 structured records, and 18,670 text disclosure samples. The model was trained for 500 rounds and compared with logistic regression, SVM, random forest, XGBoost, and static DQN. The proposed model reaches an accuracy of 93.6%, a Macro-F1 of 91.8%, and a high-risk recall of 92.4%, while the invalid resource consumption rate falls to 13.8% and the average response cycle shortens to 2.4 working days. These results indicate that the model improves both the accuracy of greenwashing risk identification and the adaptability of dynamic supervision actions, providing a computable optimization path for intelligent supervision of supply chain greenwashing.
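
The state, action, and reward structure described in the abstract can be illustrated with a minimal sketch. The abstract names only the five state inputs, the five supervision actions, and the three reward terms. The [0, 1] scaling, the per-action costs in `ACTION_COST`, the weights `w_risk`, `w_cost`, and `w_misjudge`, and the linear form of `reward` are all hypothetical choices made here for illustration, not the authors' formulation.

```python
from dataclasses import dataclass
from enum import Enum


class Action(Enum):
    """The five supervision actions named in the abstract."""
    PROMPT_DESCRIPTION = 0
    DATA_REVIEW = 1
    KEY_SPOT_CHECK = 2
    CREDIT_CONSTRAINT = 3
    CONTINUOUS_TRACKING = 4


@dataclass(frozen=True)
class State:
    """The five state inputs named in the abstract.

    Assumption: each feature is scaled to [0, 1].
    """
    declaration_consistency: float
    performance_deviation: float
    certification_change: float
    text_anomaly: float
    historical_feedback: float


# Hypothetical resource cost per action (illustrative values only).
ACTION_COST = {
    Action.PROMPT_DESCRIPTION: 0.1,
    Action.DATA_REVIEW: 0.2,
    Action.KEY_SPOT_CHECK: 0.4,
    Action.CREDIT_CONSTRAINT: 0.6,
    Action.CONTINUOUS_TRACKING: 0.8,
}


def reward(risk_before: float, risk_after: float, action: Action,
           misjudged: bool, w_risk: float = 1.0, w_cost: float = 0.5,
           w_misjudge: float = 1.0) -> float:
    """Assumed linear reward: risk reduction minus cost and misjudgment penalties."""
    risk_reduction = risk_before - risk_after
    misjudge_loss = 1.0 if misjudged else 0.0
    return (w_risk * risk_reduction
            - w_cost * ACTION_COST[action]
            - w_misjudge * misjudge_loss)
```

Under these assumed weights, an action that lowers risk at modest cost scores higher than the same action applied to a misjudged subject, which is the trade-off the three reward terms are meant to capture.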

Keywords
Reinforcement learning; Supply chain management; Greenwashing behavior; Dynamic supervision strategy
