Outline

Ingegneria Sismica

Ingegneria Sismica

A Multi-Intelligent Body Collaboration Framework for Complex Task Decomposition Based on Large Language Modeling

Author(s): Houzhuo Wu1
1Applied Math and Statistics Department, Johns Hopkins University, Baltimore, MD, USA
Wu, Houzhuo. “A Multi-Intelligent Body Collaboration Framework for Complex Task Decomposition Based on Large Language Modeling.” Ingegneria Sismica Volume 43 Issue 1: 1-22, doi:10.65102/is2026100.

Abstract

With the breakthrough progress of the large language model in the field of natural language processing, the LLM-based intelligent body technology is gradually moving from theoretical research to practical application. The study first clarifies the complex task decomposition representation, takes reinforcement learning and multi-intelligent body reinforcement learning as the theoretical basis, elaborates the principle of LLaMA large language model, then introduces the PPO algorithm for strategy optimization, obtains the reward signal by interacting with the environment, and uses the dominance function and the pruning strategy to ensure the stability of the training and the convergence, realizing the application of large language model in the task decomposition. Finally, experiments are conducted in home and warehouse task scenarios for analysis. The results show that the performance of the model after fine-tuning using the PPO algorithm is significantly improved, and the average reward value during task decomposition is increased from 0.141 to 0.780, and the efficiency and stability of task decomposition are better than the original model. In the A/B test, LLaMA-PPO has a 3.46% improvement, which shows that the algorithm’s training speed and final performance have been improved while the task decomposition has been automated efficiently.

Keywords
lLaMA large language model; reinforcement learning; PPO; task decomposition

Related Articles

Huiqiao Liu1
1Yinchuan University of Energy, Ningxia, 750000, China
Xin Zhao1, Yan Li1, Xiangyang Cao1, Qiushuang Li1, Jianing Zhang1
1State Grid Shandong Electric Power Company Economic and Technological Research Institute ShanDong JiNan 250001, China
Dan Yang1
1School of Marxism, Suzhou Polytechnic University, Suzhou, 215104, China
Liuhang Shen1, Xiangwen Sun1
1Ulster college at Shaanxi University of Science &Technology, Xi’an,710021, Shaanxi, China