Ingegneria Sismica

Application of Reinforcement Learning in Optimizing New Media Content Recommendation Systems

Author(s): Ruofei Gu¹
¹School of Digital Media and Design Arts, Beijing University of Posts and Telecommunications, Beijing, 102206, China
Gu, Ruofei. “Application of Reinforcement Learning in Optimizing New Media Content Recommendation Systems.” Ingegneria Sismica Volume 43 Issue 3: 1-20, doi:10.65102/is20261253.

Abstract

To address the conflict between immediate click-through rate and long-term retention in new media recommendation, this paper proposes DRL-MOREC, a deep reinforcement learning framework. A hybrid state encoder fuses users’ short-term behavior sequences with long-term interest graphs to capture dual temporal scales of interest. A two-stage reward function allocates session-level 7-day retention prediction signals to each step via a discount factor, alleviating delayed reward sparsity. Conservative Q-learning and inverse propensity score weighting are introduced to mitigate distribution shift and popularity bias, respectively. Offline experiments on a short-video platform dataset show that the proposed method achieves a 7-day retention rate 2.0, 2.7, and 1.3 percentage points higher than DeepFM, DDPG-TD3, and SAC-Rec, respectively, while improving catalog coverage (ECC) by 0.09. Online A/B testing demonstrates an 11.3% lift in daily active user retention over DeepFM. Ablation studies reveal that the delayed reward contributes 4.0 percentage points to the retention improvement. These results validate the effectiveness of reinforcement learning in optimizing long-term user value under dynamic recommendation scenarios.

Keywords
Deep Reinforcement Learning; Recommender System; New Media Content; Multi-Objective Optimization; User Retention
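
The abstract describes two mechanisms that are easy to illustrate in isolation: spreading a session-level retention signal over individual recommendation steps with a discount factor, and inverse propensity score (IPS) weighting to counter popularity bias. The following is a minimal sketch of both, not the paper's implementation. The additive shaping rule `r_t + gamma^(T-1-t) * retention_signal`, the weight clipping, and every name in the code (`allocate_session_reward`, `ips_weight`, `max_weight`) are assumptions made for illustration, since the abstract does not give the exact formulas.

```python
from typing import List


def allocate_session_reward(
    step_rewards: List[float],
    retention_signal: float,
    gamma: float = 0.9,
) -> List[float]:
    """Add a discounted share of a session-level signal to each step reward.

    Assumed rule (hypothetical, not taken from the paper):
        shaped_t = r_t + gamma ** (T - 1 - t) * retention_signal
    so steps closer to the end of the session receive a larger share.
    """
    if not 0.0 <= gamma <= 1.0:
        raise ValueError("gamma must be in [0, 1]")
    horizon = len(step_rewards)
    shaped = []
    for t, immediate in enumerate(step_rewards):
        steps_to_end = horizon - 1 - t
        shaped.append(immediate + (gamma ** steps_to_end) * retention_signal)
    return shaped


def ips_weight(propensity: float, max_weight: float = 10.0) -> float:
    """Inverse propensity weight, clipped to limit variance.

    `propensity` is the logging policy's probability of having shown the item.
    Clipping at `max_weight` is a common practical choice, assumed here.
    """
    if propensity <= 0.0:
        raise ValueError("propensity must be positive")
    return min(1.0 / propensity, max_weight)


if __name__ == "__main__":
    clicks = [1.0, 0.0, 1.0]  # immediate click rewards in one session
    print(allocate_session_reward(clicks, retention_signal=0.5, gamma=0.5))
    print(ips_weight(0.5), ips_weight(0.01))
```

In this sketch, a rarely exposed item (low propensity) gets a larger training weight, while the discounted retention term gives every step in the session a non-zero learning signal even when the immediate click reward is zero.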
