Outline

Ingegneria Sismica

Ingegneria Sismica

Modeling and Application of E-Commerce Consumer Purchase Intention Prediction Using Multi-Source Data

Author(s): Qiong’ao Mei1, Huiyuan Zhang2, Yu Chen1
1Economics and Management, Huaibei Institute of Technology, Huaibei 235000, Anhui, China
2Management Engineering, Suzhou College of Information Technology, Suzhou 215000, Jiangsu, China
Mei, Qiong’ao., Zhang, Huiyuan., and Chen, Yu. “Modeling and Application of E-Commerce Consumer Purchase Intention Prediction Using Multi-Source Data.” Ingegneria Sismica Volume 43 Issue 2: 1-19, doi:10.65102/is2026764.

Abstract

Facing the problems of lagging identification of in-session transactions, sparse positive class samples and dispersed evidence from multiple sources on e-commerce platforms, this paper constructs a multi-source data-driven prediction model of consumers’ purchase intention, MSF-PIN.The study takes session as the basic object, and uniformly organizes behavioral logs, product attributes, evaluative sentiment, price promotions, and access contexts into 162,840 session-level samples, among which 24,912 purchase samples account for 15.30% of positive class. Among them, there are 24,912 purchase samples, accounting for 15.30% of positive categories. At the model level, behavioral sequence coding, cross-source gating fusion, and click-add-purchase multi-task learning are introduced to improve the ability to portray the strength of short-term interest paths and heterogeneous evidence. Experiments are conducted using temporal order slicing and deployment-oriented evaluation protocols. The results show that MSF-PIN has an AUC of 0.902, a PR-AUC of 0.523, an F1 of 0.587, a Logloss of 0.140, and an ECE of 0.032, which is overall better than the comparative models of LightGBM, CatBoost, DeepFM, AutoInt, DCN V2, and BST. Results obtained by checking each scenario one by one indicate that the model obtains an AUC of 0.913 in the scenario of returning visitors and a PR-AUC of 0.541 in the scenario of promotion. The deployment analysis further gives out that when we screen the top 10% high-intention sessions through predictive probability, the model can cover 48.9% of actual purchase samples, and the average inference delay is 34.9 ms for each 1000 sessions. The outcome illuminates that sample-level weight redistribution using many kinds of proof can promote the utilization degree of purchase intention forecast, thus give a directly gotten probability foundation for recommendation weight adjustment, coupon start, and customer service help.

Keywords
e-commerce purchase intention prediction; multi-source data; gated fusion; behavior sequence modeling; deployment-oriented evaluation

Related Articles

Huiqiao Liu1
1Yinchuan University of Energy, Ningxia, 750000, China
Xin Zhao1, Yan Li1, Xiangyang Cao1, Qiushuang Li1, Jianing Zhang1
1State Grid Shandong Electric Power Company Economic and Technological Research Institute ShanDong JiNan 250001, China
Dan Yang1
1School of Marxism, Suzhou Polytechnic University, Suzhou, 215104, China
Liuhang Shen1, Xiangwen Sun1
1Ulster college at Shaanxi University of Science &Technology, Xi’an,710021, Shaanxi, China