Based on panel data from 21 prefecture level cities in Guangdong Province from 2018 to 2024, this article constructs the Digital Village Development Index (DVI) and Sustainable Development Performance Index (SDI), and introduces the Proximal Policy Optimization (PPO) model to explore the optimization of digital rural resource allocation and sustainable development path under budget constraints. The results show that the digital rural development index in Guangdong has increased from 0.310 to 0.565, but there is still a stable gradient between the Pearl River Delta and non Pearl River Delta, and the growth rate of digital governance is higher than that of digital infrastructure and digital industrialization. Compared with the average distribution, the PPO strategy increased the comprehensive return by 19.6%, DVI by 13.4%, SDI by 11.2%, and the convergence rate of regional differences reached 15.7%. Further analysis reveals that the Pearl River Delta is more suitable for efficiency improvement paths, while eastern, western, and northern Guangdong are more suitable for fair compensation paths. Ecological sensitive agricultural areas should strengthen green collaboration. The study provides a quantitative basis for the implementation of policies and dynamic optimization in the zoning of digital rural areas in Guangdong.