사업성과 BK21 FOUR 산업혁신 애널리틱스 교육연구단

논문

2025 Spatio-temporal task pricing for shared electric micro-mobility battery-swapping platform with reinforcement learning

페이지 정보

작성자 관리자 작성일 25-10-14 13:24

본문

Author
Minjeong Kim, Ilkyeong Moon
Journal
International Journal of Production Research
Vol
63(4)
Page
1473-1494
Year
2025

Abstract

Spatial crowdsourcing has emerged in shared electric micro-mobility platforms, compensating occasional drivers (ODs) per task of swapping micro-mobility batteries. As ODs autonomously select tasks only when satisfied with predetermined compensation and travel distance, a traditional uniform pricing strategy results in possible low task completion. To resolve the imbalance between ODs and tasks, this study introduces a spatio-temporal pricing strategy where task prices differ by region and time interval. Considering the daily variations in task distribution and OD availability, the goal is to minimise the platform costs equal to the sum of total OD wages and penalties for uncompleted tasks. The reinforcement learning approach with proximal policy optimisation (PPO) is implemented to generate real-time continuous task prices. A domain-specific masking technique is incorporated to improve the learning process by disregarding the data from inactive grids in loss calculations. Computational results show that the PPO agent strategically raises prices in regions with insufficient ODs according to the OD density level. Further comparison with the mixed integer programming model with perfect information on ODs' willingness-to-accept parameters demonstrates the superior capability of our algorithm in navigating the uncertainties of OD task acceptance. A sensitivity analysis provides insights into the decision of system parameters.