서울대학교 산업혁신 애널리틱스 교육연구단

2025 Spatio-temporal task pricing for shared electric micro-mobility battery-swapping platform with reinforcement learning

페이지 정보

작성자 관리자 작성일 25-10-14 13:24

본문

Author: Minjeong Kim, Ilkyeong Moon

Journal: International Journal of Production Research

Vol: 63(4)

Page: 1473-1494

Year: 2025

Abstract

Spatial crowdsourcing has emerged in shared electric micro-mobility platforms, compensating occasional drivers (ODs) per task of swapping micro-mobility batteries. As ODs autonomously select tasks only when satisfied with predetermined compensation and travel distance, a traditional uniform pricing strategy results in possible low task completion. To resolve the imbalance between ODs and tasks, this study introduces a spatio-temporal pricing strategy where task prices differ by region and time interval. Considering the daily variations in task distribution and OD availability, the goal is to minimise the platform costs equal to the sum of total OD wages and penalties for uncompleted tasks. The reinforcement learning approach with proximal policy optimisation (PPO) is implemented to generate real-time continuous task prices. A domain-specific masking technique is incorporated to improve the learning process by disregarding the data from inactive grids in loss calculations. Computational results show that the PPO agent strategically raises prices in regions with insufficient ODs according to the OD density level. Further comparison with the mixed integer programming model with perfect information on ODs' willingness-to-accept parameters demonstrates the superior capability of our algorithm in navigating the uncertainties of OD task acceptance. A sensitivity analysis provides insights into the decision of system parameters.

HOME

교육연구단 소개

참여인력

사업성과

게시판

관련사이트

사업성과 BK21 FOUR 산업혁신 애널리틱스 교육연구단

논문

2025 Spatio-temporal task pricing for shared electric micro-mobility battery-swapping platform with reinforcement learning

페이지 정보

본문

관련링크