Links for fulltext
(May Require Subscription)
- Publisher Website: 10.1016/j.trb.2021.06.014
- Scopus: eid_2-s2.0-85110352014
- WOS: WOS:000685516700004
Article: A mean-field Markov decision process model for spatial-temporal subsidies in ride-sourcing markets
Title | A mean-field Markov decision process model for spatial-temporal subsidies in ride-sourcing markets |
---|---|
Authors | Zhu, Zheng; Ke, Jintao; Wang, Hai |
Keywords | Markov decision process; Mean-field; Mixed agents; Ride-sourcing; Subsidy |
Issue Date | 2021 |
Citation | Transportation Research Part B: Methodological, 2021, v. 150, p. 540-565 |
Abstract | Ride-sourcing services are increasingly popular because of their ability to accommodate on-demand travel needs. A critical issue faced by ride-sourcing platforms is the supply-demand imbalance, as a result of which drivers may spend substantial time on idle cruising and picking up remote passengers. Some platforms attempt to mitigate the imbalance by providing relocation guidance for idle drivers who may have their own self-relocation strategies and decline to follow the suggestions. Platforms then seek to induce drivers to system-desirable locations by offering them subsidies. This paper proposes a mean-field Markov decision process (MF-MDP) model to depict the dynamics in ride-sourcing markets with mixed agents, whereby the platform aims to optimize some objectives from a system perspective using spatial-temporal subsidies with predefined subsidy rates, and a number of drivers aim to maximize their individual income by following certain self-relocation strategies. To solve the model more efficiently, we further develop a representative-agent reinforcement learning algorithm that uses a representative driver to model the decision-making process of multiple drivers. This approach is shown to achieve significant computational advantages, faster convergence, and better performance. Using case studies, we demonstrate that by providing some spatial-temporal subsidies, the platform is able to well balance a short-term objective of maximizing immediate revenue and a long-term objective of maximizing service rate, while drivers can earn higher income. |
Persistent Identifier | http://hdl.handle.net/10722/308874 |
ISSN | 0191-2615 (2023 Impact Factor: 5.8; 2023 SCImago Journal Rankings: 2.660) |
ISI Accession Number ID | WOS:000685516700004 |
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Zhu, Zheng | - |
dc.contributor.author | Ke, Jintao | - |
dc.contributor.author | Wang, Hai | - |
dc.date.accessioned | 2021-12-08T07:50:19Z | - |
dc.date.available | 2021-12-08T07:50:19Z | - |
dc.date.issued | 2021 | - |
dc.identifier.citation | Transportation Research Part B: Methodological, 2021, v. 150, p. 540-565 | - |
dc.identifier.issn | 0191-2615 | - |
dc.identifier.uri | http://hdl.handle.net/10722/308874 | - |
dc.language | eng | - |
dc.relation.ispartof | Transportation Research Part B: Methodological | - |
dc.subject | Markov decision process | - |
dc.subject | Mean-field | - |
dc.subject | Mixed agents | - |
dc.subject | Ride-sourcing | - |
dc.subject | Subsidy | - |
dc.title | A mean-field Markov decision process model for spatial-temporal subsidies in ride-sourcing markets | - |
dc.type | Article | - |
dc.description.nature | link_to_subscribed_fulltext | - |
dc.identifier.doi | 10.1016/j.trb.2021.06.014 | - |
dc.identifier.scopus | eid_2-s2.0-85110352014 | - |
dc.identifier.volume | 150 | - |
dc.identifier.spage | 540 | - |
dc.identifier.epage | 565 | - |
dc.identifier.isi | WOS:000685516700004 | - |