Article: A mixed-integer programming-based Q-learning approach for electric bus scheduling with multiple termini and service routes

Title: A mixed-integer programming-based Q-learning approach for electric bus scheduling with multiple termini and service routes
Authors: Yan, Yimo; Wen, Haomin; Deng, Yang; Chow, Andy HF; Wu, Qihao; Kuo, Yong Hong
Keywords: Electric buses; Mixed-integer linear programming; Public transport; Q-learning
Issue Date: 4-Apr-2024
Publisher: Elsevier
Citation: Transportation Research Part C: Emerging Technologies, 2024, v. 162
Abstract: Electric buses (EBs) are considered a more environmentally friendly mode of public transit. In addition to other practical challenges, including high infrastructure costs and short driving ranges, the operations of EBs are more demanding due to the necessary battery charging activities. Consequently, more sophisticated optimisation models and algorithms are required for effective operations. This paper presents an EB scheduling problem with multiple termini and service routes. Various realistic but complicated factors, such as shared facilities at multiple termini, the flexibility of plugging and unplugging chargers before an EB is fully charged, stochastic travel times, and EB breakdowns, are considered. We propose an integrated learning and mixed-integer linear programming (MILP) framework to overcome the computational difficulties when solving the problem. This framework leverages the strengths of reinforcement learning and MILP for fast computations due to its capability of learning from outcomes of state–action pairs and computational effectiveness guaranteed by the constraints governing the solution feasibility. Q-Learning and Twin Delayed Deep Deterministic Policy Gradient are adopted as our training methods. We conduct numerical experiments on artificial instances and realistic instances of a bus network in Hong Kong to assess the performance of our proposed approach. The results show that our proposed framework outperforms the benchmark optimisation approach, in terms of penalty on missed service trips, average headway, and variance of headway. The benefits of our proposed framework are more significant under a highly stochastic environment.
Persistent Identifier: http://hdl.handle.net/10722/345924
ISSN: 0968-090X
2023 Impact Factor: 7.6
2023 SCImago Journal Rankings: 2.860

 

DC Field: Value
dc.contributor.author: Yan, Yimo
dc.contributor.author: Wen, Haomin
dc.contributor.author: Deng, Yang
dc.contributor.author: Chow, Andy HF
dc.contributor.author: Wu, Qihao
dc.contributor.author: Kuo, Yong Hong
dc.date.accessioned: 2024-09-04T07:06:29Z
dc.date.available: 2024-09-04T07:06:29Z
dc.date.issued: 2024-04-04
dc.identifier.citation: Transportation Research Part C: Emerging Technologies, 2024, v. 162
dc.identifier.issn: 0968-090X
dc.identifier.uri: http://hdl.handle.net/10722/345924
dc.language: eng
dc.publisher: Elsevier
dc.relation.ispartof: Transportation Research Part C: Emerging Technologies
dc.subject: Electric buses
dc.subject: Mixed-integer linear programming
dc.subject: Public transport
dc.subject: Q-learning
dc.title: A mixed-integer programming-based Q-learning approach for electric bus scheduling with multiple termini and service routes
dc.type: Article
dc.identifier.doi: 10.1016/j.trc.2024.104570
dc.identifier.scopus: eid_2-s2.0-85189754065
dc.identifier.volume: 162
dc.identifier.eissn: 1879-2359
dc.identifier.issnl: 0968-090X
