File Download
There are no files associated with this item.
Links for fulltext
(May Require Subscription)
- Publisher Website: 10.1177/03611981241306754
- Scopus: eid_2-s2.0-85216764386
- Find via
Supplementary
-
Citations:
- Scopus: 0
- Appears in Collections:
Article: Causal Graph Discovery for Urban Bus Operation Delays: A Case Study in Stockholm
Title | Causal Graph Discovery for Urban Bus Operation Delays: A Case Study in Stockholm |
---|---|
Authors | |
Keywords | big data data and data science data mining GTFS operations public transportation transformative trends in transit data |
Issue Date | 31-Jan-2025 |
Publisher | SAGE Publications |
Citation | Transportation Research Record: Journal of the Transportation Research Board, 2025 How to Cite? |
Abstract | Bus delays significantly affect urban public transportation by reducing operational efficiency and incurring high costs. Understanding the causes of these delays is essential for developing targeted mitigation strategies. While traditional research focuses on correlation-based analysis, it often fails to uncover the underlying causal mechanisms. This study examines various causal graph discovery algorithms combined with structural equation models (SEMs) to infer the causal relationships among factors that affect bus delays. These algorithms generate causal graphs for bus delays, revealing the interrelations and impacts of various operational factors. SEM is used to quantify the causal effects. This study evaluates the performance of these algorithms from the perspectives of both the statistical data fitting and the causal relationships generated. A case study is conducted using General Transit Feed Specification (GTFS) data from frequent bus routes in Stockholm, Sweden. The validation results demonstrate the effectiveness of data-driven causal discovery models in identifying causal links, particularly when combined with domain knowledge. The empirical analysis shows the complexity of factors contributing to bus delays, emphasizing the necessity of integrating causality into bus delay analysis. For example, a high correlation between origin delay and bus arrival delay (coefficient = 0.63) does not indicate direct causation, and a strong causation between dwell time and arrival delay does not imply a higher correlation (coefficient = 0.12). Comparing variable importance with linear regression (LR) reveals notable differences; origin delay, which is often overlooked by previous studies, is significant in the causal graph model (standardized coefficient = 0.601) but ranks much lower in LR (standardized coefficient = 0.003). These insights underscore the importance of automated, data-driven causal discovery in enhancing decision-making processes and improving the efficiency and reliability of transit services. |
Persistent Identifier | http://hdl.handle.net/10722/354618 |
ISSN | 2023 Impact Factor: 1.6 2023 SCImago Journal Rankings: 0.543 |
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Zhang, Qi | - |
dc.contributor.author | Ma, Zhenliang | - |
dc.contributor.author | Ling, Yancheng | - |
dc.contributor.author | Qin, Zhenlin | - |
dc.contributor.author | Zhang, Pengfei | - |
dc.contributor.author | Zhao, Zhan | - |
dc.date.accessioned | 2025-02-24T00:40:18Z | - |
dc.date.available | 2025-02-24T00:40:18Z | - |
dc.date.issued | 2025-01-31 | - |
dc.identifier.citation | Transportation Research Record: Journal of the Transportation Research Board, 2025 | - |
dc.identifier.issn | 0361-1981 | - |
dc.identifier.uri | http://hdl.handle.net/10722/354618 | - |
dc.description.abstract | Bus delays significantly affect urban public transportation by reducing operational efficiency and incurring high costs. Understanding the causes of these delays is essential for developing targeted mitigation strategies. While traditional research focuses on correlation-based analysis, it often fails to uncover the underlying causal mechanisms. This study examines various causal graph discovery algorithms combined with structural equation models (SEMs) to infer the causal relationships among factors that affect bus delays. These algorithms generate causal graphs for bus delays, revealing the interrelations and impacts of various operational factors. SEM is used to quantify the causal effects. This study evaluates the performance of these algorithms from the perspectives of both the statistical data fitting and the causal relationships generated. A case study is conducted using General Transit Feed Specification (GTFS) data from frequent bus routes in Stockholm, Sweden. The validation results demonstrate the effectiveness of data-driven causal discovery models in identifying causal links, particularly when combined with domain knowledge. The empirical analysis shows the complexity of factors contributing to bus delays, emphasizing the necessity of integrating causality into bus delay analysis. For example, a high correlation between origin delay and bus arrival delay (coefficient = 0.63) does not indicate direct causation, and a strong causation between dwell time and arrival delay does not imply a higher correlation (coefficient = 0.12). Comparing variable importance with linear regression (LR) reveals notable differences; origin delay, which is often overlooked by previous studies, is significant in the causal graph model (standardized coefficient = 0.601) but ranks much lower in LR (standardized coefficient = 0.003). These insights underscore the importance of automated, data-driven causal discovery in enhancing decision-making processes and improving the efficiency and reliability of transit services. | - |
dc.language | eng | - |
dc.publisher | SAGE Publications | - |
dc.relation.ispartof | Transportation Research Record: Journal of the Transportation Research Board | - |
dc.rights | This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License. | - |
dc.subject | big data | - |
dc.subject | data and data science | - |
dc.subject | data mining | - |
dc.subject | GTFS | - |
dc.subject | operations | - |
dc.subject | public transportation | - |
dc.subject | transformative trends in transit data | - |
dc.title | Causal Graph Discovery for Urban Bus Operation Delays: A Case Study in Stockholm | - |
dc.type | Article | - |
dc.identifier.doi | 10.1177/03611981241306754 | - |
dc.identifier.scopus | eid_2-s2.0-85216764386 | - |
dc.identifier.eissn | 2169-4052 | - |
dc.identifier.issnl | 0361-1981 | - |