File Download
There are no files associated with this item.
Links for fulltext
(May Require Subscription)
- Publisher Website: 10.1016/j.trc.2023.104032
- Scopus: eid_2-s2.0-85146878957
- WOS: WOS:001058451200001
- Find via
Supplementary
- Citations:
- Appears in Collections:
Article: A Bayesian clustering ensemble Gaussian process model for network-wide traffic flow clustering and prediction
Title | A Bayesian clustering ensemble Gaussian process model for network-wide traffic flow clustering and prediction |
---|---|
Authors | |
Keywords | Dirichlet process mixture model Gaussian Process Statistical learning Traffic flow prediction |
Issue Date | 1-Jul-2023 |
Publisher | Elsevier |
Citation | Transportation Research Part C: Emerging Technologies, 2023, v. 148 How to Cite? |
Abstract | Traffic flow prediction is an essential component in intelligent transportation systems. Recently, there has been a notable trend in applying machine learning models, especially deep learning, for network-wide traffic prediction. However, existing studies have limitations on model interpretability, model generalization, and over-reliance on image data processing or fine-designed deep learning structures for extracting traffic attributes. This paper attempts to tackle these limitations by proposing a Bayesian clustering ensemble Gaussian process (BCEGP) model for network-wide traffic flow clustering and prediction. The model utilizes a subset-based Dirichlet process mixture (SDPM) model to conduct a hard clustering among input data; then, within each cluster, it adopts the Gaussian Process (GP) to learn the probability relationship between inputs and outputs. During the prediction phase, the model conducts a soft clustering of the input as weights, and makes predictions via a weighted average of GPs’ outputs. The merits of the BCEGP model include: (a) data with similar spatial–temporal patterns are clustered, which helps understand traffic dynamics in a non-Euclidean and non-graphical manner that enhances information extracting for model development; (b) GPs provide analytically trackable functions/gradients of predicted traffic flows with features and reveal variances of predicted traffic flow, enhancing model applicability and interpretability to some extent; (c) the model incorporates an ensemble learning framework that achieves great generalization performance as good as deep learning models; (d) the subset-based clustering and cluster-based GP learning are conducted parallelly, and thus vastly accelerate the training efficiency compared with conventional GPs (but slower than deep learning models). We test the performance of the proposed model based on both synthesized and real-world datasets. For comparison, several widely used machine learning and deep learning models are trained under the real-world dataset. The results demonstrate that the BCEGP model performs well in predictive accuracy, computational speed, and applicability, which can be a promising method for transportation problems. |
Persistent Identifier | http://hdl.handle.net/10722/337918 |
ISSN | 2023 Impact Factor: 7.6 2023 SCImago Journal Rankings: 2.860 |
ISI Accession Number ID |
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Zhu, Z | - |
dc.contributor.author | Xu, M | - |
dc.contributor.author | Ke, J | - |
dc.contributor.author | Yang, H | - |
dc.contributor.author | Chen, X | - |
dc.date.accessioned | 2024-03-11T10:24:55Z | - |
dc.date.available | 2024-03-11T10:24:55Z | - |
dc.date.issued | 2023-07-01 | - |
dc.identifier.citation | Transportation Research Part C: Emerging Technologies, 2023, v. 148 | - |
dc.identifier.issn | 0968-090X | - |
dc.identifier.uri | http://hdl.handle.net/10722/337918 | - |
dc.description.abstract | Traffic flow prediction is an essential component in intelligent transportation systems. Recently, there has been a notable trend in applying machine learning models, especially deep learning, for network-wide traffic prediction. However, existing studies have limitations on model interpretability, model generalization, and over-reliance on image data processing or fine-designed deep learning structures for extracting traffic attributes. This paper attempts to tackle these limitations by proposing a Bayesian clustering ensemble Gaussian process (BCEGP) model for network-wide traffic flow clustering and prediction. The model utilizes a subset-based Dirichlet process mixture (SDPM) model to conduct a hard clustering among input data; then, within each cluster, it adopts the Gaussian Process (GP) to learn the probability relationship between inputs and outputs. During the prediction phase, the model conducts a soft clustering of the input as weights, and makes predictions via a weighted average of GPs’ outputs. The merits of the BCEGP model include: (a) data with similar spatial–temporal patterns are clustered, which helps understand traffic dynamics in a non-Euclidean and non-graphical manner that enhances information extracting for model development; (b) GPs provide analytically trackable functions/gradients of predicted traffic flows with features and reveal variances of predicted traffic flow, enhancing model applicability and interpretability to some extent; (c) the model incorporates an ensemble learning framework that achieves great generalization performance as good as deep learning models; (d) the subset-based clustering and cluster-based GP learning are conducted parallelly, and thus vastly accelerate the training efficiency compared with conventional GPs (but slower than deep learning models). We test the performance of the proposed model based on both synthesized and real-world datasets. For comparison, several widely used machine learning and deep learning models are trained under the real-world dataset. The results demonstrate that the BCEGP model performs well in predictive accuracy, computational speed, and applicability, which can be a promising method for transportation problems. | - |
dc.language | eng | - |
dc.publisher | Elsevier | - |
dc.relation.ispartof | Transportation Research Part C: Emerging Technologies | - |
dc.rights | This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License. | - |
dc.subject | Dirichlet process mixture model | - |
dc.subject | Gaussian Process | - |
dc.subject | Statistical learning | - |
dc.subject | Traffic flow prediction | - |
dc.title | A Bayesian clustering ensemble Gaussian process model for network-wide traffic flow clustering and prediction | - |
dc.type | Article | - |
dc.identifier.doi | 10.1016/j.trc.2023.104032 | - |
dc.identifier.scopus | eid_2-s2.0-85146878957 | - |
dc.identifier.volume | 148 | - |
dc.identifier.eissn | 1879-2359 | - |
dc.identifier.isi | WOS:001058451200001 | - |
dc.identifier.issnl | 0968-090X | - |