File Download
There are no files associated with this item.
Links for fulltext
(May Require Subscription)
- Publisher Website: 10.1016/j.envpol.2020.114777
- Scopus: eid_2-s2.0-85086433299
- PMID: 32540592
- Find via
Supplementary
- Citations:
- Appears in Collections:
Article: A gradient boost approach for predicting near-road ultrafine particle concentrations using detailed traffic characterization
Title | A gradient boost approach for predicting near-road ultrafine particle concentrations using detailed traffic characterization |
---|---|
Authors | |
Keywords | Cross-validation K-means clustering Local traffic Machine learning Short-term fixed monitoring |
Issue Date | 2020 |
Citation | Environmental Pollution, 2020, v. 265, article no. 114777 How to Cite? |
Abstract | This study investigates the influence of meteorology, land use, built environment, and traffic characteristics on near-road ultrafine particle (UFP) concentrations. To achieve this objective, minute-level UFP concentrations were measured at various locations along a major arterial road in the Greater Toronto Area (GTA) between February and May 2019. Each location was visited five times, at least once in the morning, mid-day, and afternoon. Each visit lasted for 30 min, resulting in 2.5 h of minute-level data collected at each location. Local traffic information, including vehicle class and turning movements, were processed using computer vision techniques. The number of fast-food restaurants, cafes, trees, traffic signals, and building footprint, were found to have positive impacts on the mean UFP, while distance to the closest major road was negatively associated with UFP. We employed the Extreme Gradient Boosting (XGBoost) method to develop prediction models for UFP concentrations. The Shapley additive explanation (SHAP) measures were used to capture the influence of each feature on model output. The model results demonstrated that minute-level counts of local traffic from different directions had significant impacts on near-road UFP concentrations, model performance was robust under random cross-validation as coefficients of determination (R2) ranged from 0.63 to 0.69, but it revealed weaknesses when data at specific locations were eliminated from the training dataset. This result indicates that proper cross-validation techniques should be developed to better evaluate machine learning models for air quality predictions. |
Persistent Identifier | http://hdl.handle.net/10722/346783 |
ISSN | 2023 Impact Factor: 7.6 2023 SCImago Journal Rankings: 2.132 |
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Xu, Junshi | - |
dc.contributor.author | Wang, An | - |
dc.contributor.author | Schmidt, Nicole | - |
dc.contributor.author | Adams, Matthew | - |
dc.contributor.author | Hatzopoulou, Marianne | - |
dc.date.accessioned | 2024-09-17T04:13:15Z | - |
dc.date.available | 2024-09-17T04:13:15Z | - |
dc.date.issued | 2020 | - |
dc.identifier.citation | Environmental Pollution, 2020, v. 265, article no. 114777 | - |
dc.identifier.issn | 0269-7491 | - |
dc.identifier.uri | http://hdl.handle.net/10722/346783 | - |
dc.description.abstract | This study investigates the influence of meteorology, land use, built environment, and traffic characteristics on near-road ultrafine particle (UFP) concentrations. To achieve this objective, minute-level UFP concentrations were measured at various locations along a major arterial road in the Greater Toronto Area (GTA) between February and May 2019. Each location was visited five times, at least once in the morning, mid-day, and afternoon. Each visit lasted for 30 min, resulting in 2.5 h of minute-level data collected at each location. Local traffic information, including vehicle class and turning movements, were processed using computer vision techniques. The number of fast-food restaurants, cafes, trees, traffic signals, and building footprint, were found to have positive impacts on the mean UFP, while distance to the closest major road was negatively associated with UFP. We employed the Extreme Gradient Boosting (XGBoost) method to develop prediction models for UFP concentrations. The Shapley additive explanation (SHAP) measures were used to capture the influence of each feature on model output. The model results demonstrated that minute-level counts of local traffic from different directions had significant impacts on near-road UFP concentrations, model performance was robust under random cross-validation as coefficients of determination (R2) ranged from 0.63 to 0.69, but it revealed weaknesses when data at specific locations were eliminated from the training dataset. This result indicates that proper cross-validation techniques should be developed to better evaluate machine learning models for air quality predictions. | - |
dc.language | eng | - |
dc.relation.ispartof | Environmental Pollution | - |
dc.subject | Cross-validation | - |
dc.subject | K-means clustering | - |
dc.subject | Local traffic | - |
dc.subject | Machine learning | - |
dc.subject | Short-term fixed monitoring | - |
dc.title | A gradient boost approach for predicting near-road ultrafine particle concentrations using detailed traffic characterization | - |
dc.type | Article | - |
dc.description.nature | link_to_subscribed_fulltext | - |
dc.identifier.doi | 10.1016/j.envpol.2020.114777 | - |
dc.identifier.pmid | 32540592 | - |
dc.identifier.scopus | eid_2-s2.0-85086433299 | - |
dc.identifier.volume | 265 | - |
dc.identifier.spage | article no. 114777 | - |
dc.identifier.epage | article no. 114777 | - |
dc.identifier.eissn | 1873-6424 | - |