File Download

There are no files associated with this item.

  Links for fulltext
     (May Require Subscription)
Supplementary

Article: Coupling mobile phone data with machine learning: How misclassification errors in ambient PM2.5 exposure estimates are produced?

TitleCoupling mobile phone data with machine learning: How misclassification errors in ambient PM2.5 exposure estimates are produced?
Authors
KeywordsPM2.5 exposure estimate
Misclassification errors
Mobile phone location data
Machine learning
Issue Date2020
PublisherElsevier BV. The Journal's web site is located at http://www.elsevier.com/locate/scitotenv
Citation
Science of the Total Environment, 2020, v. 745, p. article no. 141034 How to Cite?
AbstractBackground Most studies relying on time-activity diary or traditional air pollution modelling approach are insufficient to suggest the impacts of ignoring individual mobility and air pollution variations on misclassification errors in exposure estimates. Moreover, very few studies have examined whether such impacts differ across socioeconomic groups. Objectives We aim to examine how ignoring individual mobility and PM2.5 variations produces misclassification errors in ambient PM2.5 exposure estimates. Methods We developed a geo-informed backward propagation neural network model to estimate hourly PM2.5 concentrations in terms of remote sensing and geospatial big data. Combining the estimated PM2.5 concentrations and individual trajectories derived from 755,468 mobile phone users on a weekday in Shenzhen, China, we estimated four types of individual total PM2.5 exposures during weekdays at multi-temporal scales. The estimate ignoring individual mobility, PM2.5 variations or both was compared with the hypothetical error-free estimate using paired sample t-test. We then quantified the exposure misclassification error using Pearson correlation analysis. Moreover, we examined whether the misclassification error differs across different socioeconomic groups. Taking findings of ignoring individual mobility as an example, we further investigated whether such findings are robust to the different selections of time. Results We found that the estimate ignoring PM2.5 variations, individual mobility or both was statistically different from the hypothetical error-free estimate. Ignoring both factors produced the largest exposure misclassification error. The misclassification error was larger in the estimate ignoring PM2.5 variations than that ignoring individual mobility. People with high economic status suffered from a larger exposure misclassification error. The findings were robust to the different selections of time. Conclusions Ignoring individual mobility, PM2.5 variations or both leads to misclassification errors in ambient PM2.5 exposure estimates. A larger misclassification error occurs in the estimate neglecting PM2.5 variations than that ignoring individual mobility, which is seldom reported before.
Persistent Identifierhttp://hdl.handle.net/10722/286522
ISSN
2021 Impact Factor: 10.753
2020 SCImago Journal Rankings: 1.795
ISI Accession Number ID

 

DC FieldValueLanguage
dc.contributor.authorGuo, H-
dc.contributor.authorZhan, Q-
dc.contributor.authorHo, HC-
dc.contributor.authorYao, F-
dc.contributor.authorZhou, X-
dc.contributor.authorWu, J-
dc.contributor.authorLi, W-
dc.date.accessioned2020-08-31T07:05:01Z-
dc.date.available2020-08-31T07:05:01Z-
dc.date.issued2020-
dc.identifier.citationScience of the Total Environment, 2020, v. 745, p. article no. 141034-
dc.identifier.issn0048-9697-
dc.identifier.urihttp://hdl.handle.net/10722/286522-
dc.description.abstractBackground Most studies relying on time-activity diary or traditional air pollution modelling approach are insufficient to suggest the impacts of ignoring individual mobility and air pollution variations on misclassification errors in exposure estimates. Moreover, very few studies have examined whether such impacts differ across socioeconomic groups. Objectives We aim to examine how ignoring individual mobility and PM2.5 variations produces misclassification errors in ambient PM2.5 exposure estimates. Methods We developed a geo-informed backward propagation neural network model to estimate hourly PM2.5 concentrations in terms of remote sensing and geospatial big data. Combining the estimated PM2.5 concentrations and individual trajectories derived from 755,468 mobile phone users on a weekday in Shenzhen, China, we estimated four types of individual total PM2.5 exposures during weekdays at multi-temporal scales. The estimate ignoring individual mobility, PM2.5 variations or both was compared with the hypothetical error-free estimate using paired sample t-test. We then quantified the exposure misclassification error using Pearson correlation analysis. Moreover, we examined whether the misclassification error differs across different socioeconomic groups. Taking findings of ignoring individual mobility as an example, we further investigated whether such findings are robust to the different selections of time. Results We found that the estimate ignoring PM2.5 variations, individual mobility or both was statistically different from the hypothetical error-free estimate. Ignoring both factors produced the largest exposure misclassification error. The misclassification error was larger in the estimate ignoring PM2.5 variations than that ignoring individual mobility. People with high economic status suffered from a larger exposure misclassification error. The findings were robust to the different selections of time. Conclusions Ignoring individual mobility, PM2.5 variations or both leads to misclassification errors in ambient PM2.5 exposure estimates. A larger misclassification error occurs in the estimate neglecting PM2.5 variations than that ignoring individual mobility, which is seldom reported before.-
dc.languageeng-
dc.publisherElsevier BV. The Journal's web site is located at http://www.elsevier.com/locate/scitotenv-
dc.relation.ispartofScience of the Total Environment-
dc.subjectPM2.5 exposure estimate-
dc.subjectMisclassification errors-
dc.subjectMobile phone location data-
dc.subjectMachine learning-
dc.titleCoupling mobile phone data with machine learning: How misclassification errors in ambient PM2.5 exposure estimates are produced?-
dc.typeArticle-
dc.identifier.emailHo, HC: hcho21@hku.hk-
dc.identifier.emailLi, W: wfli@hku.hk-
dc.identifier.authorityHo, HC=rp02482-
dc.identifier.authorityLi, W=rp01507-
dc.description.naturelink_to_subscribed_fulltext-
dc.identifier.doi10.1016/j.scitotenv.2020.141034-
dc.identifier.pmid32758750-
dc.identifier.scopuseid_2-s2.0-85088741111-
dc.identifier.hkuros313416-
dc.identifier.volume745-
dc.identifier.spagearticle no. 141034-
dc.identifier.epagearticle no. 141034-
dc.identifier.isiWOS:000579365600071-
dc.publisher.placeNetherlands-
dc.identifier.issnl0048-9697-

Export via OAI-PMH Interface in XML Formats


OR


Export to Other Non-XML Formats