File Download
There are no files associated with this item.
Links for fulltext
(May Require Subscription)
- Publisher Website: 10.1109/TBDATA.2017.2723899
- WOS: WOS:000451911800011
- Find via
Supplementary
-
Citations:
- Web of Science: 0
- Appears in Collections:
Article: pg-Causality: Identifying Spatiotemporal Causal Pathways for Air Pollutants with Urban Big Data
Title | pg-Causality: Identifying Spatiotemporal Causal Pathways for Air Pollutants with Urban Big Data |
---|---|
Authors | |
Keywords | causality pattern mining Bayesian learning spatiotemporal (ST) big data urban computing |
Issue Date | 2018 |
Publisher | Institute of Electrical and Electronics Engineers. The Journal's web site is located at http://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=6687317 |
Citation | IEEE Transactions on Big Data, 2018, v. 4 n. 4, p. 571-585 How to Cite? |
Abstract | Many countries are suffering from severe air pollution. Understanding how different air pollutants accumulate and propagate is critical to making relevant public policies. In this paper, we use urban big data (air quality data and meteorological data) to identify the spatiotemporal (ST) causal pathways for air pollutants. This problem is challenging because: (1) there are numerous noisy and low-pollution periods in the raw air quality data, which may lead to unreliable causality analysis; (2) for large-scale data in the ST space, the computational complexity of constructing a causal structure is very high; and (3) the ST causal pathways are complex due to the interactions of multiple pollutants and the influence of environmental factors. Therefore, we present pg-Causality, a novel pattern-aided graphical causality analysis approach that combines the strengths of pattern mining and Bayesian learning to efficiently identify the ST causal pathways. First, pattern mining helps suppress the noise by capturing frequent evolving patterns (FEPs) of each monitoring sensor, and greatly reduce the complexity by selecting the pattern-matched sensors as 'causers'. Then, Bayesian learning carefully encodes the local and ST causal relations with a Gaussian Bayesian Network (GBN)-based graphical model, which also integrates environmental influences to minimize biases in the final results. We evaluate our approach with three real-world data sets containing 982 air quality sensors in 128 cities, in three regions of China from 01-Jun-2013 to 31-Dec-2016. Results show that our approach outperforms the traditional causal structure learning methods in time efficiency, inference accuracy and interpretability. |
Persistent Identifier | http://hdl.handle.net/10722/275017 |
ISSN | 2021 Impact Factor: 4.271 2020 SCImago Journal Rankings: 0.959 |
ISI Accession Number ID |
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Zhu, Y | - |
dc.contributor.author | Zhang, C | - |
dc.contributor.author | Zhang, H | - |
dc.contributor.author | Zhi, S | - |
dc.contributor.author | Li, VOK | - |
dc.contributor.author | Han, J | - |
dc.contributor.author | Zheng, Y | - |
dc.date.accessioned | 2019-09-10T02:33:45Z | - |
dc.date.available | 2019-09-10T02:33:45Z | - |
dc.date.issued | 2018 | - |
dc.identifier.citation | IEEE Transactions on Big Data, 2018, v. 4 n. 4, p. 571-585 | - |
dc.identifier.issn | 2332-7790 | - |
dc.identifier.uri | http://hdl.handle.net/10722/275017 | - |
dc.description.abstract | Many countries are suffering from severe air pollution. Understanding how different air pollutants accumulate and propagate is critical to making relevant public policies. In this paper, we use urban big data (air quality data and meteorological data) to identify the spatiotemporal (ST) causal pathways for air pollutants. This problem is challenging because: (1) there are numerous noisy and low-pollution periods in the raw air quality data, which may lead to unreliable causality analysis; (2) for large-scale data in the ST space, the computational complexity of constructing a causal structure is very high; and (3) the ST causal pathways are complex due to the interactions of multiple pollutants and the influence of environmental factors. Therefore, we present pg-Causality, a novel pattern-aided graphical causality analysis approach that combines the strengths of pattern mining and Bayesian learning to efficiently identify the ST causal pathways. First, pattern mining helps suppress the noise by capturing frequent evolving patterns (FEPs) of each monitoring sensor, and greatly reduce the complexity by selecting the pattern-matched sensors as 'causers'. Then, Bayesian learning carefully encodes the local and ST causal relations with a Gaussian Bayesian Network (GBN)-based graphical model, which also integrates environmental influences to minimize biases in the final results. We evaluate our approach with three real-world data sets containing 982 air quality sensors in 128 cities, in three regions of China from 01-Jun-2013 to 31-Dec-2016. Results show that our approach outperforms the traditional causal structure learning methods in time efficiency, inference accuracy and interpretability. | - |
dc.language | eng | - |
dc.publisher | Institute of Electrical and Electronics Engineers. The Journal's web site is located at http://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=6687317 | - |
dc.relation.ispartof | IEEE Transactions on Big Data | - |
dc.rights | IEEE Transactions on Big Data. Copyright © Institute of Electrical and Electronics Engineers. | - |
dc.rights | ©20xx IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works. | - |
dc.subject | causality | - |
dc.subject | pattern mining | - |
dc.subject | Bayesian learning | - |
dc.subject | spatiotemporal (ST) big data | - |
dc.subject | urban computing | - |
dc.title | pg-Causality: Identifying Spatiotemporal Causal Pathways for Air Pollutants with Urban Big Data | - |
dc.type | Article | - |
dc.identifier.email | Li, VOK: vli@eee.hku.hk | - |
dc.identifier.authority | Li, VOK=rp00150 | - |
dc.description.nature | link_to_subscribed_fulltext | - |
dc.identifier.doi | 10.1109/TBDATA.2017.2723899 | - |
dc.identifier.hkuros | 302924 | - |
dc.identifier.volume | 4 | - |
dc.identifier.issue | 4 | - |
dc.identifier.spage | 571 | - |
dc.identifier.epage | 585 | - |
dc.identifier.isi | WOS:000451911800011 | - |
dc.publisher.place | United States | - |
dc.identifier.issnl | 2332-7790 | - |