File Download
There are no files associated with this item.
Links for fulltext
(May Require Subscription)
- Publisher Website: 10.1109/TBDATA.2017.2669308
- Scopus: eid_2-s2.0-85096699397
- WOS: WOS:000591809700008
Supplementary
- Citations:
- Appears in Collections:
Article: Scalable Uncertainty-Aware Truth Discovery in Big Data Social Sensing Applications for Cyber-Physical Systems
Title | Scalable Uncertainty-Aware Truth Discovery in Big Data Social Sensing Applications for Cyber-Physical Systems |
---|---|
Authors | |
Keywords | Big data cyber-physical systems parallel implementation scalability social sensing truth discovery uncertainty-aware |
Issue Date | 2020 |
Citation | IEEE Transactions on Big Data, 2020, v. 6, n. 4, p. 702-713 How to Cite? |
Abstract | Social sensing is a new big data application paradigm for Cyber-Physical Systems (CPS), where a group of individuals volunteer (or are recruited) to report measurements or observations about the physical world at scale. A fundamental challenge in social sensing applications lies in discovering the correctness of reported observations and reliability of data sources without prior knowledge on either of them. We refer to this problem as truth discovery. While prior studies have made progress on addressing this challenge, two important limitations exist: (i) current solutions did not fully explore the uncertainty aspect of human reported data, which leads to sub-optimal truth discovery results; (ii) current truth discovery solutions are mostly designed as sequential algorithms that do not scale well to large-scale social sensing events. In this paper, we develop a Scalable Uncertainty-Aware Truth Discovery (SUTD) scheme to address the above limitations. The SUTD scheme solves a constraint estimation problem to jointly estimate the correctness of reported data and the reliability of data sources while explicitly considering the uncertainty on the reported data. To address the scalability challenge, the SUTD is designed to run a Graphic Processing Unit (GPU) with thousands of cores, which is shown to run two to three orders of magnitude faster than the sequential truth discovery solutions. In evaluation, we compare our SUTD scheme to the state-of-the-art solutions using three real world datasets collected from Twitter: Paris Attack, Oregon Shooting, and Baltimore Riots, all in 2015. The evaluation results show that our new scheme significantly outperforms the baselines in terms of both truth discovery accuracy and execution time. |
Persistent Identifier | http://hdl.handle.net/10722/308832 |
ISI Accession Number ID |
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Huang, Chao | - |
dc.contributor.author | Wang, Dong | - |
dc.contributor.author | Chawla, Nitesh V. | - |
dc.date.accessioned | 2021-12-08T07:50:13Z | - |
dc.date.available | 2021-12-08T07:50:13Z | - |
dc.date.issued | 2020 | - |
dc.identifier.citation | IEEE Transactions on Big Data, 2020, v. 6, n. 4, p. 702-713 | - |
dc.identifier.uri | http://hdl.handle.net/10722/308832 | - |
dc.description.abstract | Social sensing is a new big data application paradigm for Cyber-Physical Systems (CPS), where a group of individuals volunteer (or are recruited) to report measurements or observations about the physical world at scale. A fundamental challenge in social sensing applications lies in discovering the correctness of reported observations and reliability of data sources without prior knowledge on either of them. We refer to this problem as truth discovery. While prior studies have made progress on addressing this challenge, two important limitations exist: (i) current solutions did not fully explore the uncertainty aspect of human reported data, which leads to sub-optimal truth discovery results; (ii) current truth discovery solutions are mostly designed as sequential algorithms that do not scale well to large-scale social sensing events. In this paper, we develop a Scalable Uncertainty-Aware Truth Discovery (SUTD) scheme to address the above limitations. The SUTD scheme solves a constraint estimation problem to jointly estimate the correctness of reported data and the reliability of data sources while explicitly considering the uncertainty on the reported data. To address the scalability challenge, the SUTD is designed to run a Graphic Processing Unit (GPU) with thousands of cores, which is shown to run two to three orders of magnitude faster than the sequential truth discovery solutions. In evaluation, we compare our SUTD scheme to the state-of-the-art solutions using three real world datasets collected from Twitter: Paris Attack, Oregon Shooting, and Baltimore Riots, all in 2015. The evaluation results show that our new scheme significantly outperforms the baselines in terms of both truth discovery accuracy and execution time. | - |
dc.language | eng | - |
dc.relation.ispartof | IEEE Transactions on Big Data | - |
dc.subject | Big data | - |
dc.subject | cyber-physical systems | - |
dc.subject | parallel implementation | - |
dc.subject | scalability | - |
dc.subject | social sensing | - |
dc.subject | truth discovery | - |
dc.subject | uncertainty-aware | - |
dc.title | Scalable Uncertainty-Aware Truth Discovery in Big Data Social Sensing Applications for Cyber-Physical Systems | - |
dc.type | Article | - |
dc.description.nature | link_to_subscribed_fulltext | - |
dc.identifier.doi | 10.1109/TBDATA.2017.2669308 | - |
dc.identifier.scopus | eid_2-s2.0-85096699397 | - |
dc.identifier.volume | 6 | - |
dc.identifier.issue | 4 | - |
dc.identifier.spage | 702 | - |
dc.identifier.epage | 713 | - |
dc.identifier.eissn | 2332-7790 | - |
dc.identifier.isi | WOS:000591809700008 | - |