File Download
There are no files associated with this item.
Links for fulltext
(May Require Subscription)
- Publisher Website: 10.1007/978-3-642-36257-6_12
- Scopus: eid_2-s2.0-85031414750
Supplementary
-
Citations:
- Scopus: 0
- Appears in Collections:
Book Chapter: Managing Quality of Probabilistic Databases
Title | Managing Quality of Probabilistic Databases |
---|---|
Authors | |
Issue Date | 2013 |
Publisher | Springer-Verlag |
Citation | Managing Quality of Probabilistic Databases. In Sadiq, S (Ed.), Handbook of Data Quality: Research and Practice, p. 271-291. Berlin; New York: Springer-Verlag, 2013 How to Cite? |
Abstract | Uncertain or imprecise data are pervasive in applications like location-based services, sensor monitoring, and data collection and integration. For these applications, probabilistic databases can be used to store uncertain data, and querying facilities are provided to yield answers with statistical confidence. Given that a limited amount of resources is available to “clean” the database (e.g., by probing some sensor data values to get their latest values), we address the problem of choosing the set of uncertain objects to be cleaned, in order to achieve the best improvement in the quality of query answers. For this purpose, we present the PWS-quality metric, which is a universal measure that quantifies the ambiguity of query answers under the possible world semantics. We study how PWS-quality can be efficiently evaluated for two major query classes: (1) queries that examine the satisfiability of tuples independent of other tuples (e.g., range queries) and (2) queries that require the knowledge of the relative ranking of the tuples (e.g., MAX queries). We then propose a polynomial-time solution to achieve an optimal improvement in PWS-quality. Other fast heuristics are also examined. |
Persistent Identifier | http://hdl.handle.net/10722/166461 |
ISBN |
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Cheng, RCK | en_US |
dc.date.accessioned | 2012-09-20T08:36:32Z | - |
dc.date.available | 2012-09-20T08:36:32Z | - |
dc.date.issued | 2013 | en_US |
dc.identifier.citation | Managing Quality of Probabilistic Databases. In Sadiq, S (Ed.), Handbook of Data Quality: Research and Practice, p. 271-291. Berlin; New York: Springer-Verlag, 2013 | - |
dc.identifier.isbn | 9783642362569 | - |
dc.identifier.uri | http://hdl.handle.net/10722/166461 | - |
dc.description.abstract | Uncertain or imprecise data are pervasive in applications like location-based services, sensor monitoring, and data collection and integration. For these applications, probabilistic databases can be used to store uncertain data, and querying facilities are provided to yield answers with statistical confidence. Given that a limited amount of resources is available to “clean” the database (e.g., by probing some sensor data values to get their latest values), we address the problem of choosing the set of uncertain objects to be cleaned, in order to achieve the best improvement in the quality of query answers. For this purpose, we present the PWS-quality metric, which is a universal measure that quantifies the ambiguity of query answers under the possible world semantics. We study how PWS-quality can be efficiently evaluated for two major query classes: (1) queries that examine the satisfiability of tuples independent of other tuples (e.g., range queries) and (2) queries that require the knowledge of the relative ranking of the tuples (e.g., MAX queries). We then propose a polynomial-time solution to achieve an optimal improvement in PWS-quality. Other fast heuristics are also examined. | - |
dc.language | eng | en_US |
dc.publisher | Springer-Verlag | en_US |
dc.relation.ispartof | Handbook of Data Quality: Research and Practice | - |
dc.title | Managing Quality of Probabilistic Databases | en_US |
dc.type | Book_Chapter | en_US |
dc.identifier.email | Cheng, RCK: ckcheng@cs.hku.hk | en_US |
dc.identifier.authority | Cheng, RCK=rp00074 | en_US |
dc.identifier.doi | 10.1007/978-3-642-36257-6_12 | - |
dc.identifier.scopus | eid_2-s2.0-85031414750 | - |
dc.identifier.hkuros | 206199 | en_US |
dc.identifier.hkuros | 224491 | - |
dc.identifier.spage | 271 | - |
dc.identifier.epage | 291 | - |
dc.publisher.place | Berlin; New York | - |