File Download
There are no files associated with this item.
Links for fulltext
(May Require Subscription)
- Publisher Website: 10.1145/2043652.2043658
- Scopus: eid_2-s2.0-84855216736
- WOS: WOS:000298291800006
- Find via
Supplementary
- Citations:
- Appears in Collections:
Article: Capturing continuous data and answering aggregate queries in probabilistic XML
Title | Capturing continuous data and answering aggregate queries in probabilistic XML |
---|---|
Authors | |
Keywords | Aggregate queries Aggregation Continuous distributions Probabilistic databases Probabilistic XML |
Issue Date | 2011 |
Publisher | Association for Computing Machinery, Inc. |
Citation | ACM Transactions on Database Systems, 2011, v. 36 n. 4, article no. 25, p. 25:1-25:45 How to Cite? |
Abstract | Sources of data uncertainty and imprecision are numerous. A way to handle this uncertainty is to associate probabilistic annotations to data. Many such probabilistic database models have been proposed, both in the relational and in the semi-structured setting. The latter is particularly well adapted to the management of uncertain data coming from a variety of automatic processes. An important problem, in the context of probabilistic XML databases, is that of answering aggregate queries (count, sum, avg, etc.), which has received limited attention so far. In a model unifying the various (discrete) semi-structured probabilistic models studied up to now, we present algorithms to compute the distribution of the aggregation values (exploiting some regularity properties of the aggregate functions) and probabilistic moments (especially expectation and variance) of this distribution. We also prove the intractability of some of these problems and investigate approximation techniques. We finally extend the discrete model to a continuous one, in order to take into account continuous data values, such as measurements from sensor networks, and extend our algorithms and complexity results to the continuous case. |
Persistent Identifier | http://hdl.handle.net/10722/160534 |
ISSN | 2023 Impact Factor: 2.2 2023 SCImago Journal Rankings: 1.730 |
ISI Accession Number ID |
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Abiteboul, S | en_US |
dc.contributor.author | Chan, HTH | en_US |
dc.contributor.author | Kharlamov, E | en_US |
dc.contributor.author | Nutt, W | en_US |
dc.contributor.author | Senellart, P | en_US |
dc.date.accessioned | 2012-08-16T06:13:30Z | - |
dc.date.available | 2012-08-16T06:13:30Z | - |
dc.date.issued | 2011 | en_US |
dc.identifier.citation | ACM Transactions on Database Systems, 2011, v. 36 n. 4, article no. 25, p. 25:1-25:45 | en_US |
dc.identifier.issn | 0362-5915 | - |
dc.identifier.uri | http://hdl.handle.net/10722/160534 | - |
dc.description.abstract | Sources of data uncertainty and imprecision are numerous. A way to handle this uncertainty is to associate probabilistic annotations to data. Many such probabilistic database models have been proposed, both in the relational and in the semi-structured setting. The latter is particularly well adapted to the management of uncertain data coming from a variety of automatic processes. An important problem, in the context of probabilistic XML databases, is that of answering aggregate queries (count, sum, avg, etc.), which has received limited attention so far. In a model unifying the various (discrete) semi-structured probabilistic models studied up to now, we present algorithms to compute the distribution of the aggregation values (exploiting some regularity properties of the aggregate functions) and probabilistic moments (especially expectation and variance) of this distribution. We also prove the intractability of some of these problems and investigate approximation techniques. We finally extend the discrete model to a continuous one, in order to take into account continuous data values, such as measurements from sensor networks, and extend our algorithms and complexity results to the continuous case. | - |
dc.language | eng | en_US |
dc.publisher | Association for Computing Machinery, Inc. | - |
dc.relation.ispartof | ACM Transactions on Database Systems | en_US |
dc.rights | ACM Transactions on Database Systems. Copyright © Association for Computing Machinery, Inc. | - |
dc.subject | Aggregate queries | - |
dc.subject | Aggregation | - |
dc.subject | Continuous distributions | - |
dc.subject | Probabilistic databases | - |
dc.subject | Probabilistic XML | - |
dc.title | Capturing continuous data and answering aggregate queries in probabilistic XML | en_US |
dc.type | Article | en_US |
dc.identifier.email | Chan, HTH: hubert@cs.hku.hk | en_US |
dc.identifier.authority | Chan, HTH=rp01312 | en_US |
dc.description.nature | link_to_subscribed_fulltext | - |
dc.identifier.doi | 10.1145/2043652.2043658 | - |
dc.identifier.scopus | eid_2-s2.0-84855216736 | - |
dc.identifier.hkuros | 202977 | en_US |
dc.identifier.volume | 36 | - |
dc.identifier.issue | 4, article no. 25 | - |
dc.identifier.spage | 25:1 | - |
dc.identifier.epage | 25:45 | - |
dc.identifier.eissn | 1557-4644 | - |
dc.identifier.isi | WOS:000298291800006 | - |
dc.publisher.place | United States | - |
dc.identifier.issnl | 0362-5915 | - |