File Download
Links for fulltext
(May Require Subscription)
- Publisher Website: 10.1145/1804669.1804679
- Scopus: eid_2-s2.0-77954522709
Conference Paper: Aggregate queries for discrete and continuous probabilistic XML
Title | Aggregate queries for discrete and continuous probabilistic XML |
---|---|
Authors | |
Keywords | aggregation algorithms complexity probabilistic databases XML |
Issue Date | 2010 |
Publisher | ACM. |
Citation | The 13th International Conference on Database Theory (ICDT'10), Lausanne, Switzerland; 23 -25 March 2010. In Conference Proceedings, 2010, p. 50-61 How to Cite? |
Abstract | Sources of data uncertainty and imprecision are numerous. A way to handle this uncertainty is to associate probabilistic annotations to data. Many such probabilistic database models have been proposed, both in the relational and in the semi-structured setting. The latter is particularly well adapted to the management of uncertain data coming from a variety of automatic processes. An important problem, in the context of probabilistic XML databases, is that of answering aggregate queries (count, sum, avg, etc.), which has received limited attention so far. In a model unifying the various (discrete) semi-structured probabilistic models studied up to now, we present algorithms to compute the distribution of the aggregation values (exploiting some regularity properties of the aggregate functions) and probabilistic moments (especially, expectation and variance) of this distribution. We also prove the intractability of some of these problems and investigate approximation techniques. We finally extend the discrete model to a continuous one, in order to take into account continuous data values, such as measurements from sensor networks, and present algorithms to compute distribution functions and moments for various classes of continuous distributions of data values. © 2010 ACM. |
Persistent Identifier | http://hdl.handle.net/10722/92644 |
ISBN | |
References |
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Abiteboul, S | en_HK |
dc.contributor.author | Chan, THH | en_HK |
dc.contributor.author | Kharlamov, E | en_HK |
dc.contributor.author | Nutt, W | en_HK |
dc.contributor.author | Senellart, P | en_HK |
dc.date.accessioned | 2010-09-17T10:52:51Z | - |
dc.date.available | 2010-09-17T10:52:51Z | - |
dc.date.issued | 2010 | en_HK |
dc.identifier.citation | The 13th International Conference on Database Theory (ICDT'10), Lausanne, Switzerland; 23 -25 March 2010. In Conference Proceedings, 2010, p. 50-61 | en_HK |
dc.identifier.isbn | 978-1-60558-947-3 | - |
dc.identifier.uri | http://hdl.handle.net/10722/92644 | - |
dc.description.abstract | Sources of data uncertainty and imprecision are numerous. A way to handle this uncertainty is to associate probabilistic annotations to data. Many such probabilistic database models have been proposed, both in the relational and in the semi-structured setting. The latter is particularly well adapted to the management of uncertain data coming from a variety of automatic processes. An important problem, in the context of probabilistic XML databases, is that of answering aggregate queries (count, sum, avg, etc.), which has received limited attention so far. In a model unifying the various (discrete) semi-structured probabilistic models studied up to now, we present algorithms to compute the distribution of the aggregation values (exploiting some regularity properties of the aggregate functions) and probabilistic moments (especially, expectation and variance) of this distribution. We also prove the intractability of some of these problems and investigate approximation techniques. We finally extend the discrete model to a continuous one, in order to take into account continuous data values, such as measurements from sensor networks, and present algorithms to compute distribution functions and moments for various classes of continuous distributions of data values. © 2010 ACM. | en_HK |
dc.language | eng | en_HK |
dc.publisher | ACM. | - |
dc.relation.ispartof | ICDT '10 - Proceedings of the 13th International Conference on Database Theory | en_HK |
dc.subject | aggregation | en_HK |
dc.subject | algorithms | en_HK |
dc.subject | complexity | en_HK |
dc.subject | probabilistic databases | en_HK |
dc.subject | XML | en_HK |
dc.title | Aggregate queries for discrete and continuous probabilistic XML | en_HK |
dc.type | Conference_Paper | en_HK |
dc.identifier.email | Chan, THH:hubert@cs.hku.hk | en_HK |
dc.identifier.authority | Chan, THH=rp01312 | en_HK |
dc.description.nature | link_to_OA_fulltext | - |
dc.identifier.doi | 10.1145/1804669.1804679 | en_HK |
dc.identifier.scopus | eid_2-s2.0-77954522709 | en_HK |
dc.identifier.hkuros | 170691 | - |
dc.relation.references | http://www.scopus.com/mlt/select.url?eid=2-s2.0-77954522709&selection=ref&src=s&origin=recordpage | en_HK |
dc.identifier.spage | 50 | en_HK |
dc.identifier.epage | 61 | en_HK |
dc.publisher.place | United States | - |
dc.identifier.scopusauthorid | Abiteboul, S=7005292791 | en_HK |
dc.identifier.scopusauthorid | Chan, THH=12645073600 | en_HK |
dc.identifier.scopusauthorid | Kharlamov, E=34979864600 | en_HK |
dc.identifier.scopusauthorid | Nutt, W=7003716191 | en_HK |
dc.identifier.scopusauthorid | Senellart, P=23009962800 | en_HK |
dc.identifier.citeulike | 10166790 | - |
dc.customcontrol.immutable | sml 151016 - merged | - |