File Download
Links for fulltext
(May Require Subscription)
- Publisher Website: 10.1109/ICDE.2011.5767908
- Scopus: eid_2-s2.0-79957835085
- Find via
Supplementary
-
Citations:
- Scopus: 0
- Appears in Collections:
Conference Paper: A novel probabilistic pruning approach to speed up similarity queries in uncertain databases
Title | A novel probabilistic pruning approach to speed up similarity queries in uncertain databases |
---|---|
Authors | |
Keywords | Experimental evaluation Multi-dimensional space Possible world semantics Probabilistic density function Probability bound |
Issue Date | 2011 |
Publisher | IEEE, Computer Society. |
Citation | The 27th International Conference on Data Engineering (ICDE 2011), Hannover, Germany, 11-16 April 2011. In Proceedings of the International Conference on Data Engineering, 2011, p. 339-350 How to Cite? |
Abstract | In this paper, we propose a novel, effective and efficient probabilistic pruning criterion for probabilistic similarity queries on uncertain data. Our approach supports a general uncertainty model using continuous probabilistic density functions to describe the (possibly correlated) uncertain attributes of objects. In a nutshell, the problem to be solved is to compute the PDF of the random variable denoted by the probabilistic domination count: Given an uncertain database object B, an uncertain reference object R and a set D of uncertain database objects in a multi-dimensional space, the probabilistic domination count denotes the number of uncertain objects in D that are closer to R than B. This domination count can be used to answer a wide range of probabilistic similarity queries. Specifically, we propose a novel geometric pruning filter and introduce an iterative filter-refinement strategy for conservatively and progressively estimating the probabilistic domination count in an efficient way while keeping correctness according to the possible world semantics. In an experimental evaluation, we show that our proposed technique allows to acquire tight probability bounds for the probabilistic domination count quickly, even for large uncertain databases. © 2011 IEEE. |
Persistent Identifier | http://hdl.handle.net/10722/137648 |
ISSN | 2023 SCImago Journal Rankings: 1.306 |
References |
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Bernecker, T | en_HK |
dc.contributor.author | Emrich, T | en_HK |
dc.contributor.author | Kriegel, HP | en_HK |
dc.contributor.author | Mamoulis, N | en_HK |
dc.contributor.author | Renz, M | en_HK |
dc.contributor.author | Züfle, A | en_HK |
dc.date.accessioned | 2011-08-26T14:30:31Z | - |
dc.date.available | 2011-08-26T14:30:31Z | - |
dc.date.issued | 2011 | en_HK |
dc.identifier.citation | The 27th International Conference on Data Engineering (ICDE 2011), Hannover, Germany, 11-16 April 2011. In Proceedings of the International Conference on Data Engineering, 2011, p. 339-350 | en_HK |
dc.identifier.issn | 1084-4627 | en_HK |
dc.identifier.uri | http://hdl.handle.net/10722/137648 | - |
dc.description.abstract | In this paper, we propose a novel, effective and efficient probabilistic pruning criterion for probabilistic similarity queries on uncertain data. Our approach supports a general uncertainty model using continuous probabilistic density functions to describe the (possibly correlated) uncertain attributes of objects. In a nutshell, the problem to be solved is to compute the PDF of the random variable denoted by the probabilistic domination count: Given an uncertain database object B, an uncertain reference object R and a set D of uncertain database objects in a multi-dimensional space, the probabilistic domination count denotes the number of uncertain objects in D that are closer to R than B. This domination count can be used to answer a wide range of probabilistic similarity queries. Specifically, we propose a novel geometric pruning filter and introduce an iterative filter-refinement strategy for conservatively and progressively estimating the probabilistic domination count in an efficient way while keeping correctness according to the possible world semantics. In an experimental evaluation, we show that our proposed technique allows to acquire tight probability bounds for the probabilistic domination count quickly, even for large uncertain databases. © 2011 IEEE. | en_HK |
dc.language | eng | en_US |
dc.publisher | IEEE, Computer Society. | - |
dc.relation.ispartof | Proceedings of the International Conference on Data Engineering | en_HK |
dc.subject | Experimental evaluation | - |
dc.subject | Multi-dimensional space | - |
dc.subject | Possible world semantics | - |
dc.subject | Probabilistic density function | - |
dc.subject | Probability bound | - |
dc.title | A novel probabilistic pruning approach to speed up similarity queries in uncertain databases | en_HK |
dc.type | Conference_Paper | en_HK |
dc.identifier.openurl | http://library.hku.hk:4550/resserv?sid=HKU:IR&issn=1084-4627&volume=&spage=339&epage=350&date=2011&atitle=A+novel+probabilistic+pruning+approach+to+speed+up+similarity+queries+in+uncertain+databases | - |
dc.identifier.email | Mamoulis, N:nikos@cs.hku.hk | en_HK |
dc.identifier.authority | Mamoulis, N=rp00155 | en_HK |
dc.description.nature | link_to_subscribed_fulltext | - |
dc.identifier.doi | 10.1109/ICDE.2011.5767908 | en_HK |
dc.identifier.scopus | eid_2-s2.0-79957835085 | en_HK |
dc.identifier.hkuros | 190938 | en_US |
dc.relation.references | http://www.scopus.com/mlt/select.url?eid=2-s2.0-79957835085&selection=ref&src=s&origin=recordpage | en_HK |
dc.identifier.spage | 339 | en_HK |
dc.identifier.epage | 350 | en_HK |
dc.publisher.place | United States | en_HK |
dc.description.other | The 27th International Conference on Data Engineering (ICDE 2011), Hannover, Germany, 11-16 April 2011. In Proceedings of the International Conference on Data Engineering, 2011, p. 339-350 | - |
dc.identifier.scopusauthorid | Bernecker, T=24512341500 | en_HK |
dc.identifier.scopusauthorid | Emrich, T=35104699500 | en_HK |
dc.identifier.scopusauthorid | Kriegel, HP=7005718994 | en_HK |
dc.identifier.scopusauthorid | Mamoulis, N=6701782749 | en_HK |
dc.identifier.scopusauthorid | Renz, M=22433777600 | en_HK |
dc.identifier.scopusauthorid | Züfle, A=26666444500 | en_HK |
dc.identifier.issnl | 1084-4627 | - |