File Download
Links for fulltext
(May Require Subscription)
- Publisher Website: 10.1145/1183614.1183719
- Scopus: eid_2-s2.0-34547646325
Supplementary
-
Citations:
- Scopus: 0
- Appears in Collections:
Conference Paper: Efficient join processing over uncertain data
Title | Efficient join processing over uncertain data |
---|---|
Authors | |
Keywords | Imprecise data Joins Uncertainty management |
Issue Date | 2006 |
Publisher | Association for Computing Machinery. |
Citation | The ACM 15th International Conference on Information and Knowledge Management (CIKM 2006), Arlington, VA., 5-11 November 2006. In Conference Proceedings, 2006, p. 738-747 How to Cite? |
Abstract | In many applications data values are inherently uncertain. This includes moving-objects, sensors and biological databases. There has been recent interest in the development of database management systems that can handle uncertain data. Some proposals for such systems include attribute values that are uncertain. In particular, an attribute value can be modeled as a range of possible values, associated with a probability density function. Previous efforts for this type of data have only addressed simple queries such as range and nearest-neighbor queries. Queries that join multiple relations have not been addressed in earlier work despite the significance of joins in databases. In this paper we address join queries over uncertain data. We propose a semantics for the join operation, define probabilistic operators over uncertain data, and propose join algorithms that provide efficient execution of probabilistic joins. The paper focuses on an important class of joins termed probabilistic threshold joins that avoid some of the semantic complexities of dealing with uncertain data. For this class of joins we develop three sets of optimization techniques: item-level, page-level, and index-level pruning. These techniques facilitate pruning with little space and time overhead, and are easily adapted to most join algorithms. We verify the performance of these techniques experimentally. Copyright 2006 ACM. |
Persistent Identifier | http://hdl.handle.net/10722/129570 |
ISBN | |
References |
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Cheng, R | en_HK |
dc.contributor.author | Singh, S | en_HK |
dc.contributor.author | Prabhakar, S | en_HK |
dc.contributor.author | Shah, R | en_HK |
dc.contributor.author | Vitter, JS | en_HK |
dc.contributor.author | Xia, Y | en_HK |
dc.date.accessioned | 2010-12-23T08:39:22Z | - |
dc.date.available | 2010-12-23T08:39:22Z | - |
dc.date.issued | 2006 | en_HK |
dc.identifier.citation | The ACM 15th International Conference on Information and Knowledge Management (CIKM 2006), Arlington, VA., 5-11 November 2006. In Conference Proceedings, 2006, p. 738-747 | en_HK |
dc.identifier.isbn | 1-59593-433-2 | - |
dc.identifier.uri | http://hdl.handle.net/10722/129570 | - |
dc.description.abstract | In many applications data values are inherently uncertain. This includes moving-objects, sensors and biological databases. There has been recent interest in the development of database management systems that can handle uncertain data. Some proposals for such systems include attribute values that are uncertain. In particular, an attribute value can be modeled as a range of possible values, associated with a probability density function. Previous efforts for this type of data have only addressed simple queries such as range and nearest-neighbor queries. Queries that join multiple relations have not been addressed in earlier work despite the significance of joins in databases. In this paper we address join queries over uncertain data. We propose a semantics for the join operation, define probabilistic operators over uncertain data, and propose join algorithms that provide efficient execution of probabilistic joins. The paper focuses on an important class of joins termed probabilistic threshold joins that avoid some of the semantic complexities of dealing with uncertain data. For this class of joins we develop three sets of optimization techniques: item-level, page-level, and index-level pruning. These techniques facilitate pruning with little space and time overhead, and are easily adapted to most join algorithms. We verify the performance of these techniques experimentally. Copyright 2006 ACM. | en_HK |
dc.language | eng | en_US |
dc.publisher | Association for Computing Machinery. | - |
dc.relation.ispartof | Proceedings of the ACM 15th International Conference on Information and Knowledge Management, 2006, p. 738-747 | en_HK |
dc.subject | Imprecise data | en_HK |
dc.subject | Joins | en_HK |
dc.subject | Uncertainty management | en_HK |
dc.title | Efficient join processing over uncertain data | en_HK |
dc.type | Conference_Paper | en_HK |
dc.identifier.openurl | http://library.hku.hk:4550/resserv?sid=HKU:IR&issn=1595934332&volume=&spage=738&epage=747&date=2006&atitle=Efficient+join+processing+over+uncertain+data | - |
dc.identifier.email | Cheng, R:ckcheng@cs.hku.hk | en_HK |
dc.identifier.authority | Cheng, R=rp00074 | en_HK |
dc.description.nature | link_to_OA_fulltext | - |
dc.identifier.doi | 10.1145/1183614.1183719 | en_HK |
dc.identifier.scopus | eid_2-s2.0-34547646325 | en_HK |
dc.identifier.hkuros | 176478 | en_US |
dc.relation.references | http://www.scopus.com/mlt/select.url?eid=2-s2.0-34547646325&selection=ref&src=s&origin=recordpage | en_HK |
dc.identifier.spage | 738 | en_HK |
dc.identifier.epage | 747 | en_HK |
dc.publisher.place | United States | - |
dc.description.other | The ACM 15th International Conference on Information and Knowledge Management (CIKM 2006), Arlington, VA., 5-11 November 2006. In Conference Proceedings, 2006, p. 738-747 | - |
dc.identifier.scopusauthorid | Cheng, R=7201955416 | en_HK |
dc.identifier.scopusauthorid | Singh, S=14028945800 | en_HK |
dc.identifier.scopusauthorid | Prabhakar, S=7101672592 | en_HK |
dc.identifier.scopusauthorid | Shah, R=35365088300 | en_HK |
dc.identifier.scopusauthorid | Vitter, JS=7005508549 | en_HK |
dc.identifier.scopusauthorid | Xia, Y=8557162400 | en_HK |