Efficient join processing over uncertain data

Cheng, R; Singh, S; Prabhakar, S; Shah, R; Vitter, JS; Xia, Y

Publisher Website: 10.1145/1183614.1183719
Scopus: eid_2-s2.0-34547646325

Appears in Collections:
- Computer Science: Conference papers

Conference Paper: Efficient join processing over uncertain data

Title	Efficient join processing over uncertain data
Authors	Cheng, R Singh, S Prabhakar, S Shah, R Vitter, JS Xia, Y
Keywords	Imprecise data Joins Uncertainty management
Issue Date	2006
Publisher	Association for Computing Machinery.
Citation	The ACM 15th International Conference on Information and Knowledge Management (CIKM 2006), Arlington, VA., 5-11 November 2006. In Conference Proceedings, 2006, p. 738-747. DOI: http://dx.doi.org/10.1145/1183614.1183719
Abstract	In many applications data values are inherently uncertain. This includes moving-objects, sensors and biological databases. There has been recent interest in the development of database management systems that can handle uncertain data. Some proposals for such systems include attribute values that are uncertain. In particular, an attribute value can be modeled as a range of possible values, associated with a probability density function. Previous efforts for this type of data have only addressed simple queries such as range and nearest-neighbor queries. Queries that join multiple relations have not been addressed in earlier work despite the significance of joins in databases. In this paper we address join queries over uncertain data. We propose a semantics for the join operation, define probabilistic operators over uncertain data, and propose join algorithms that provide efficient execution of probabilistic joins. The paper focuses on an important class of joins termed probabilistic threshold joins that avoid some of the semantic complexities of dealing with uncertain data. For this class of joins we develop three sets of optimization techniques: item-level, page-level, and index-level pruning. These techniques facilitate pruning with little space and time overhead, and are easily adapted to most join algorithms. We verify the performance of these techniques experimentally. Copyright 2006 ACM.
Persistent Identifier	http://hdl.handle.net/10722/129570
ISBN	1-59593-433-2
References	References in Scopus
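
The abstract models an uncertain attribute as a range of possible values with a probability density function, and describes joins that keep only pairs whose join probability meets a threshold. The following is a minimal sketch of that idea, not the paper's algorithm. It assumes uniform densities, independent values, and a "greater than" join predicate. The names `UncertainValue`, `prob_greater`, and `threshold_join` are hypothetical, and the interval check inside `prob_greater` is only an illustrative shortcut in the spirit of item-level pruning.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class UncertainValue:
    """Attribute known only as an interval [lo, hi]; uniform pdf is an assumption."""
    lo: float
    hi: float

    def __post_init__(self):
        if not self.lo < self.hi:
            raise ValueError("expected lo < hi")


def prob_greater(a, b, steps=1000):
    """Approximate P(a > b) for independent, uniformly distributed a and b."""
    # Illustrative shortcut: disjoint intervals decide the probability
    # without any integration.
    if a.hi <= b.lo:
        return 0.0
    if a.lo >= b.hi:
        return 1.0
    # Midpoint rule for the integral of pdf_a(x) * cdf_b(x) over a's interval.
    # With a uniform pdf_a this is the average of cdf_b at the sample points.
    width = (a.hi - a.lo) / steps
    total = 0.0
    for i in range(steps):
        x = a.lo + (i + 0.5) * width
        cdf_b = min(1.0, max(0.0, (x - b.lo) / (b.hi - b.lo)))
        total += cdf_b
    return total / steps


def threshold_join(left, right, threshold):
    """Nested-loop join keeping pairs whose P(left > right) >= threshold.

    left and right are lists of (key, UncertainValue) tuples.
    """
    result = []
    for lkey, lval in left:
        for rkey, rval in right:
            p = prob_greater(lval, rval)
            if p >= threshold:
                result.append((lkey, rkey, p))
    return result


if __name__ == "__main__":
    left = [("r1", UncertainValue(0, 10)), ("r2", UncertainValue(20, 30))]
    right = [("s1", UncertainValue(5, 15))]
    print(threshold_join(left, right, threshold=0.5))
```

In this sketch the threshold turns a probabilistic answer into an ordinary set of pairs, so the caller does not have to interpret a probability attached to every possible match.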

DC Field	Value	Language
dc.contributor.author	Cheng, R	en_HK
dc.contributor.author	Singh, S	en_HK
dc.contributor.author	Prabhakar, S	en_HK
dc.contributor.author	Shah, R	en_HK
dc.contributor.author	Vitter, JS	en_HK
dc.contributor.author	Xia, Y	en_HK
dc.date.accessioned	2010-12-23T08:39:22Z	-
dc.date.available	2010-12-23T08:39:22Z	-
dc.date.issued	2006	en_HK
dc.identifier.citation	The ACM 15th International Conference on Information and Knowledge Management (CIKM 2006), Arlington, VA., 5-11 November 2006. In Conference Proceedings, 2006, p. 738-747	en_HK
dc.identifier.isbn	1-59593-433-2	-
dc.identifier.uri	http://hdl.handle.net/10722/129570	-
dc.description.abstract	In many applications data values are inherently uncertain. This includes moving-objects, sensors and biological databases. There has been recent interest in the development of database management systems that can handle uncertain data. Some proposals for such systems include attribute values that are uncertain. In particular, an attribute value can be modeled as a range of possible values, associated with a probability density function. Previous efforts for this type of data have only addressed simple queries such as range and nearest-neighbor queries. Queries that join multiple relations have not been addressed in earlier work despite the significance of joins in databases. In this paper we address join queries over uncertain data. We propose a semantics for the join operation, define probabilistic operators over uncertain data, and propose join algorithms that provide efficient execution of probabilistic joins. The paper focuses on an important class of joins termed probabilistic threshold joins that avoid some of the semantic complexities of dealing with uncertain data. For this class of joins we develop three sets of optimization techniques: item-level, page-level, and index-level pruning. These techniques facilitate pruning with little space and time overhead, and are easily adapted to most join algorithms. We verify the performance of these techniques experimentally. Copyright 2006 ACM.	en_HK
dc.language	eng	en_US
dc.publisher	Association for Computing Machinery.	-
dc.relation.ispartof	Proceedings of the ACM 15th International Conference on Information and Knowledge Management, 2006, p. 738-747	en_HK
dc.subject	Imprecise data	en_HK
dc.subject	Joins	en_HK
dc.subject	Uncertainty management	en_HK
dc.title	Efficient join processing over uncertain data	en_HK
dc.type	Conference_Paper	en_HK
dc.identifier.openurl	http://library.hku.hk:4550/resserv?sid=HKU:IR&issn=1595934332&volume=&spage=738&epage=747&date=2006&atitle=Efficient+join+processing+over+uncertain+data	-
dc.identifier.email	Cheng, R:ckcheng@cs.hku.hk	en_HK
dc.identifier.authority	Cheng, R=rp00074	en_HK
dc.description.nature	link_to_OA_fulltext	-
dc.identifier.doi	10.1145/1183614.1183719	en_HK
dc.identifier.scopus	eid_2-s2.0-34547646325	en_HK
dc.identifier.hkuros	176478	en_US
dc.relation.references	http://www.scopus.com/mlt/select.url?eid=2-s2.0-34547646325&selection=ref&src=s&origin=recordpage	en_HK
dc.identifier.spage	738	en_HK
dc.identifier.epage	747	en_HK
dc.publisher.place	United States	-
dc.description.other	The ACM 15th International Conference on Information and Knowledge Management (CIKM 2006), Arlington, VA., 5-11 November 2006. In Conference Proceedings, 2006, p. 738-747	-
dc.identifier.scopusauthorid	Cheng, R=7201955416	en_HK
dc.identifier.scopusauthorid	Singh, S=14028945800	en_HK
dc.identifier.scopusauthorid	Prabhakar, S=7101672592	en_HK
dc.identifier.scopusauthorid	Shah, R=35365088300	en_HK
dc.identifier.scopusauthorid	Vitter, JS=7005508549	en_HK
dc.identifier.scopusauthorid	Xia, Y=8557162400	en_HK
