File Download
  Links for fulltext
     (May Require Subscription)
Supplementary

Conference Paper: Efficient join processing over uncertain data

TitleEfficient join processing over uncertain data
Authors
KeywordsImprecise data
Joins
Uncertainty management
Issue Date2006
PublisherAssociation for Computing Machinery.
Citation
The ACM 15th International Conference on Information and Knowledge Management (CIKM 2006), Arlington, VA., 5-11 November 2006. In Conference Proceedings, 2006, p. 738-747 How to Cite?
AbstractIn many applications data values are inherently uncertain. This includes moving-objects, sensors and biological databases. There has been recent interest in the development of database management systems that can handle uncertain data. Some proposals for such systems include attribute values that are uncertain. In particular, an attribute value can be modeled as a range of possible values, associated with a probability density function. Previous efforts for this type of data have only addressed simple queries such as range and nearest-neighbor queries. Queries that join multiple relations have not been addressed in earlier work despite the significance of joins in databases. In this paper we address join queries over uncertain data. We propose a semantics for the join operation, define probabilistic operators over uncertain data, and propose join algorithms that provide efficient execution of probabilistic joins. The paper focuses on an important class of joins termed probabilistic threshold joins that avoid some of the semantic complexities of dealing with uncertain data. For this class of joins we develop three sets of optimization techniques: item-level, page-level, and index-level pruning. These techniques facilitate pruning with little space and time overhead, and are easily adapted to most join algorithms. We verify the performance of these techniques experimentally. Copyright 2006 ACM.
Persistent Identifierhttp://hdl.handle.net/10722/129570
ISBN
References

 

DC FieldValueLanguage
dc.contributor.authorCheng, Ren_HK
dc.contributor.authorSingh, Sen_HK
dc.contributor.authorPrabhakar, Sen_HK
dc.contributor.authorShah, Ren_HK
dc.contributor.authorVitter, JSen_HK
dc.contributor.authorXia, Yen_HK
dc.date.accessioned2010-12-23T08:39:22Z-
dc.date.available2010-12-23T08:39:22Z-
dc.date.issued2006en_HK
dc.identifier.citationThe ACM 15th International Conference on Information and Knowledge Management (CIKM 2006), Arlington, VA., 5-11 November 2006. In Conference Proceedings, 2006, p. 738-747en_HK
dc.identifier.isbn1-59593-433-2-
dc.identifier.urihttp://hdl.handle.net/10722/129570-
dc.description.abstractIn many applications data values are inherently uncertain. This includes moving-objects, sensors and biological databases. There has been recent interest in the development of database management systems that can handle uncertain data. Some proposals for such systems include attribute values that are uncertain. In particular, an attribute value can be modeled as a range of possible values, associated with a probability density function. Previous efforts for this type of data have only addressed simple queries such as range and nearest-neighbor queries. Queries that join multiple relations have not been addressed in earlier work despite the significance of joins in databases. In this paper we address join queries over uncertain data. We propose a semantics for the join operation, define probabilistic operators over uncertain data, and propose join algorithms that provide efficient execution of probabilistic joins. The paper focuses on an important class of joins termed probabilistic threshold joins that avoid some of the semantic complexities of dealing with uncertain data. For this class of joins we develop three sets of optimization techniques: item-level, page-level, and index-level pruning. These techniques facilitate pruning with little space and time overhead, and are easily adapted to most join algorithms. We verify the performance of these techniques experimentally. Copyright 2006 ACM.en_HK
dc.languageengen_US
dc.publisherAssociation for Computing Machinery.-
dc.relation.ispartofProceedings of the ACM 15th International Conference on Information and Knowledge Management, 2006, p. 738-747en_HK
dc.subjectImprecise dataen_HK
dc.subjectJoinsen_HK
dc.subjectUncertainty managementen_HK
dc.titleEfficient join processing over uncertain dataen_HK
dc.typeConference_Paperen_HK
dc.identifier.openurlhttp://library.hku.hk:4550/resserv?sid=HKU:IR&issn=1595934332&volume=&spage=738&epage=747&date=2006&atitle=Efficient+join+processing+over+uncertain+data-
dc.identifier.emailCheng, R:ckcheng@cs.hku.hken_HK
dc.identifier.authorityCheng, R=rp00074en_HK
dc.description.naturelink_to_OA_fulltext-
dc.identifier.doi10.1145/1183614.1183719en_HK
dc.identifier.scopuseid_2-s2.0-34547646325en_HK
dc.identifier.hkuros176478en_US
dc.relation.referenceshttp://www.scopus.com/mlt/select.url?eid=2-s2.0-34547646325&selection=ref&src=s&origin=recordpageen_HK
dc.identifier.spage738en_HK
dc.identifier.epage747en_HK
dc.publisher.placeUnited States-
dc.description.otherThe ACM 15th International Conference on Information and Knowledge Management (CIKM 2006), Arlington, VA., 5-11 November 2006. In Conference Proceedings, 2006, p. 738-747-
dc.identifier.scopusauthoridCheng, R=7201955416en_HK
dc.identifier.scopusauthoridSingh, S=14028945800en_HK
dc.identifier.scopusauthoridPrabhakar, S=7101672592en_HK
dc.identifier.scopusauthoridShah, R=35365088300en_HK
dc.identifier.scopusauthoridVitter, JS=7005508549en_HK
dc.identifier.scopusauthoridXia, Y=8557162400en_HK

Export via OAI-PMH Interface in XML Formats


OR


Export to Other Non-XML Formats