File Download
Supplementary

Conference Paper: Fast evaluation of iceberg pattern-based aggregate queries

TitleFast evaluation of iceberg pattern-based aggregate queries
Authors
KeywordsOLAP
Iceberg
Probabilistic Algorithm
Issue Date2013
PublisherACM.
Citation
The 22nd ACM International Conference on Information and Knowledge Management (CIKM 2013), San Francisco, CA., 27 October-1 November 2013. In Conference Proceedings, 2013, p. 1-6 How to Cite?
AbstractA Sequence OLAP (S-OLAP) system provides a platform on which pattern-based aggregate (PBA) queries on a sequence database are evaluated. In its simplest form, a PBA query consists of a pattern template T and an aggregate function F. A pattern template is a sequence of variables, each is defined over a domain. For example, the template T = (X,Y ,Y ,X) consists of two variables X and Y . Each variable is instantiated with all possible values in its corresponding domain to derive all possible patterns of the template. Sequences are grouped based on the patterns they possess. The answer to a PBA query is a sequence cuboid (s-cuboid), which is a multidimensional array of cells. Each cell is associated with a pattern instantiated from the query’s pattern template. The value of each s-cuboid cell is obtained by applying the aggregate function F to the set of data sequences that belong to that cell. Since a pattern template can involve many variables and can be arbitrarily long, the induced s-cuboid for a PBA query can be huge. For most analytical tasks, however, only iceberg cells with very large aggregate values are of interest. This paper proposes an efficient approach to identify and evaluate iceberg cells of s-cuboids. Experimental results show that our algorithms are orders of magnitude faster than existing approaches.
Persistent Identifierhttp://hdl.handle.net/10722/189634
ISBN

 

DC FieldValueLanguage
dc.contributor.authorHe, Zen_US
dc.contributor.authorWong, Pen_US
dc.contributor.authorKao, Ben_US
dc.contributor.authorLo, Een_US
dc.contributor.authorCheng, Ren_US
dc.date.accessioned2013-09-17T14:50:31Z-
dc.date.available2013-09-17T14:50:31Z-
dc.date.issued2013en_US
dc.identifier.citationThe 22nd ACM International Conference on Information and Knowledge Management (CIKM 2013), San Francisco, CA., 27 October-1 November 2013. In Conference Proceedings, 2013, p. 1-6en_US
dc.identifier.isbn978-1-4503-2263-8-
dc.identifier.urihttp://hdl.handle.net/10722/189634-
dc.description.abstractA Sequence OLAP (S-OLAP) system provides a platform on which pattern-based aggregate (PBA) queries on a sequence database are evaluated. In its simplest form, a PBA query consists of a pattern template T and an aggregate function F. A pattern template is a sequence of variables, each is defined over a domain. For example, the template T = (X,Y ,Y ,X) consists of two variables X and Y . Each variable is instantiated with all possible values in its corresponding domain to derive all possible patterns of the template. Sequences are grouped based on the patterns they possess. The answer to a PBA query is a sequence cuboid (s-cuboid), which is a multidimensional array of cells. Each cell is associated with a pattern instantiated from the query’s pattern template. The value of each s-cuboid cell is obtained by applying the aggregate function F to the set of data sequences that belong to that cell. Since a pattern template can involve many variables and can be arbitrarily long, the induced s-cuboid for a PBA query can be huge. For most analytical tasks, however, only iceberg cells with very large aggregate values are of interest. This paper proposes an efficient approach to identify and evaluate iceberg cells of s-cuboids. Experimental results show that our algorithms are orders of magnitude faster than existing approaches.-
dc.languageengen_US
dc.publisherACM.-
dc.relation.ispartof22nd ACM International Conference on Information and Knowledge Management, CIKM 2013 Proceedingsen_US
dc.subjectOLAP-
dc.subjectIceberg-
dc.subjectProbabilistic Algorithm-
dc.titleFast evaluation of iceberg pattern-based aggregate queriesen_US
dc.typeConference_Paperen_US
dc.identifier.emailKao, B: kao@cs.hku.hken_US
dc.identifier.emailCheng, R: ckcheng@cs.hku.hken_US
dc.identifier.authorityKao, B=rp00123en_US
dc.identifier.authorityCheng, R=rp00074en_US
dc.description.naturelink_to_OA_fulltext-
dc.identifier.hkuros222851en_US
dc.identifier.spage1-
dc.identifier.epage6-
dc.publisher.placeUnited States-
dc.customcontrol.immutablesml 131023-

Export via OAI-PMH Interface in XML Formats


OR


Export to Other Non-XML Formats