File Download
  Links for fulltext
     (May Require Subscription)
Supplementary

Article: Forward Scan based Plane Sweep Algorithm for Parallel Interval Joins

TitleForward Scan based Plane Sweep Algorithm for Parallel Interval Joins
Authors
Issue Date2017
PublisherVery Large Data Base (VLDB) Endowment Inc. The Journal's web site is located at http://vldb.org/pvldb/index.html
Citation
Proceedings of the VLDB Endowment (PVLDB), 2017, v. 10, p. 1346-1357 How to Cite?
AbstractThe interval join is a basic operation that finds application in temporal, spatial, and uncertain databases. Although a number of centralized and distributed algorithms have been proposed for the efficient evaluation of interval joins, classic plane sweep approaches have not been considered at their full potential. A recent piece of related work proposes an optimized approach based on plane sweep (PS) for modern hardware, showing that it greatly outperforms previous work. However, this approach depends on the development of a complex data structure and its parallelization has not been adequately studied. In this paper, we explore the applicability of a largely ignored forward scan (FS) based plane sweep algorithm, which is extremely simple to implement. We propose two optimizations of FS that greatly reduce its cost, making it competitive to the state-of-the-art single-threaded PS algorithm while achieving a lower memory footprint. In addition, we show the drawbacks of a previously proposed hash-based partitioning approach for parallel join processing and suggest a domain-based partitioning approach that does not produce duplicate results. Within our approach we propose a novel breakdown of the partition join jobs into a small number of independent mini-join jobs with varying cost and manage to avoid redundant comparisons. Finally, we show how these mini-joins can be scheduled in multiple CPU cores and propose an adaptive domain partitioning, aiming at load balancing. We include an experimental study that demonstrates the efficiency of our optimized FS and the scalability of our parallelization framework.
Persistent Identifierhttp://hdl.handle.net/10722/245814
ISSN
2023 Impact Factor: 2.6
2023 SCImago Journal Rankings: 2.666
ISI Accession Number ID

 

DC FieldValueLanguage
dc.contributor.authorBouros, P-
dc.contributor.authorMamoulis, N-
dc.date.accessioned2017-09-18T02:17:22Z-
dc.date.available2017-09-18T02:17:22Z-
dc.date.issued2017-
dc.identifier.citationProceedings of the VLDB Endowment (PVLDB), 2017, v. 10, p. 1346-1357-
dc.identifier.issn2150-8097-
dc.identifier.urihttp://hdl.handle.net/10722/245814-
dc.description.abstractThe interval join is a basic operation that finds application in temporal, spatial, and uncertain databases. Although a number of centralized and distributed algorithms have been proposed for the efficient evaluation of interval joins, classic plane sweep approaches have not been considered at their full potential. A recent piece of related work proposes an optimized approach based on plane sweep (PS) for modern hardware, showing that it greatly outperforms previous work. However, this approach depends on the development of a complex data structure and its parallelization has not been adequately studied. In this paper, we explore the applicability of a largely ignored forward scan (FS) based plane sweep algorithm, which is extremely simple to implement. We propose two optimizations of FS that greatly reduce its cost, making it competitive to the state-of-the-art single-threaded PS algorithm while achieving a lower memory footprint. In addition, we show the drawbacks of a previously proposed hash-based partitioning approach for parallel join processing and suggest a domain-based partitioning approach that does not produce duplicate results. Within our approach we propose a novel breakdown of the partition join jobs into a small number of independent mini-join jobs with varying cost and manage to avoid redundant comparisons. Finally, we show how these mini-joins can be scheduled in multiple CPU cores and propose an adaptive domain partitioning, aiming at load balancing. We include an experimental study that demonstrates the efficiency of our optimized FS and the scalability of our parallelization framework.-
dc.languageeng-
dc.publisherVery Large Data Base (VLDB) Endowment Inc. The Journal's web site is located at http://vldb.org/pvldb/index.html-
dc.relation.ispartofProceedings of the VLDB Endowment (PVLDB)-
dc.rightsProceedings of the VLDB Endowment (PVLDB). Copyright © Very Large Data Base (VLDB) Endowment Inc.-
dc.titleForward Scan based Plane Sweep Algorithm for Parallel Interval Joins-
dc.typeArticle-
dc.identifier.emailMamoulis, N: nikos@cs.hku.hk-
dc.identifier.authorityMamoulis, N=rp00155-
dc.description.naturepublished_or_final_version-
dc.identifier.doi10.14778/3137628.3137644-
dc.identifier.scopuseid_2-s2.0-85037047212-
dc.identifier.hkuros276647-
dc.identifier.volume10-
dc.identifier.spage1346-
dc.identifier.epage1357-
dc.identifier.isiWOS:000416492900016-
dc.publisher.placeUnited States-
dc.identifier.issnl2150-8097-

Export via OAI-PMH Interface in XML Formats


OR


Export to Other Non-XML Formats