File Download

There are no files associated with this item.

Supplementary

Conference Paper: PLOVER: Fast, Multi-core Scalable Virtual Machine Fault-tolerance

TitlePLOVER: Fast, Multi-core Scalable Virtual Machine Fault-tolerance
Authors
Issue Date2018
PublisherUSENIX Association.
Citation
Proceedings of the 15th USENIX Symposium on Networked Systems Design and Implementation 2018 (NSDI '18), Renton, WA, USA, 9–11 April 2018, p. 483-499 How to Cite?
AbstractCloud computing enables a vast deployment of online services in virtualized infrastructures, making it crucial to provide fast fault-tolerance for virtual machines (VM). Unfortunately, despite much effort, achieving fast and multi-core scalable VM fault-tolerance is still an open problem. A main reason is that the dominant primarybackup approach (e.g., REMUS) transfers an excessive amount of memory pages, all of them, updated by a service replicated on the primary VM and the backup VM. This approach makes the two VMs identical but greatly degrades the performance of services. State machine replication (SMR) enforces the same total order of inputs for a service replicated across physical hosts. This makes most updated memory pages across hosts the same and they do not need to be transferred. We present Virtualized SMR (VSMR), a new approach to tackle this open problem. VSMR enforces the same order of inputs for a VM replicated across hosts. It uses commodity hardware to efficiently compute updated page hashes and to compare them across replicas. Therefore, VSMR can efficiently enforce identical VMs by transferring only divergent pages. An extensive evaluation on PLOVER, the first VSMR system, shows that PLOVER’s throughput on multi-core is 2.2X to 3.8X higher than three popular primary-backup systems. Meanwhile, PLOVER consumed 9.2X less network bandwidth than both of them. PLOVER’s source code and raw results are released on github.com/ hku-systems/plover.
Persistent Identifierhttp://hdl.handle.net/10722/260640
ISBN

 

DC FieldValueLanguage
dc.contributor.authorWang, C-
dc.contributor.authorChen, XS-
dc.contributor.authorJia, WW-
dc.contributor.authorLi, BX-
dc.contributor.authorQiu, HR-
dc.contributor.authorZhao, SX-
dc.contributor.authorCui, H-
dc.date.accessioned2018-09-14T08:44:55Z-
dc.date.available2018-09-14T08:44:55Z-
dc.date.issued2018-
dc.identifier.citationProceedings of the 15th USENIX Symposium on Networked Systems Design and Implementation 2018 (NSDI '18), Renton, WA, USA, 9–11 April 2018, p. 483-499-
dc.identifier.isbn978-1-931971-43-0-
dc.identifier.urihttp://hdl.handle.net/10722/260640-
dc.description.abstractCloud computing enables a vast deployment of online services in virtualized infrastructures, making it crucial to provide fast fault-tolerance for virtual machines (VM). Unfortunately, despite much effort, achieving fast and multi-core scalable VM fault-tolerance is still an open problem. A main reason is that the dominant primarybackup approach (e.g., REMUS) transfers an excessive amount of memory pages, all of them, updated by a service replicated on the primary VM and the backup VM. This approach makes the two VMs identical but greatly degrades the performance of services. State machine replication (SMR) enforces the same total order of inputs for a service replicated across physical hosts. This makes most updated memory pages across hosts the same and they do not need to be transferred. We present Virtualized SMR (VSMR), a new approach to tackle this open problem. VSMR enforces the same order of inputs for a VM replicated across hosts. It uses commodity hardware to efficiently compute updated page hashes and to compare them across replicas. Therefore, VSMR can efficiently enforce identical VMs by transferring only divergent pages. An extensive evaluation on PLOVER, the first VSMR system, shows that PLOVER’s throughput on multi-core is 2.2X to 3.8X higher than three popular primary-backup systems. Meanwhile, PLOVER consumed 9.2X less network bandwidth than both of them. PLOVER’s source code and raw results are released on github.com/ hku-systems/plover.-
dc.languageeng-
dc.publisherUSENIX Association.-
dc.relation.ispartofProceedings of the 15th USENIX Symposium on Networked Systems Design and Implementation 2018 (NSDI '18)-
dc.titlePLOVER: Fast, Multi-core Scalable Virtual Machine Fault-tolerance-
dc.typeConference_Paper-
dc.identifier.emailCui, H: heming@hku.hk-
dc.identifier.authorityCui, H=rp02008-
dc.description.naturelink_to_subscribed_fulltext-
dc.identifier.hkuros291705-
dc.identifier.spage483-
dc.identifier.epage499-
dc.publisher.placeUnited States-

Export via OAI-PMH Interface in XML Formats


OR


Export to Other Non-XML Formats