File Download

There are no files associated with this item.

  Links for fulltext
     (May Require Subscription)
Supplementary

Conference Paper: A Sum-of-Ratios Multi-Dimensional- Knapsack Decomposition for DNN Resource Scheduling

TitleA Sum-of-Ratios Multi-Dimensional- Knapsack Decomposition for DNN Resource Scheduling
Authors
Issue Date2021
PublisherIEEE Computer Society. The Journal's web site is located at http://ieeexplore.ieee.org/xpl/conhome.jsp?punumber=1000359
Citation
IEEE International Conference on Computer Communications (INFOCOM), Virtual Conference, Vancouver, BC, Canada, 10-13 May 2021, p. 1-10 How to Cite?
AbstractIn recent years, to sustain the resource-intensive computational needs for training deep neural networks (DNNs), it is widely accepted that exploiting the parallelism in large-scale computing clusters is critical for the efficient deployments of DNN training jobs. However, existing resource schedulers for traditional computing clusters are not well suited for DNN training, which results in unsatisfactory job completion time performance. The limitations of these resource scheduling schemes motivate us to propose a new computing cluster resource scheduling framework that is able to leverage the special layered structure of DNN jobs and significantly improve their job completion times. Our contributions in this paper are three-fold: i) We develop a new resource scheduling analytical model by considering DNN’s layered structure, which enables us to analytically formulate the resource scheduling optimization problem for DNN training in computing clusters; ii) Based on the proposed performance analytical model, we then develop an efficient resource scheduling algorithm based on the widely adopted parameter-server architecture using a sum-of-ratios multi-dimensional-knapsack decomposition (SMD) method to offer strong performance guarantee; iii) We conduct extensive numerical experiments to demonstrate the effectiveness of the proposed schedule algorithm and its superior performance over the state of the art.
Persistent Identifierhttp://hdl.handle.net/10722/301292
ISSN
2020 SCImago Journal Rankings: 1.183
ISI Accession Number ID

 

DC FieldValueLanguage
dc.contributor.authorYu, M-
dc.contributor.authorWu, C-
dc.contributor.authorJi, B-
dc.contributor.authorLiu, J-
dc.date.accessioned2021-07-27T08:08:58Z-
dc.date.available2021-07-27T08:08:58Z-
dc.date.issued2021-
dc.identifier.citationIEEE International Conference on Computer Communications (INFOCOM), Virtual Conference, Vancouver, BC, Canada, 10-13 May 2021, p. 1-10-
dc.identifier.issn0743-166X-
dc.identifier.urihttp://hdl.handle.net/10722/301292-
dc.description.abstractIn recent years, to sustain the resource-intensive computational needs for training deep neural networks (DNNs), it is widely accepted that exploiting the parallelism in large-scale computing clusters is critical for the efficient deployments of DNN training jobs. However, existing resource schedulers for traditional computing clusters are not well suited for DNN training, which results in unsatisfactory job completion time performance. The limitations of these resource scheduling schemes motivate us to propose a new computing cluster resource scheduling framework that is able to leverage the special layered structure of DNN jobs and significantly improve their job completion times. Our contributions in this paper are three-fold: i) We develop a new resource scheduling analytical model by considering DNN’s layered structure, which enables us to analytically formulate the resource scheduling optimization problem for DNN training in computing clusters; ii) Based on the proposed performance analytical model, we then develop an efficient resource scheduling algorithm based on the widely adopted parameter-server architecture using a sum-of-ratios multi-dimensional-knapsack decomposition (SMD) method to offer strong performance guarantee; iii) We conduct extensive numerical experiments to demonstrate the effectiveness of the proposed schedule algorithm and its superior performance over the state of the art.-
dc.languageeng-
dc.publisherIEEE Computer Society. The Journal's web site is located at http://ieeexplore.ieee.org/xpl/conhome.jsp?punumber=1000359-
dc.relation.ispartofIEEE INFOCOM - IEEE Conference on Computer Communications-
dc.rightsIEEE INFOCOM - IEEE Conference on Computer Communications. Copyright © IEEE Computer Society.-
dc.rights©2021 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.-
dc.titleA Sum-of-Ratios Multi-Dimensional- Knapsack Decomposition for DNN Resource Scheduling-
dc.typeConference_Paper-
dc.identifier.emailWu, C: cwu@cs.hku.hk-
dc.identifier.authorityWu, C=rp01397-
dc.description.naturelink_to_subscribed_fulltext-
dc.identifier.doi10.1109/INFOCOM42981.2021.9488916-
dc.identifier.scopuseid_2-s2.0-85111902614-
dc.identifier.hkuros323508-
dc.identifier.spage1-
dc.identifier.epage10-
dc.identifier.isiWOS:000702210400247-
dc.publisher.placeUnited States-

Export via OAI-PMH Interface in XML Formats


OR


Export to Other Non-XML Formats