File Download
There are no files associated with this item.

Links for fulltext (may require subscription):
- Publisher Website: 10.1109/GlobalSIP45357.2019.8969082
- Scopus: eid_2-s2.0-85079266840

Citations:
- Scopus: 0

Appears in Collections:
Conference Paper: High-Dimensional Stochastic Gradient Quantization for Communication-Efficient Edge Learning
| Title | High-Dimensional Stochastic Gradient Quantization for Communication-Efficient Edge Learning |
|---|---|
| Authors | Du, Y; Yang, S; Huang, K |
| Keywords | approximation theory; gradient methods; learning (artificial intelligence); quantisation (signal); stochastic processes |
| Issue Date | 2019 |
| Publisher | IEEE. The Journal's web site is located at https://ieeexplore.ieee.org/xpl/conhome.jsp?punumber=1803434 |
| Citation | The 7th IEEE Global Conference on Signal and Information Processing (GlobalSIP), Ottawa, Ontario, Canada, 11-14 November 2019, p. 1-5 |
| Abstract | Edge machine learning involves the deployment of machine learning algorithms at the network edge so as to leverage massive mobile data and distributed computation resources. Many edge learning frameworks (e.g., federated learning) have been developed based on distributed gradient descent. In this approach, stochastic gradients are computed at edge devices and transmitted to an edge server, where they are aggregated to update a global AI model. Since each gradient is typically high-dimensional (with millions to billions of coefficients), communication overhead may become a bottleneck for edge learning. In this work, we propose a novel gradient compression scheme to reduce this overhead. Specifically, in the proposed scheme, the norm of the stochastic gradient is quantized using a uniform quantizer, while the normalized stochastic gradient is decomposed into block gradients. A Grassmannian codebook is applied to quantize each normalized block gradient. The quantized versions are assembled using a so-called hinge vector, which is quantized using another Grassmannian codebook. Furthermore, a practical bit-allocation strategy is developed. By simulations, we show that similar learning performance can be achieved with substantially lower communication overhead than the one-bit scalar quantization scheme used in the state-of-the-art design, namely signed SGD. |
| Persistent Identifier | http://hdl.handle.net/10722/290714 |
| ISBN | 9781728127248 |
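The scheme the abstract describes (quantize the gradient norm with a uniform scalar quantizer, split the normalized gradient into blocks, and map each block direction to a Grassmannian codeword) can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the random unit-norm codebook stands in for a properly designed Grassmannian codebook, the scalar quantizer's range `[0, 10]` is an arbitrary assumption, and the paper's second codebook for the hinge vector and its bit-allocation strategy are omitted here (the block norms are kept unquantized).

```python
import numpy as np

rng = np.random.default_rng(0)

def quantize_gradient(g, num_blocks=4, norm_bits=8, codebook_size=16):
    """Illustrative block-wise gradient quantization (not the paper's exact scheme)."""
    norm = np.linalg.norm(g)
    # Uniform scalar quantization of the gradient norm on an assumed range [0, 10].
    levels = np.linspace(0.0, 10.0, 2 ** norm_bits)
    q_norm = levels[np.abs(levels - norm).argmin()]

    # Decompose the normalized gradient into equal-sized blocks.
    blocks = np.split(g / norm, num_blocks)
    d = blocks[0].size

    # Random unit-norm codebook as a hypothetical stand-in for a Grassmannian codebook.
    codebook = rng.standard_normal((codebook_size, d))
    codebook /= np.linalg.norm(codebook, axis=1, keepdims=True)

    indices, hinge, signs = [], [], []
    for b in blocks:
        b_norm = np.linalg.norm(b)
        hinge.append(b_norm)                 # block norms form the (unquantized) hinge vector
        corr = codebook @ (b / b_norm)
        idx = int(np.abs(corr).argmax())     # nearest codeword by absolute correlation
        indices.append(idx)
        signs.append(np.sign(corr[idx]))

    # Server-side reconstruction: reassemble quantized block directions,
    # scale each by its hinge entry and the whole vector by the quantized norm.
    rec = np.concatenate([s * h * codebook[i] for s, h, i in zip(signs, hinge, indices)])
    return q_norm * rec, q_norm, indices
```

Because the hinge entries are the norms of the blocks of a unit vector, the reassembled direction has unit norm, so the reconstruction's norm equals the quantized scalar norm by construction.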
DC Field | Value | Language
---|---|---
dc.contributor.author | Du, Y | - |
dc.contributor.author | Yang, S | - |
dc.contributor.author | Huang, K | - |
dc.date.accessioned | 2020-11-02T05:46:04Z | - |
dc.date.available | 2020-11-02T05:46:04Z | - |
dc.date.issued | 2019 | - |
dc.identifier.citation | The 7th IEEE Global Conference on Signal and Information Processing (GlobalSIP), Ottawa, Ontario, Canada, 11-14 November 2019, p. 1-5 | - |
dc.identifier.isbn | 9781728127248 | - |
dc.identifier.uri | http://hdl.handle.net/10722/290714 | - |
dc.description.abstract | Edge machine learning involves the deployment of machine learning algorithms at the network edge so as to leverage massive mobile data and distributed computation resources. Many edge learning frameworks (e.g., federated learning) have been developed based on distributed gradient descent. In this approach, stochastic gradients are computed at edge devices and transmitted to an edge server, where they are aggregated to update a global AI model. Since each gradient is typically high-dimensional (with millions to billions of coefficients), communication overhead may become a bottleneck for edge learning. In this work, we propose a novel gradient compression scheme to reduce this overhead. Specifically, in the proposed scheme, the norm of the stochastic gradient is quantized using a uniform quantizer, while the normalized stochastic gradient is decomposed into block gradients. A Grassmannian codebook is applied to quantize each normalized block gradient. The quantized versions are assembled using a so-called hinge vector, which is quantized using another Grassmannian codebook. Furthermore, a practical bit-allocation strategy is developed. By simulations, we show that similar learning performance can be achieved with substantially lower communication overhead than the one-bit scalar quantization scheme used in the state-of-the-art design, namely signed SGD. | - |
dc.language | eng | - |
dc.publisher | IEEE. The Journal's web site is located at https://ieeexplore.ieee.org/xpl/conhome.jsp?punumber=1803434 | - |
dc.relation.ispartof | IEEE Global Conference on Signal and Information Processing (GlobalSIP) Proceedings | - |
dc.rights | IEEE Global Conference on Signal and Information Processing (GlobalSIP) Proceedings. Copyright © IEEE. | - |
dc.rights | ©2019 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works. | - |
dc.subject | approximation theory | - |
dc.subject | gradient methods | - |
dc.subject | learning (artificial intelligence) | - |
dc.subject | quantisation (signal) | - |
dc.subject | stochastic processes | - |
dc.title | High-Dimensional Stochastic Gradient Quantization for Communication-Efficient Edge Learning | - |
dc.type | Conference_Paper | - |
dc.identifier.email | Huang, K: huangkb@eee.hku.hk | - |
dc.identifier.authority | Huang, K=rp01875 | - |
dc.description.nature | link_to_subscribed_fulltext | - |
dc.identifier.doi | 10.1109/GlobalSIP45357.2019.8969082 | - |
dc.identifier.scopus | eid_2-s2.0-85079266840 | - |
dc.identifier.hkuros | 318020 | - |
dc.identifier.spage | 1 | - |
dc.identifier.epage | 5 | - |
dc.publisher.place | United States | - |