Conference Paper: Differentiable Dynamic Quantization with Mixed Precision and Adaptive Resolution

Title: Differentiable Dynamic Quantization with Mixed Precision and Adaptive Resolution
Authors: Zhang, Z; Shao, W; Gu, J; Wang, X; Luo, P
Issue Date: 2021
Publisher: ML Research Press (http://proceedings.mlr.press/)
Citation: The 38th International Conference on Machine Learning (ICML), Virtual Conference, 18-24 July 2021. In Proceedings of Machine Learning Research (PMLR), v. 139: Proceedings of ICML 2021, p. 12546-12556
Abstract: Model quantization is challenging because of its many tedious hyper-parameters, such as precision (bitwidth), dynamic range (minimum and maximum discrete values) and stepsize (interval between discrete values). Unlike prior art, which carefully tunes these values, we present a fully differentiable approach that learns all of them, named Differentiable Dynamic Quantization (DDQ), which has several benefits. (1) DDQ can quantize challenging lightweight architectures such as MobileNets, where different layers prefer different quantization parameters. (2) DDQ is hardware-friendly and can be easily implemented using low-precision matrix-vector multiplication, making it deployable on many hardware platforms such as ARM. (3) Extensive experiments show that DDQ outperforms prior art on many networks and benchmarks, especially when models are already efficient and compact; e.g., DDQ is the first approach to achieve lossless 4-bit quantization of MobileNetV2 on ImageNet.
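The three hyper-parameters named in the abstract are easiest to see in code. Below is a minimal PyTorch sketch of a standard uniform quantizer with a straight-through estimator, not DDQ itself: DDQ learns the bitwidth, dynamic range and stepsize differentiably, whereas here they are fixed arguments, and the helper name uniform_quantize and its default values are illustrative assumptions.

import torch

def uniform_quantize(x, bitwidth=4, x_min=-1.0, x_max=1.0):
    # Hypothetical helper illustrating the three hyper-parameters named in
    # the abstract; DDQ learns them, while this sketch takes them as fixed.
    n_levels = 2 ** bitwidth                      # precision: number of discrete values
    stepsize = (x_max - x_min) / (n_levels - 1)   # stepsize: interval between discrete values
    x_clamped = torch.clamp(x, x_min, x_max)      # dynamic range: clip to [x_min, x_max]
    q = torch.round((x_clamped - x_min) / stepsize)  # nearest discrete level index
    x_q = q * stepsize + x_min                    # map the index back to the float domain
    # Straight-through estimator: the forward pass returns the quantized value,
    # the backward pass treats rounding as identity so gradients still flow.
    return x_clamped + (x_q - x_clamped).detach()

w = torch.randn(8, requires_grad=True)
w_q = uniform_quantize(w)         # 4-bit quantization of a toy weight tensor
w_q.sum().backward()              # gradients reach w via the straight-through path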
Description: Applications (CV and NLP) Session
Persistent Identifier: http://hdl.handle.net/10722/301433
ISSN: 2640-3498

DC Field: Value
dc.contributor.author: Zhang, Z
dc.contributor.author: Shao, W
dc.contributor.author: Gu, J
dc.contributor.author: Wang, X
dc.contributor.author: Luo, P
dc.date.accessioned: 2021-07-27T08:11:00Z
dc.date.available: 2021-07-27T08:11:00Z
dc.date.issued: 2021
dc.identifier.citation: The 38th International Conference on Machine Learning (ICML), Virtual Conference, 18-24 July 2021. In Proceedings of Machine Learning Research (PMLR), v. 139: Proceedings of ICML 2021, p. 12546-12556
dc.identifier.issn: 2640-3498
dc.identifier.uri: http://hdl.handle.net/10722/301433
dc.description: Applications (CV and NLP) Session
dc.description.abstract: Model quantization is challenging because of its many tedious hyper-parameters, such as precision (bitwidth), dynamic range (minimum and maximum discrete values) and stepsize (interval between discrete values). Unlike prior art, which carefully tunes these values, we present a fully differentiable approach that learns all of them, named Differentiable Dynamic Quantization (DDQ), which has several benefits. (1) DDQ can quantize challenging lightweight architectures such as MobileNets, where different layers prefer different quantization parameters. (2) DDQ is hardware-friendly and can be easily implemented using low-precision matrix-vector multiplication, making it deployable on many hardware platforms such as ARM. (3) Extensive experiments show that DDQ outperforms prior art on many networks and benchmarks, especially when models are already efficient and compact; e.g., DDQ is the first approach to achieve lossless 4-bit quantization of MobileNetV2 on ImageNet.
dc.language: eng
dc.publisher: ML Research Press (http://proceedings.mlr.press/)
dc.relation.ispartof: Proceedings of Machine Learning Research (PMLR)
dc.relation.ispartof: The 38th International Conference on Machine Learning (ICML), 2021
dc.title: Differentiable Dynamic Quantization with Mixed Precision and Adaptive Resolution
dc.type: Conference_Paper
dc.identifier.email: Luo, P: pluo@hku.hk
dc.identifier.authority: Luo, P=rp02575
dc.identifier.hkuros: 323759
dc.identifier.volume: 139: Proceedings of ICML 2021
dc.identifier.spage: 12546
dc.identifier.epage: 12556
dc.publisher.place: United States
