An End-to-End Learning Framework for Video Compression

Lu, Guo; Zhang, Xiaoyun; Ouyang, Wanli; Chen, Li; Gao, Zhiyong; Xu, Dong

Links for fulltext

(May Require Subscription)

Publisher Website: 10.1109/TPAMI.2020.2988453
Scopus: eid_2-s2.0-85114602705
PMID: 32324541
WOS: WOS:000692232400006

Citations:
- Scopus: 0
- Web of Science: 0
- PubMed Central: 0
Appears in Collections:
- Computer Science: Journal/Magazine Articles

Article: An End-to-End Learning Framework for Video Compression

Title	An End-to-End Learning Framework for Video Compression
Authors	Lu, Guo; Zhang, Xiaoyun; Ouyang, Wanli; Chen, Li; Gao, Zhiyong; Xu, Dong
Keywords	end-to-end optimization; image compression; neural network; Video compression
Issue Date	2021
Citation	IEEE Transactions on Pattern Analysis and Machine Intelligence, 2021, v. 43, n. 10, p. 3292-3308. DOI: http://dx.doi.org/10.1109/TPAMI.2020.2988453
Abstract	Traditional video compression approaches build upon the hybrid coding framework with motion-compensated prediction and residual transform coding. In this paper, we propose the first end-to-end deep video compression framework to take advantage of both the classical compression architecture and the powerful non-linear representation ability of neural networks. Our framework employs pixel-wise motion information, which is learned from an optical flow network and further compressed by an auto-encoder network to save bits. The other compression components are also implemented by the well-designed networks for high efficiency. All the modules are jointly optimized by using the rate-distortion trade-off and can collaborate with each other. More importantly, the proposed deep video compression framework is very flexible and can be easily extended by using lightweight or advanced networks for higher speed or better efficiency. We also propose to introduce the adaptive quantization layer to reduce the number of parameters for variable bitrate coding. Comprehensive experimental results demonstrate the effectiveness of the proposed framework on the benchmark datasets.
Persistent Identifier	http://hdl.handle.net/10722/321964
ISSN	0162-8828
2023 Impact Factor	20.8
2023 SCImago Journal Rankings	6.158
ISI Accession Number ID	WOS:000692232400006
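
The abstract states that all modules are jointly optimized using the rate-distortion trade-off. A minimal sketch of one common form of that objective, a weighted sum of distortion and bit rate, is given below. It is not the authors' implementation: the function names, the use of mean squared error as the distortion, the empirical-entropy rate estimate, and the weight `lam` are all assumptions made for illustration.

```python
import math
from collections import Counter


def mse(original, reconstructed):
    """Mean squared error between two equally sized pixel sequences."""
    if len(original) != len(reconstructed):
        raise ValueError("frames must have the same number of pixels")
    return sum((a - b) ** 2 for a, b in zip(original, reconstructed)) / len(original)


def empirical_bits(symbols):
    """Total bits an ideal entropy coder would spend on `symbols`,
    using their empirical frequencies as the probability model
    (an assumption; a learned codec would use a learned model)."""
    counts = Counter(symbols)
    total = len(symbols)
    return -sum(c * math.log2(c / total) for c in counts.values())


def rd_cost(original, reconstructed, motion_symbols, residual_symbols, lam):
    """Hypothetical rate-distortion cost: lam * distortion + rate.

    The rate is the bits for the quantized motion and residual
    symbols, normalized to bits per pixel of the original frame.
    """
    distortion = mse(original, reconstructed)
    bits = empirical_bits(motion_symbols) + empirical_bits(residual_symbols)
    rate_bpp = bits / len(original)
    return lam * distortion + rate_bpp


if __name__ == "__main__":
    frame = [10, 20, 30, 40]           # toy "original" pixels
    decoded = [10, 22, 30, 38]         # toy reconstruction
    motion = [0, 0, 1, 1]              # toy quantized motion symbols
    residual = [0, 0, 0, 0]            # toy quantized residual symbols
    print(rd_cost(frame, decoded, motion, residual, lam=0.5))
```

A larger `lam` penalizes distortion more heavily and so favors higher quality at a higher bit rate; a smaller `lam` favors fewer bits.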

DC Field	Value	Language
dc.contributor.author	Lu, Guo	-
dc.contributor.author	Zhang, Xiaoyun	-
dc.contributor.author	Ouyang, Wanli	-
dc.contributor.author	Chen, Li	-
dc.contributor.author	Gao, Zhiyong	-
dc.contributor.author	Xu, Dong	-
dc.date.accessioned	2022-11-03T02:22:40Z	-
dc.date.available	2022-11-03T02:22:40Z	-
dc.date.issued	2021	-
dc.identifier.citation	IEEE Transactions on Pattern Analysis and Machine Intelligence, 2021, v. 43, n. 10, p. 3292-3308	-
dc.identifier.issn	0162-8828	-
dc.identifier.uri	http://hdl.handle.net/10722/321964	-
dc.description.abstract	Traditional video compression approaches build upon the hybrid coding framework with motion-compensated prediction and residual transform coding. In this paper, we propose the first end-to-end deep video compression framework to take advantage of both the classical compression architecture and the powerful non-linear representation ability of neural networks. Our framework employs pixel-wise motion information, which is learned from an optical flow network and further compressed by an auto-encoder network to save bits. The other compression components are also implemented by the well-designed networks for high efficiency. All the modules are jointly optimized by using the rate-distortion trade-off and can collaborate with each other. More importantly, the proposed deep video compression framework is very flexible and can be easily extended by using lightweight or advanced networks for higher speed or better efficiency. We also propose to introduce the adaptive quantization layer to reduce the number of parameters for variable bitrate coding. Comprehensive experimental results demonstrate the effectiveness of the proposed framework on the benchmark datasets.	-
dc.language	eng	-
dc.relation.ispartof	IEEE Transactions on Pattern Analysis and Machine Intelligence	-
dc.subject	end-to-end optimization	-
dc.subject	image compression	-
dc.subject	neural network	-
dc.subject	Video compression	-
dc.title	An End-to-End Learning Framework for Video Compression	-
dc.type	Article	-
dc.description.nature	link_to_subscribed_fulltext	-
dc.identifier.doi	10.1109/TPAMI.2020.2988453	-
dc.identifier.pmid	32324541	-
dc.identifier.scopus	eid_2-s2.0-85114602705	-
dc.identifier.volume	43	-
dc.identifier.issue	10	-
dc.identifier.spage	3292	-
dc.identifier.epage	3308	-
dc.identifier.eissn	1939-3539	-
dc.identifier.isi	WOS:000692232400006	-
