File Download

There are no files associated with this item.

  Links for fulltext
     (May Require Subscription)
Supplementary

Article: An End-to-End Learning Framework for Video Compression

TitleAn End-to-End Learning Framework for Video Compression
Authors
Keywordsend-to-end optimization
image compression
neural network
Video compression
Issue Date2021
Citation
IEEE Transactions on Pattern Analysis and Machine Intelligence, 2021, v. 43, n. 10, p. 3292-3308 How to Cite?
AbstractTraditional video compression approaches build upon the hybrid coding framework with motion-compensated prediction and residual transform coding. In this paper, we propose the first end-to-end deep video compression framework to take advantage of both the classical compression architecture and the powerful non-linear representation ability of neural networks. Our framework employs pixel-wise motion information, which is learned from an optical flow network and further compressed by an auto-encoder network to save bits. The other compression components are also implemented by the well-designed networks for high efficiency. All the modules are jointly optimized by using the rate-distortion trade-off and can collaborate with each other. More importantly, the proposed deep video compression framework is very flexible and can be easily extended by using lightweight or advanced networks for higher speed or better efficiency. We also propose to introduce the adaptive quantization layer to reduce the number of parameters for variable bitrate coding. Comprehensive experimental results demonstrate the effectiveness of the proposed framework on the benchmark datasets.
Persistent Identifierhttp://hdl.handle.net/10722/321964
ISSN
2021 Impact Factor: 24.314
2020 SCImago Journal Rankings: 3.811
ISI Accession Number ID

 

DC FieldValueLanguage
dc.contributor.authorLu, Guo-
dc.contributor.authorZhang, Xiaoyun-
dc.contributor.authorOuyang, Wanli-
dc.contributor.authorChen, Li-
dc.contributor.authorGao, Zhiyong-
dc.contributor.authorXu, Dong-
dc.date.accessioned2022-11-03T02:22:40Z-
dc.date.available2022-11-03T02:22:40Z-
dc.date.issued2021-
dc.identifier.citationIEEE Transactions on Pattern Analysis and Machine Intelligence, 2021, v. 43, n. 10, p. 3292-3308-
dc.identifier.issn0162-8828-
dc.identifier.urihttp://hdl.handle.net/10722/321964-
dc.description.abstractTraditional video compression approaches build upon the hybrid coding framework with motion-compensated prediction and residual transform coding. In this paper, we propose the first end-to-end deep video compression framework to take advantage of both the classical compression architecture and the powerful non-linear representation ability of neural networks. Our framework employs pixel-wise motion information, which is learned from an optical flow network and further compressed by an auto-encoder network to save bits. The other compression components are also implemented by the well-designed networks for high efficiency. All the modules are jointly optimized by using the rate-distortion trade-off and can collaborate with each other. More importantly, the proposed deep video compression framework is very flexible and can be easily extended by using lightweight or advanced networks for higher speed or better efficiency. We also propose to introduce the adaptive quantization layer to reduce the number of parameters for variable bitrate coding. Comprehensive experimental results demonstrate the effectiveness of the proposed framework on the benchmark datasets.-
dc.languageeng-
dc.relation.ispartofIEEE Transactions on Pattern Analysis and Machine Intelligence-
dc.subjectend-to-end optimization-
dc.subjectimage compression-
dc.subjectneural network-
dc.subjectVideo compression-
dc.titleAn End-to-End Learning Framework for Video Compression-
dc.typeArticle-
dc.description.naturelink_to_subscribed_fulltext-
dc.identifier.doi10.1109/TPAMI.2020.2988453-
dc.identifier.pmid32324541-
dc.identifier.scopuseid_2-s2.0-85114602705-
dc.identifier.volume43-
dc.identifier.issue10-
dc.identifier.spage3292-
dc.identifier.epage3308-
dc.identifier.eissn1939-3539-
dc.identifier.isiWOS:000692232400006-

Export via OAI-PMH Interface in XML Formats


OR


Export to Other Non-XML Formats