File Download

There are no files associated with this item.

  Links for fulltext
     (May Require Subscription)
Supplementary

Conference Paper: Dvc: An end-to-end deep video compression framework

TitleDvc: An end-to-end deep video compression framework
Authors
KeywordsLow-level Vision
Vision Applications and Systems
Issue Date2019
Citation
Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2019, v. 2019-June, p. 10998-11007 How to Cite?
AbstractConventional video compression approaches use the predictive coding architecture and encode the corresponding motion information and residual information. In this paper, taking advantage of both classical architecture in the conventional video compression method and the powerful non-linear representation ability of neural networks, we propose the first end-to-end video compression deep model that jointly optimizes all the components for video compression. Specifically, learning based optical flow estimation is utilized to obtain the motion information and reconstruct the current frames. Then we employ two auto-encoder style neural networks to compress the corresponding motion and residual information. All the modules are jointly learned through a single loss function, in which they collaborate with each other by considering the trade-off between reducing the number of compression bits and improving quality of the decoded video. Experimental results show that the proposed approach can outperform the widely used video coding standard H.264 in terms of PSNR and be even on par with the latest standard H.265 in terms of MS-SSIM. Code is released at https://github.com/GuoLusjtu/DVC.
Persistent Identifierhttp://hdl.handle.net/10722/321876
ISSN
2020 SCImago Journal Rankings: 4.658
ISI Accession Number ID

 

DC FieldValueLanguage
dc.contributor.authorLu, Guo-
dc.contributor.authorOuyang, Wanli-
dc.contributor.authorXu, Dong-
dc.contributor.authorZhang, Xiaoyun-
dc.contributor.authorCai, Chunlei-
dc.contributor.authorGao, Zhiyong-
dc.date.accessioned2022-11-03T02:22:03Z-
dc.date.available2022-11-03T02:22:03Z-
dc.date.issued2019-
dc.identifier.citationProceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2019, v. 2019-June, p. 10998-11007-
dc.identifier.issn1063-6919-
dc.identifier.urihttp://hdl.handle.net/10722/321876-
dc.description.abstractConventional video compression approaches use the predictive coding architecture and encode the corresponding motion information and residual information. In this paper, taking advantage of both classical architecture in the conventional video compression method and the powerful non-linear representation ability of neural networks, we propose the first end-to-end video compression deep model that jointly optimizes all the components for video compression. Specifically, learning based optical flow estimation is utilized to obtain the motion information and reconstruct the current frames. Then we employ two auto-encoder style neural networks to compress the corresponding motion and residual information. All the modules are jointly learned through a single loss function, in which they collaborate with each other by considering the trade-off between reducing the number of compression bits and improving quality of the decoded video. Experimental results show that the proposed approach can outperform the widely used video coding standard H.264 in terms of PSNR and be even on par with the latest standard H.265 in terms of MS-SSIM. Code is released at https://github.com/GuoLusjtu/DVC.-
dc.languageeng-
dc.relation.ispartofProceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition-
dc.subjectLow-level Vision-
dc.subjectVision Applications and Systems-
dc.titleDvc: An end-to-end deep video compression framework-
dc.typeConference_Paper-
dc.description.naturelink_to_subscribed_fulltext-
dc.identifier.doi10.1109/CVPR.2019.01126-
dc.identifier.scopuseid_2-s2.0-85078774931-
dc.identifier.volume2019-June-
dc.identifier.spage10998-
dc.identifier.epage11007-
dc.identifier.isiWOS:000542649304063-

Export via OAI-PMH Interface in XML Formats


OR


Export to Other Non-XML Formats