Dvc: An end-to-end deep video compression framework

Lu, Guo; Ouyang, Wanli; Xu, Dong; Zhang, Xiaoyun; Cai, Chunlei; Gao, Zhiyong

File Download

There are no files associated with this item.

Links for fulltext

(May Require Subscription)

Publisher Website: 10.1109/CVPR.2019.01126
Scopus: eid_2-s2.0-85078774931
WOS: WOS:000542649304063
Find via

Supplementary

Citations:
- Scopus: 0
- Web of Science: 0
Appears in Collections:
- Computer Science: Conference papers

Conference Paper: Dvc: An end-to-end deep video compression framework

Title	Dvc: An end-to-end deep video compression framework
Authors	Lu, Guo Ouyang, Wanli Xu, Dong Zhang, Xiaoyun Cai, Chunlei Gao, Zhiyong
Keywords	Low-level Vision Vision Applications and Systems
Issue Date	2019
Citation	Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2019, v. 2019-June, p. 10998-11007 How to Cite? DOI: http://dx.doi.org/10.1109/CVPR.2019.01126
Abstract	Conventional video compression approaches use the predictive coding architecture and encode the corresponding motion information and residual information. In this paper, taking advantage of both classical architecture in the conventional video compression method and the powerful non-linear representation ability of neural networks, we propose the first end-to-end video compression deep model that jointly optimizes all the components for video compression. Specifically, learning based optical flow estimation is utilized to obtain the motion information and reconstruct the current frames. Then we employ two auto-encoder style neural networks to compress the corresponding motion and residual information. All the modules are jointly learned through a single loss function, in which they collaborate with each other by considering the trade-off between reducing the number of compression bits and improving quality of the decoded video. Experimental results show that the proposed approach can outperform the widely used video coding standard H.264 in terms of PSNR and be even on par with the latest standard H.265 in terms of MS-SSIM. Code is released at https://github.com/GuoLusjtu/DVC.
Persistent Identifier	http://hdl.handle.net/10722/321876
ISSN	1063-6919 2023 SCImago Journal Rankings: 10.331
ISI Accession Number ID	WOS:000542649304063

DC Field	Value	Language
dc.contributor.author	Lu, Guo	-
dc.contributor.author	Ouyang, Wanli	-
dc.contributor.author	Xu, Dong	-
dc.contributor.author	Zhang, Xiaoyun	-
dc.contributor.author	Cai, Chunlei	-
dc.contributor.author	Gao, Zhiyong	-
dc.date.accessioned	2022-11-03T02:22:03Z	-
dc.date.available	2022-11-03T02:22:03Z	-
dc.date.issued	2019	-
dc.identifier.citation	Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2019, v. 2019-June, p. 10998-11007	-
dc.identifier.issn	1063-6919	-
dc.identifier.uri	http://hdl.handle.net/10722/321876	-
dc.description.abstract	Conventional video compression approaches use the predictive coding architecture and encode the corresponding motion information and residual information. In this paper, taking advantage of both classical architecture in the conventional video compression method and the powerful non-linear representation ability of neural networks, we propose the first end-to-end video compression deep model that jointly optimizes all the components for video compression. Specifically, learning based optical flow estimation is utilized to obtain the motion information and reconstruct the current frames. Then we employ two auto-encoder style neural networks to compress the corresponding motion and residual information. All the modules are jointly learned through a single loss function, in which they collaborate with each other by considering the trade-off between reducing the number of compression bits and improving quality of the decoded video. Experimental results show that the proposed approach can outperform the widely used video coding standard H.264 in terms of PSNR and be even on par with the latest standard H.265 in terms of MS-SSIM. Code is released at https://github.com/GuoLusjtu/DVC.	-
dc.language	eng	-
dc.relation.ispartof	Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition	-
dc.subject	Low-level Vision	-
dc.subject	Vision Applications and Systems	-
dc.title	Dvc: An end-to-end deep video compression framework	-
dc.type	Conference_Paper	-
dc.description.nature	link_to_subscribed_fulltext	-
dc.identifier.doi	10.1109/CVPR.2019.01126	-
dc.identifier.scopus	eid_2-s2.0-85078774931	-
dc.identifier.volume	2019-June	-
dc.identifier.spage	10998	-
dc.identifier.epage	11007	-
dc.identifier.isi	WOS:000542649304063	-

File Download

Links for fulltext

(May Require Subscription)

Supplementary

Conference Paper: Dvc: An end-to-end deep video compression framework

Export via OAI-PMH Interface in XML Formats

OR

Export to Other Non-XML Formats