
Conference Paper: Self-Supervised scene de-occlusion

Title: Self-Supervised scene de-occlusion
Authors: Zhan, Xiaohang; Pan, Xingang; Dai, Bo; Liu, Ziwei; Lin, Dahua; Loy, Chen Change
Issue Date: 2020
Citation: Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2020, p. 3783-3791
Abstract: Natural scene understanding is a challenging task, particularly when encountering images of multiple objects that are partially occluded. This difficulty arises from varying object ordering and positioning. Existing scene understanding paradigms are able to parse only the visible parts, resulting in incomplete and unstructured scene interpretation. In this paper, we investigate the problem of scene de-occlusion, which aims to recover the underlying occlusion ordering and complete the invisible parts of occluded objects. We make the first attempt to address the problem through a novel and unified framework that recovers hidden scene structures without ordering and amodal annotations as supervision. This is achieved via Partial Completion Network (PCNet)-mask (M) and -content (C), which learn to recover fractions of object masks and contents, respectively, in a self-supervised manner. Based on PCNet-M and PCNet-C, we devise a novel inference scheme to accomplish scene de-occlusion via progressive ordering recovery, amodal completion, and content completion. Extensive experiments on real-world scenes demonstrate the superior performance of our approach over other alternatives. Remarkably, our approach, trained in a self-supervised manner, achieves results comparable to fully-supervised methods. The proposed scene de-occlusion framework benefits many applications, including high-quality and controllable image manipulation and scene recomposition (see Fig. 1), as well as the conversion of existing modal mask annotations to amodal mask annotations. Project page: https://xiaohangzhan.github.io/projects/deocclusion/.
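The abstract's self-supervised "partial completion" idea — synthetically erase part of a visible object mask with another object's mask, then train a network to restore the original — can be sketched roughly as follows. The function and variable names here are illustrative assumptions for the training-pair generation step, not the authors' actual code:

```python
import numpy as np

def make_partial_completion_pair(mask_a, mask_b):
    """Illustrative PCNet-M-style training pair (an assumption, not the
    paper's exact recipe): mask_b synthetically occludes mask_a, and the
    network must recover the full mask_a from the erased remainder plus
    the occluder."""
    erased = mask_a & ~mask_b          # visible part after synthetic occlusion
    return erased, mask_b, mask_a      # (input mask, occluder mask, target)

# Toy 1-D boolean "masks" just to show the mechanics.
a = np.array([0, 1, 1, 1, 0], dtype=bool)
b = np.array([0, 0, 1, 1, 1], dtype=bool)
inp, occ, tgt = make_partial_completion_pair(a, b)
# A model trained on many such (inp, occ) -> tgt pairs learns mask
# completion without ever seeing amodal (hidden-region) annotations.
```

Because both the erased input and the full target come from the same visible mask, no manual amodal labels are needed — which is the crux of the self-supervision described in the abstract.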
Persistent Identifier: http://hdl.handle.net/10722/352216
ISSN: 1063-6919
2023 SCImago Journal Rankings: 10.331

 

DC Field: Value
dc.contributor.author: Zhan, Xiaohang
dc.contributor.author: Pan, Xingang
dc.contributor.author: Dai, Bo
dc.contributor.author: Liu, Ziwei
dc.contributor.author: Lin, Dahua
dc.contributor.author: Loy, Chen Change
dc.date.accessioned: 2024-12-16T03:57:22Z
dc.date.available: 2024-12-16T03:57:22Z
dc.date.issued: 2020
dc.identifier.citation: Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2020, p. 3783-3791
dc.identifier.issn: 1063-6919
dc.identifier.uri: http://hdl.handle.net/10722/352216
dc.description.abstract: Natural scene understanding is a challenging task, particularly when encountering images of multiple objects that are partially occluded. This difficulty arises from varying object ordering and positioning. Existing scene understanding paradigms are able to parse only the visible parts, resulting in incomplete and unstructured scene interpretation. In this paper, we investigate the problem of scene de-occlusion, which aims to recover the underlying occlusion ordering and complete the invisible parts of occluded objects. We make the first attempt to address the problem through a novel and unified framework that recovers hidden scene structures without ordering and amodal annotations as supervision. This is achieved via Partial Completion Network (PCNet)-mask (M) and -content (C), which learn to recover fractions of object masks and contents, respectively, in a self-supervised manner. Based on PCNet-M and PCNet-C, we devise a novel inference scheme to accomplish scene de-occlusion via progressive ordering recovery, amodal completion, and content completion. Extensive experiments on real-world scenes demonstrate the superior performance of our approach over other alternatives. Remarkably, our approach, trained in a self-supervised manner, achieves results comparable to fully-supervised methods. The proposed scene de-occlusion framework benefits many applications, including high-quality and controllable image manipulation and scene recomposition (see Fig. 1), as well as the conversion of existing modal mask annotations to amodal mask annotations. Project page: https://xiaohangzhan.github.io/projects/deocclusion/.
dc.language: eng
dc.relation.ispartof: Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition
dc.title: Self-Supervised scene de-occlusion
dc.type: Conference_Paper
dc.description.nature: link_to_subscribed_fulltext
dc.identifier.doi: 10.1109/CVPR42600.2020.00384
dc.identifier.scopus: eid_2-s2.0-85094832567
dc.identifier.spage: 3783
dc.identifier.epage: 3791
