Synthesizing Supervision for Learning Deep Saliency Network without Human Annotation

Zhang, Dingwen; Han, Junwei; Zhang, Yu; Xu, Dong

Links for fulltext

Publisher Website: 10.1109/TPAMI.2019.2900649
Scopus: eid_2-s2.0-85086060874
PMID: 30794509
WOS: WOS:000542967200018

Citations:
- Scopus: 0
- Web of Science: 0
- PubMed Central: 0
Appears in Collections:
- Computer Science: Journal/Magazine Articles

Title	Synthesizing Supervision for Learning Deep Saliency Network without Human Annotation
Authors	Zhang, Dingwen; Han, Junwei; Zhang, Yu; Xu, Dong
Keywords	annotation-free; Salient object detection; supervision synthesis; weakly supervised semantic segmentation
Issue Date	2020
Citation	IEEE Transactions on Pattern Analysis and Machine Intelligence, 2020, v. 42, n. 7, p. 1755-1769. DOI: http://dx.doi.org/10.1109/TPAMI.2019.2900649
Abstract	Recently, the field of salient object detection has developed rapidly and remarkably with the wide use of deep neural networks. Trained on a large number of images annotated with strong pixel-level ground-truth masks, deep salient object detectors have achieved state-of-the-art performance. However, providing a pixel-level ground-truth mask for each training image is expensive and time-consuming. To address this problem, this paper proposes one of the earliest frameworks to learn deep salient object detectors without requiring any human annotation. The supervisory signals used in our learning framework are generated through a novel supervision synthesis scheme, in which the key insights are 'knowledge source transition' and 'supervision by fusion'. Specifically, in the proposed learning framework, both the external knowledge source and the internal knowledge source are explored dynamically to provide informative cues for synthesizing the supervision required in our approach, while a two-stream fusion mechanism is established to implement the supervision synthesis process. Comprehensive experiments on four benchmark datasets demonstrate that the deep salient object detector trained by our proposed learning framework often works well without requiring any human-annotated masks, and even approaches its upper bound obtained under fully supervised learning (within a 3 percent performance gap). In addition, we apply the salient object detector learned with our annotation-free framework to assist the weakly supervised semantic segmentation task, which demonstrates that our approach can also alleviate the heavy supplementary supervision required in the existing weakly supervised semantic segmentation framework.
Persistent Identifier	http://hdl.handle.net/10722/321888
ISSN	0162-8828 (2023 Impact Factor: 20.8; 2023 SCImago Journal Rankings: 6.158)
ISI Accession Number ID	WOS:000542967200018

DC Field	Value	Language
dc.contributor.author	Zhang, Dingwen	-
dc.contributor.author	Han, Junwei	-
dc.contributor.author	Zhang, Yu	-
dc.contributor.author	Xu, Dong	-
dc.date.accessioned	2022-11-03T02:22:08Z	-
dc.date.available	2022-11-03T02:22:08Z	-
dc.date.issued	2020	-
dc.identifier.citation	IEEE Transactions on Pattern Analysis and Machine Intelligence, 2020, v. 42, n. 7, p. 1755-1769	-
dc.identifier.issn	0162-8828	-
dc.identifier.uri	http://hdl.handle.net/10722/321888	-
dc.language	eng	-
dc.relation.ispartof	IEEE Transactions on Pattern Analysis and Machine Intelligence	-
dc.subject	annotation-free	-
dc.subject	Salient object detection	-
dc.subject	supervision synthesis	-
dc.subject	weakly supervised semantic segmentation	-
dc.title	Synthesizing Supervision for Learning Deep Saliency Network without Human Annotation	-
dc.type	Article	-
dc.description.nature	link_to_subscribed_fulltext	-
dc.identifier.doi	10.1109/TPAMI.2019.2900649	-
dc.identifier.pmid	30794509	-
dc.identifier.scopus	eid_2-s2.0-85086060874	-
dc.identifier.volume	42	-
dc.identifier.issue	7	-
dc.identifier.spage	1755	-
dc.identifier.epage	1769	-
dc.identifier.eissn	1939-3539	-
dc.identifier.isi	WOS:000542967200018	-