Detco: Unsupervised contrastive learning for object detection

Xie, E; Ding, J; Wang, W; Zhan, X; Xu, H; Sun, P; Li, Z; Luo, P

File Download

There are no files associated with this item.

Links for fulltext

(May Require Subscription)

Publisher Website: 10.1109/ICCV48922.2021.00828
WOS: WOS:000798743207012

Supplementary

Citations:
- Web of Science: 0
Appears in Collections:
- Computer Science: Conference papers

Conference Paper: Detco: Unsupervised contrastive learning for object detection

Title	Detco: Unsupervised contrastive learning for object detection
Authors	Xie, E Ding, J Wang, W Zhan, X Xu, H Sun, P Li, Z Luo, P
Keywords	Computer vision Three-dimensional displays Object detection Detectors Feature extraction
Issue Date	2021
Publisher	IEEE Computer Society.
Citation	2021 IEEE/CVF International Conference on Computer Vision (ICCV) (Virtual), Montreal, QC, Canada, October 10-17, 2021. In Proceedings: 2021 IEEE/CVF International Conference on Computer Vision: ICCV 2021, 11-17 October 2021, p. 8392-8401 How to Cite? DOI: http://dx.doi.org/10.1109/ICCV48922.2021.00828
Abstract	We present DetCo, a simple yet effective self-supervised approach for object detection. Unsupervised pre-training methods have been recently designed for object detection, but they are usually deficient in image classification, or the opposite. Unlike them, DetCo transfers well on downstream instance-level dense prediction tasks, while maintaining competitive image-level classification accuracy. The advantages are derived from (1) multi-level supervision to intermediate representations, (2) contrastive learning between global image and local patches. These two designs facilitate discriminative and consistent global and local representation at each level of feature pyramid, improving detection and classification, simultaneously.Extensive experiments on VOC, COCO, Cityscapes, and ImageNet demonstrate that DetCo not only outperforms recent methods on a series of 2D and 3D instance-level detection tasks, but also competitive on image classification. For example, on ImageNet classification, DetCo is 6.9% and 5.0% top-1 accuracy better than InsLoc and DenseCL, which are two contemporary works designed for object detection. Moreover, on COCO detection, DetCo is 6.9 AP better than SwAV with Mask R-CNN C4. Notably, DetCo largely boosts up Sparse R-CNN, a recent strong detector, from 45.0 AP to 46.5 AP (+1.5 AP), establishing a new SOTA on COCO.
Persistent Identifier	http://hdl.handle.net/10722/315801
ISI Accession Number ID	WOS:000798743207012

DC Field	Value	Language
dc.contributor.author	Xie, E	-
dc.contributor.author	Ding, J	-
dc.contributor.author	Wang, W	-
dc.contributor.author	Zhan, X	-
dc.contributor.author	Xu, H	-
dc.contributor.author	Sun, P	-
dc.contributor.author	Li, Z	-
dc.contributor.author	Luo, P	-
dc.date.accessioned	2022-08-19T09:04:41Z	-
dc.date.available	2022-08-19T09:04:41Z	-
dc.date.issued	2021	-
dc.identifier.citation	2021 IEEE/CVF International Conference on Computer Vision (ICCV) (Virtual), Montreal, QC, Canada, October 10-17, 2021. In Proceedings: 2021 IEEE/CVF International Conference on Computer Vision: ICCV 2021, 11-17 October 2021, p. 8392-8401	-
dc.identifier.uri	http://hdl.handle.net/10722/315801	-
dc.description.abstract	We present DetCo, a simple yet effective self-supervised approach for object detection. Unsupervised pre-training methods have been recently designed for object detection, but they are usually deficient in image classification, or the opposite. Unlike them, DetCo transfers well on downstream instance-level dense prediction tasks, while maintaining competitive image-level classification accuracy. The advantages are derived from (1) multi-level supervision to intermediate representations, (2) contrastive learning between global image and local patches. These two designs facilitate discriminative and consistent global and local representation at each level of feature pyramid, improving detection and classification, simultaneously.Extensive experiments on VOC, COCO, Cityscapes, and ImageNet demonstrate that DetCo not only outperforms recent methods on a series of 2D and 3D instance-level detection tasks, but also competitive on image classification. For example, on ImageNet classification, DetCo is 6.9% and 5.0% top-1 accuracy better than InsLoc and DenseCL, which are two contemporary works designed for object detection. Moreover, on COCO detection, DetCo is 6.9 AP better than SwAV with Mask R-CNN C4. Notably, DetCo largely boosts up Sparse R-CNN, a recent strong detector, from 45.0 AP to 46.5 AP (+1.5 AP), establishing a new SOTA on COCO.	-
dc.language	eng	-
dc.publisher	IEEE Computer Society.	-
dc.relation.ispartof	Proceedings: 2021 IEEE/CVF International Conference on Computer Vision: ICCV 2021, 11-17 October 2021, Virtual event	-
dc.rights	Proceedings: 2021 IEEE/CVF International Conference on Computer Vision: ICCV 2021, 11-17 October 2021, Virtual event. Copyright © IEEE Computer Society.	-
dc.subject	Computer vision	-
dc.subject	Three-dimensional displays	-
dc.subject	Object detection	-
dc.subject	Detectors	-
dc.subject	Feature extraction	-
dc.title	Detco: Unsupervised contrastive learning for object detection	-
dc.type	Conference_Paper	-
dc.identifier.email	Luo, P: pluo@hku.hk	-
dc.identifier.authority	Luo, P=rp02575	-
dc.identifier.doi	10.1109/ICCV48922.2021.00828	-
dc.identifier.hkuros	335602	-
dc.identifier.spage	8392	-
dc.identifier.epage	8401	-
dc.identifier.isi	WOS:000798743207012	-
dc.publisher.place	United States	-

File Download

Links for fulltext

(May Require Subscription)

Supplementary

Conference Paper: Detco: Unsupervised contrastive learning for object detection

Export via OAI-PMH Interface in XML Formats

OR

Export to Other Non-XML Formats