Progressive End-to-End Object Detection in Crowded Scenes

Zheng, Anlin; Zhang, Yuang; Zhang, Xiangyu; Qi, Xiaojuan; Sun, Jian

File Download

There are no files associated with this item.

Links for fulltext

(May Require Subscription)

Publisher Website: 10.1109/CVPR52688.2022.00093
Scopus: eid_2-s2.0-85141805247
WOS: WOS:000867754201010

Supplementary

Citations:
- Scopus: 0
- Web of Science: 0
Appears in Collections:
- Electrical & Electronic Engineering: Conference papers

Conference Paper: Progressive End-to-End Object Detection in Crowded Scenes

Title	Progressive End-to-End Object Detection in Crowded Scenes
Authors	Zheng, Anlin Zhang, Yuang Zhang, Xiangyu Qi, Xiaojuan Sun, Jian
Keywords	categorization Deep learning architectures and techniques Recognition: detection Representation learning retrieval Scene analysis and understanding Vision applications and systems
Issue Date	27-Sep-2022
Abstract	In this paper, we propose a new query-based detection framework for crowd detection. Previous query-based detectors suffer from two drawbacks: first, multiple predictions will be inferred for a single object, typically in crowded scenes; second, the performance saturates as the depth of the decoding stage increases. Benefiting from the nature of the one-to-one label assignment rule, we propose a progressive predicting method to address the above issues. Specifically, we first select accepted queries prone to generate true positive predictions, then refine the rest noisy queries according to the previously accepted predictions. Experiments show that our method can significantly boost the performance of query-based detectors in crowded scenes. Equipped with our approach, Sparse RCNN achieves 92.0% AP, 41.4% MR ⁻² and 83.2% JI on the challenging CrowdHuman [35] dataset, outperforming the box-based method MIP [8] that specifies in handling crowded scenarios. Moreover, the proposed method, robust to crowdedness, can still obtain consistent improvements on moderately and slightly crowded datasets like CityPersons [47] and COCO [26]. Code will be made publicly available at https://github.com/megvii-model/Iter-E2EDET.
Persistent Identifier	http://hdl.handle.net/10722/333841
ISI Accession Number ID	WOS:000867754201010

DC Field	Value	Language
dc.contributor.author	Zheng, Anlin	-
dc.contributor.author	Zhang, Yuang	-
dc.contributor.author	Zhang, Xiangyu	-
dc.contributor.author	Qi, Xiaojuan	-
dc.contributor.author	Sun, Jian	-
dc.date.accessioned	2023-10-06T08:39:31Z	-
dc.date.available	2023-10-06T08:39:31Z	-
dc.date.issued	2022-09-27	-
dc.identifier.uri	http://hdl.handle.net/10722/333841	-
dc.description.abstract	<p>In this paper, we propose a new query-based detection framework for crowd detection. Previous query-based detectors suffer from two drawbacks: first, multiple predictions will be inferred for a single object, typically in crowded scenes; second, the performance saturates as the depth of the decoding stage increases. Benefiting from the nature of the one-to-one label assignment rule, we propose a progressive predicting method to address the above issues. Specifically, we first select accepted queries prone to generate true positive predictions, then refine the rest noisy queries according to the previously accepted predictions. Experiments show that our method can significantly boost the performance of query-based detectors in crowded scenes. Equipped with our approach, Sparse RCNN achieves 92.0% AP, 41.4% MR <sup>−2</sup> and 83.2% JI on the challenging CrowdHuman [35] dataset, outperforming the box-based method MIP [8] that specifies in handling crowded scenarios. Moreover, the proposed method, robust to crowdedness, can still obtain consistent improvements on moderately and slightly crowded datasets like CityPersons [47] and COCO [26]. Code will be made publicly available at https://github.com/megvii-model/Iter-E2EDET.<br></p>	-
dc.language	eng	-
dc.relation.ispartof	2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (18/06/2022-24/06/2022, New Orleans, LA, USA)	-
dc.subject	categorization	-
dc.subject	Deep learning architectures and techniques	-
dc.subject	Recognition: detection	-
dc.subject	Representation learning	-
dc.subject	retrieval	-
dc.subject	Scene analysis and understanding	-
dc.subject	Vision applications and systems	-
dc.title	Progressive End-to-End Object Detection in Crowded Scenes	-
dc.type	Conference_Paper	-
dc.identifier.doi	10.1109/CVPR52688.2022.00093	-
dc.identifier.scopus	eid_2-s2.0-85141805247	-
dc.identifier.volume	2022-June	-
dc.identifier.spage	847	-
dc.identifier.epage	856	-
dc.identifier.isi	WOS:000867754201010	-

File Download

Links for fulltext

(May Require Subscription)

Supplementary

Conference Paper: Progressive End-to-End Object Detection in Crowded Scenes

Export via OAI-PMH Interface in XML Formats

OR

Export to Other Non-XML Formats