DISP6D: Disentangled Implicit Shape and Pose Learning for Scalable 6D Pose Estimation

Wen, YL; Li, XY; Pan, H; Yang, L; Wang, Z; Komura, T; Wang, WP

Publisher Website: 10.1007/978-3-031-20077-9_24
Scopus: eid_2-s2.0-85142754501

Appears in Collections:
- Computer Science: Conference papers

Conference Paper: DISP6D: Disentangled Implicit Shape and Pose Learning for Scalable 6D Pose Estimation

Title	DISP6D: Disentangled Implicit Shape and Pose Learning for Scalable 6D Pose Estimation
Authors	Wen, YL; Li, XY; Pan, H; Yang, L; Wang, Z; Komura, T; Wang, WP
Keywords	6D pose estimation; Disentanglement; Re-entanglement; Scalability; Sim-to-real; Symmetry ambiguity
Issue Date	6-Nov-2022
Abstract	Scalable 6D pose estimation for rigid objects from RGB images aims at handling multiple objects and generalizing to novel objects. Building on a well-known auto-encoding framework to cope with object symmetry and the lack of labeled training data, we achieve scalability by disentangling the latent representation of the auto-encoder into shape and pose sub-spaces. The latent shape space models the similarity of different objects through contrastive metric learning, and the latent pose code is compared with canonical rotations for rotation retrieval. Because different object symmetries induce inconsistent latent pose spaces, we re-entangle the shape representation with canonical rotations to generate shape-dependent pose codebooks for rotation retrieval. We show state-of-the-art performance on two benchmarks, containing textureless CAD objects without categories and daily objects with categories, respectively, and further demonstrate improved scalability by extending to a more challenging setting of daily objects across categories.
Persistent Identifier	http://hdl.handle.net/10722/333852
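
The abstract describes rotation retrieval as comparing a latent pose code against a codebook built from canonical rotations. The sketch below illustrates only that lookup step, under stated assumptions: cosine similarity is assumed as the comparison measure, the codebook is a hand-written toy list rather than the output of any network, and the names `cosine_similarity` and `retrieve_rotation` are hypothetical. It is not the authors' implementation.

```python
import math
from typing import Sequence


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine of the angle between two non-zero vectors of equal length."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def retrieve_rotation(query_code: Sequence[float],
                      codebook: Sequence[Sequence[float]]) -> int:
    """Return the index of the codebook entry most similar to the query.

    Each codebook row stands for the pose code of one canonical rotation
    (assumption: in a real system these rows would come from a trained
    model and would depend on the object's shape code).
    """
    scores = [cosine_similarity(query_code, entry) for entry in codebook]
    return max(range(len(scores)), key=scores.__getitem__)


# Toy codebook: three hypothetical pose codes, one per canonical rotation.
codebook = [
    [1.0, 0.0, 0.0],
    [0.0, 1.0, 0.0],
    [0.0, 0.0, 1.0],
]
query = [0.1, 0.9, 0.2]  # hypothetical pose code of an observed image
best = retrieve_rotation(query, codebook)  # index of the retrieved rotation
```

In this toy setting the retrieved index identifies which canonical rotation best matches the query; a shape-dependent codebook, as the abstract describes, would mean using a different `codebook` for each object shape.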

DC Field	Value	Language
dc.contributor.author	Wen, YL	-
dc.contributor.author	Li, XY	-
dc.contributor.author	Pan, H	-
dc.contributor.author	Yang, L	-
dc.contributor.author	Wang, Z	-
dc.contributor.author	Komura, T	-
dc.contributor.author	Wang, WP	-
dc.date.accessioned	2023-10-06T08:39:37Z	-
dc.date.available	2023-10-06T08:39:37Z	-
dc.date.issued	2022-11-06	-
dc.identifier.uri	http://hdl.handle.net/10722/333852	-
dc.description.abstract	Scalable 6D pose estimation for rigid objects from RGB images aims at handling multiple objects and generalizing to novel objects. Building on a well-known auto-encoding framework to cope with object symmetry and the lack of labeled training data, we achieve scalability by disentangling the latent representation of the auto-encoder into shape and pose sub-spaces. The latent shape space models the similarity of different objects through contrastive metric learning, and the latent pose code is compared with canonical rotations for rotation retrieval. Because different object symmetries induce inconsistent latent pose spaces, we re-entangle the shape representation with canonical rotations to generate shape-dependent pose codebooks for rotation retrieval. We show state-of-the-art performance on two benchmarks, containing textureless CAD objects without categories and daily objects with categories, respectively, and further demonstrate improved scalability by extending to a more challenging setting of daily objects across categories.	-
dc.language	eng	-
dc.relation.ispartof	17th European Conference on Computer Vision, ECCV 2022 (23/10/2022-27/10/2022, Tel Aviv, Israel)	-
dc.subject	6D pose estimation	-
dc.subject	Disentanglement	-
dc.subject	Re-entanglement	-
dc.subject	Scalability	-
dc.subject	Sim-to-real	-
dc.subject	Symmetry ambiguity	-
dc.title	DISP6D: Disentangled Implicit Shape and Pose Learning for Scalable 6D Pose Estimation	-
dc.type	Conference_Paper	-
dc.identifier.doi	10.1007/978-3-031-20077-9_24	-
dc.identifier.scopus	eid_2-s2.0-85142754501	-
dc.identifier.volume	13669 LNCS	-
dc.identifier.spage	404	-
dc.identifier.epage	421	-
