Title | DISP6D: Disentangled Implicit Shape and Pose Learning for Scalable 6D Pose Estimation |
Author | |
Corresponding Author | Wen,Yilin |
DOI | |
Publication Years | 2022
|
Conference Name | 17th European Conference on Computer Vision (ECCV)
|
ISSN | 0302-9743
|
EISSN | 1611-3349
|
ISBN | 978-3-031-20076-2
|
Source Title | |
Volume | 13669 LNCS
|
Pages | 404-421
|
Conference Date | OCT 23-27, 2022
|
Conference Place | null,Tel Aviv,ISRAEL
|
Publication Place | GEWERBESTRASSE 11, CHAM, CH-6330, SWITZERLAND
|
Publisher | |
Abstract | Scalable 6D pose estimation for rigid objects from RGB images aims at handling multiple objects and generalizing to novel objects. Building on a well-known auto-encoding framework to cope with object symmetry and the lack of labeled training data, we achieve scalability by disentangling the latent representation of auto-encoder into shape and pose sub-spaces. The latent shape space models the similarity of different objects through contrastive metric learning, and the latent pose code is compared with canonical rotations for rotation retrieval. Because different object symmetries induce inconsistent latent pose spaces, we re-entangle the shape representation with canonical rotations to generate shape-dependent pose codebooks for rotation retrieval. We show state-of-the-art performance on two benchmarks containing textureless CAD objects without category and daily objects with categories respectively, and further demonstrate improved scalability by extending to a more challenging setting of daily objects across categories. |
Keywords | |
SUSTech Authorship | Others
|
Language | English
|
URL | [Source Record] |
Indexed By | |
WOS Research Area | Computer Science
; Imaging Science & Photographic Technology
|
WOS Subject | Computer Science, Artificial Intelligence
; Imaging Science & Photographic Technology
|
WOS Accession No | WOS:000897132300024
|
Scopus EID | 2-s2.0-85142754501
|
Data Source | Scopus
|
Citation statistics |
Cited Times [WOS]:0
|
Document Type | Conference paper |
Identifier | http://kc.sustech.edu.cn/handle/2SGJ60CL/416580 |
Department | Southern University of Science and Technology |
Affiliation | 1.The University of Hong Kong,Hong Kong 2.Brown University,Providence,United States 3.Microsoft Research Asia,Beijing,China 4.Centre for Garment Production Limited,Hong Kong 5.SUSTech,Shenzhen,China 6.Texas A &M University,College Station,United States |
Recommended Citation GB/T 7714 |
Wen,Yilin,Li,Xiangyu,Pan,Hao,et al. DISP6D: Disentangled Implicit Shape and Pose Learning for Scalable 6D Pose Estimation[C]. GEWERBESTRASSE 11, CHAM, CH-6330, SWITZERLAND:SPRINGER INTERNATIONAL PUBLISHING AG,2022:404-421.
|
Files in This Item: | There are no files associated with this item. |
|
Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.
Edit Comment