中文版 | English
Title

DISP6D: Disentangled Implicit Shape and Pose Learning for Scalable 6D Pose Estimation

Author
Corresponding AuthorWen,Yilin
DOI
Publication Years
2022
Conference Name
17th European Conference on Computer Vision (ECCV)
ISSN
0302-9743
EISSN
1611-3349
ISBN
978-3-031-20076-2
Source Title
Volume
13669 LNCS
Pages
404-421
Conference Date
OCT 23-27, 2022
Conference Place
null,Tel Aviv,ISRAEL
Publication Place
GEWERBESTRASSE 11, CHAM, CH-6330, SWITZERLAND
Publisher
Abstract
Scalable 6D pose estimation for rigid objects from RGB images aims at handling multiple objects and generalizing to novel objects. Building on a well-known auto-encoding framework to cope with object symmetry and the lack of labeled training data, we achieve scalability by disentangling the latent representation of auto-encoder into shape and pose sub-spaces. The latent shape space models the similarity of different objects through contrastive metric learning, and the latent pose code is compared with canonical rotations for rotation retrieval. Because different object symmetries induce inconsistent latent pose spaces, we re-entangle the shape representation with canonical rotations to generate shape-dependent pose codebooks for rotation retrieval. We show state-of-the-art performance on two benchmarks containing textureless CAD objects without category and daily objects with categories respectively, and further demonstrate improved scalability by extending to a more challenging setting of daily objects across categories.
Keywords
SUSTech Authorship
Others
Language
English
URL[Source Record]
Indexed By
WOS Research Area
Computer Science ; Imaging Science & Photographic Technology
WOS Subject
Computer Science, Artificial Intelligence ; Imaging Science & Photographic Technology
WOS Accession No
WOS:000897132300024
Scopus EID
2-s2.0-85142754501
Data Source
Scopus
Citation statistics
Cited Times [WOS]:0
Document TypeConference paper
Identifierhttp://kc.sustech.edu.cn/handle/2SGJ60CL/416580
DepartmentSouthern University of Science and Technology
Affiliation
1.The University of Hong Kong,Hong Kong
2.Brown University,Providence,United States
3.Microsoft Research Asia,Beijing,China
4.Centre for Garment Production Limited,Hong Kong
5.SUSTech,Shenzhen,China
6.Texas A &M University,College Station,United States
Recommended Citation
GB/T 7714
Wen,Yilin,Li,Xiangyu,Pan,Hao,et al. DISP6D: Disentangled Implicit Shape and Pose Learning for Scalable 6D Pose Estimation[C]. GEWERBESTRASSE 11, CHAM, CH-6330, SWITZERLAND:SPRINGER INTERNATIONAL PUBLISHING AG,2022:404-421.
Files in This Item:
There are no files associated with this item.
Related Services
Recommend this item
Bookmark
Usage statistics
Export to Endnote
Export to Excel
Export to Csv
Altmetrics Score
Google Scholar
Similar articles in Google Scholar
[Wen,Yilin]'s Articles
[Li,Xiangyu]'s Articles
[Pan,Hao]'s Articles
Baidu Scholar
Similar articles in Baidu Scholar
[Wen,Yilin]'s Articles
[Li,Xiangyu]'s Articles
[Pan,Hao]'s Articles
Bing Scholar
Similar articles in Bing Scholar
[Wen,Yilin]'s Articles
[Li,Xiangyu]'s Articles
[Pan,Hao]'s Articles
Terms of Use
No data!
Social Bookmark/Share
No comment.

Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.