中文版 | English
Title

Trans6D: Transformer-Based 6D Object Pose Estimation and Refinement

Author
Corresponding AuthorZhang,Zhongqun
DOI
Publication Years
2023
ISSN
0302-9743
EISSN
1611-3349
Source Title
Volume
13808 LNCS
Pages
112-128
Abstract
Estimating 6D object pose from a monocular RGB image remains challenging due to factors such as texture-less and occlusion. Although convolution neural network (CNN)-based methods have made remarkable progress, they are not efficient in capturing global dependencies and often suffer from information loss due to downsampling operations. To extract robust feature representation, we propose a Transformer-based 6D object pose estimation approach (Trans6D). Specifically, we first build two transformer-based strong baselines and compare their performance: pure Transformers following the ViT (Trans6D-pure) and hybrid Transformers integrating CNNs with Transformers (Trans6D-hybrid). Furthermore, two novel modules have been proposed to make the Trans6D-pure more accurate and robust: (i) a patch-aware feature fusion module. It decreases the number of tokens without information loss via shifted windows, cross-attention, and token pooling operations, which is used to predict dense 2D-3D correspondence maps; (ii) a pure Transformer-based pose refinement module (Trans6D+) which refines the estimated poses iteratively. Extensive experiments show that the proposed approach achieves state-of-the-art performances on two datasets.
Keywords
SUSTech Authorship
Others
Language
English
URL[Source Record]
Scopus EID
2-s2.0-85151354860
Data Source
Scopus
Citation statistics
Cited Times [WOS]:0
Document TypeConference paper
Identifierhttp://kc.sustech.edu.cn/handle/2SGJ60CL/524279
DepartmentSouthern University of Science and Technology
Affiliation
1.University of Birmingham,Birmingham,United Kingdom
2.Defense Innovation Institute,Boston,China
3.SUSTech,Shenzhen,China
Recommended Citation
GB/T 7714
Zhang,Zhongqun,Chen,Wei,Zheng,Linfang,et al. Trans6D: Transformer-Based 6D Object Pose Estimation and Refinement[C],2023:112-128.
Files in This Item:
There are no files associated with this item.
Related Services
Recommend this item
Bookmark
Usage statistics
Export to Endnote
Export to Excel
Export to Csv
Altmetrics Score
Google Scholar
Similar articles in Google Scholar
[Zhang,Zhongqun]'s Articles
[Chen,Wei]'s Articles
[Zheng,Linfang]'s Articles
Baidu Scholar
Similar articles in Baidu Scholar
[Zhang,Zhongqun]'s Articles
[Chen,Wei]'s Articles
[Zheng,Linfang]'s Articles
Bing Scholar
Similar articles in Bing Scholar
[Zhang,Zhongqun]'s Articles
[Chen,Wei]'s Articles
[Zheng,Linfang]'s Articles
Terms of Use
No data!
Social Bookmark/Share
No comment.

Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.