Title | Towards Generic 3D Tracking in RGBD Videos: Benchmark and Baseline |
Author | |
Corresponding Author | Feng Zheng |
DOI | |
Publication Years | 2022
|
Conference Name | 17th European Conference on Computer Vision (ECCV)
|
ISSN | 0302-9743
|
EISSN | 1611-3349
|
ISBN | 978-3-031-20046-5
|
Source Title | |
Volume | 13682
|
Conference Date | OCT 23-27, 2022
|
Conference Place | null,Tel Aviv,ISRAEL
|
Publication Place | GEWERBESTRASSE 11, CHAM, CH-6330, SWITZERLAND
|
Publisher | |
Abstract | Tracking in 3D scenes is gaining momentum because of its numerous applications in robotics, autonomous driving, and scene understanding. Currently, 3D tracking is limited to specific model-based approaches involving point clouds, which impedes 3D trackers from applying in natural 3D scenes. RGBD sensors provide a more reasonable and acceptable solution for 3D object tracking due to their readily available synchronised color and depth information. Thus, in this paper, we investigate a novel problem: is it possible to track a generic (class-agnostic) 3D object in RGBD videos and predict 3D bounding boxes of the object of interest? To inspire research on this topic, we newly construct a standard benchmark for generic 3D object tracking, `Track-it-in-3D', which contains 300 RGBD video sequences with dense 3D annotations and corresponding evaluation protocols. Furthermore, we propose an effective tracking baseline to estimate 3D bounding boxes for arbitrary objects in RGBD videos, by fusing appearance and spatial information effectively. Resources are available on https://github.com/yjybuaa/Track- it-in-3D. |
Keywords | |
SUSTech Authorship | First
; Corresponding
|
Language | English
|
URL | [Source Record] |
Indexed By | |
WOS Research Area | Computer Science
; Imaging Science & Photographic Technology
|
WOS Subject | Computer Science, Artificial Intelligence
; Imaging Science & Photographic Technology
|
WOS Accession No | WOS:000904116000007
|
Data Source | Web of Science
|
Citation statistics |
Cited Times [WOS]:0
|
Document Type | Conference paper |
Identifier | http://kc.sustech.edu.cn/handle/2SGJ60CL/415624 |
Department | Department of Computer Science and Engineering |
Affiliation | 1.Department of Computer Science and Engineering, Southern University of Science and Technology, Shenzhen, China 2.University of Birmingham, Birmingham, United Kingdom |
First Author Affilication | Department of Computer Science and Engineering |
Corresponding Author Affilication | Department of Computer Science and Engineering |
First Author's First Affilication | Department of Computer Science and Engineering |
Recommended Citation GB/T 7714 |
Jinyu Yang,Zhongqun Zhang,Zhe Li,et al. Towards Generic 3D Tracking in RGBD Videos: Benchmark and Baseline[C]. GEWERBESTRASSE 11, CHAM, CH-6330, SWITZERLAND:SPRINGER INTERNATIONAL PUBLISHING AG,2022.
|
Files in This Item: | There are no files associated with this item. |
|
Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.
Edit Comment