Title | Skating-Mixer: Long-Term Sport Audio-Visual Modeling with MLPs |
Author | |
Corresponding Author | Zheng,Feng |
Publication Years | 2023-06-27 |
Source Title | |
Volume | 37 |
Pages | 2901-2909 |
Abstract | Figure skating scoring is challenging because it requires judging both the technical moves of the players and their coordination with the background music. Most learning-based methods handle it poorly for two reasons: 1) each move in figure skating changes quickly, so simply applying traditional frame sampling loses a lot of valuable information, especially in videos 3 to 5 minutes long; 2) prior methods rarely considered the critical audio-visual relationship in their models. For these reasons, we introduce a novel architecture named Skating-Mixer. It extends the MLP framework in a multimodal fashion and effectively learns long-term representations through our designed memory recurrent unit (MRU). Aside from the model, we collected a high-quality audio-visual FS1000 dataset, which contains over 1000 videos on 8 types of programs with 7 different rating metrics, surpassing other datasets in both quantity and diversity. Experiments show the proposed method achieves state-of-the-art results on all major metrics on the public Fis-V dataset and our FS1000 dataset. In addition, we include an analysis applying our method to recent competitions at the Beijing 2022 Winter Olympic Games, demonstrating its strong applicability. |
SUSTech Authorship | First; Corresponding |
Language | English |
URL | [Source Record] |
Funding Project | National Natural Science Foundation of China [61972188]; National Natural Science Foundation of China [62122035] |
Scopus EID | 2-s2.0-85167997687 |
Data Source | Scopus |
Document Type | Conference paper |
Identifier | http://kc.sustech.edu.cn/handle/2SGJ60CL/559911 |
Department | Southern University of Science and Technology |
Affiliation | 1. Southern University of Science and Technology, China; 2. The Chinese University of Hong Kong, Hong Kong; 3. AI Initiative, King Abdullah University of Science and Technology (KAUST), Saudi Arabia; 4. Harbin Institute of Technology (Shenzhen), China |
First Author Affiliation | Southern University of Science and Technology |
Corresponding Author Affiliation | Southern University of Science and Technology |
First Author's First Affiliation | Southern University of Science and Technology |
Recommended Citation GB/T 7714 | Xia,Jingfei,Zhuge,Mingchen,Geng,Tiantian,et al. Skating-Mixer: Long-Term Sport Audio-Visual Modeling with MLPs[C],2023:2901-2909. |
Files in This Item: | There are no files associated with this item. |