Title

Skating-Mixer: Long-Term Sport Audio-Visual Modeling with MLPs

Author
Corresponding Author
Zheng, Feng
Publication Years
2023-06-27
Source Title
Volume
37
Pages
2901-2909
Abstract
Figure skating scoring is challenging because it requires judging both the technical moves of the players and their coordination with the background music. Most learning-based methods handle it poorly for two reasons: 1) each move in figure skating changes quickly, so simply applying traditional frame sampling loses much valuable information, especially in videos that are 3 to 5 minutes long; 2) prior methods rarely considered the critical audio-visual relationship in their models. For these reasons, we introduce a novel architecture named Skating-Mixer. It extends the MLP framework in a multimodal fashion and effectively learns long-term representations through our designed memory recurrent unit (MRU). Aside from the model, we collected a high-quality audio-visual FS 1000 dataset, which contains over 1000 videos on 8 types of programs with 7 different rating metrics, surpassing other datasets in both quantity and diversity. Experiments show that the proposed method achieves state-of-the-art results on all major metrics on the public Fis-V dataset and our FS 1000 dataset. In addition, we include an analysis applying our method to recent competitions at the Beijing 2022 Winter Olympic Games, demonstrating that our method has strong applicability.
SUSTech Authorship
First ; Corresponding
Language
English
URL
[Source Record]
Funding Project
National Natural Science Foundation of China [61972188]; National Natural Science Foundation of China [62122035]
Scopus EID
2-s2.0-85167997687
Data Source
Scopus
Document Type
Conference paper
Identifier
http://kc.sustech.edu.cn/handle/2SGJ60CL/559911
Department
Southern University of Science and Technology
Affiliation
1.Southern University of Science and Technology,China
2.The Chinese University of Hong Kong,Hong Kong
3.AI Initiative,King Abdullah University of Science and Technology (KAUST),Saudi Arabia
4.Harbin Institute of Technology (Shenzhen),China
First Author Affiliation
Southern University of Science and Technology
Corresponding Author Affiliation
Southern University of Science and Technology
First Author's First Affiliation
Southern University of Science and Technology
Recommended Citation
GB/T 7714
Xia, Jingfei, Zhuge, Mingchen, Geng, Tiantian, et al. Skating-Mixer: Long-Term Sport Audio-Visual Modeling with MLPs[C], 2023: 2901-2909.

Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.