Title | ConferencingSpeech 2022 Challenge: Non-intrusive Objective Speech Quality Assessment (NISQA) Challenge for Online Conferencing Applications |
Author | |
DOI | |
Publication Years | 2022
|
Conference Name | Interspeech Conference
|
ISSN | 2308-457X
|
EISSN | 1990-9772
|
Source Title | |
Volume | 2022-September
|
Pages | 3308-3312
|
Conference Date | SEP 18-22, 2022
|
Conference Place | null,Incheon,SOUTH KOREA
|
Publication Place | C/O EMMANUELLE FOXONET, 4 RUE DES FAUVETTES, LIEU DIT LOUS TOURILS, BAIXAS, F-66390, FRANCE
|
Publisher | |
Abstract | With the advances in speech communication systems such as online conferencing applications, we can seamlessly work with people regardless of where they are. However, during online meetings, speech quality can be significantly affected by background noise, reverberation, packet loss, and network jitter, to name a few. Because of its nature, speech quality is traditionally assessed in subjective tests in laboratories and lately also through crowdsourcing following the international standards from the ITU-T Rec. P.800 series. However, those approaches are costly and cannot be applied to customer data. Therefore, an effective objective assessment approach is needed to evaluate or monitor the speech quality of the ongoing conversation. The ConferencingSpeech 2022 challenge targets the non-intrusive deep neural network models for the speech quality assessment task. We open-sourced a training corpus with more than 86K speech clips in different languages, with a wide range of synthesized and live degradations and their corresponding subjective quality scores through crowdsourcing. 18 teams submitted their models for evaluation in this challenge. The blind test sets included about 4300 clips from wide ranges of degradations. This paper describes the challenge, the datasets, and the evaluation methods and reports the final results. |
Keywords | |
SUSTech Authorship | Others
|
Language | English
|
URL | [Source Record] |
Indexed By | |
Funding Project | Deutsche Stiftung für Herzforschung[MO 1038/32-1];
|
WOS Research Area | Acoustics
; Audiology & Speech-Language Pathology
; Computer Science
; Engineering
|
WOS Subject | Acoustics
; Audiology & Speech-Language Pathology
; Computer Science, Artificial Intelligence
; Engineering, Electrical & Electronic
|
WOS Accession No | WOS:000900724503095
|
Scopus EID | 2-s2.0-85140094917
|
Data Source | Scopus
|
Citation statistics |
Cited Times [WOS]:0
|
Document Type | Conference paper |
Identifier | http://kc.sustech.edu.cn/handle/2SGJ60CL/406913 |
Department | Southern University of Science and Technology |
Affiliation | 1.Tencent Ethereal Audio Lab,China 2.Technical University of Berlin,Germany 3.Microsoft Corp.,United States 4.Indiana University,Bloomington,United States 5.Southern University of Science and Technology,China 6.XiDian University,China |
Recommended Citation GB/T 7714 |
Yi,Gaoxiong,Xiao,Wei,Xiao,Yiming,et al. ConferencingSpeech 2022 Challenge: Non-intrusive Objective Speech Quality Assessment (NISQA) Challenge for Online Conferencing Applications[C]. C/O EMMANUELLE FOXONET, 4 RUE DES FAUVETTES, LIEU DIT LOUS TOURILS, BAIXAS, F-66390, FRANCE:ISCA-INT SPEECH COMMUNICATION ASSOC,2022:3308-3312.
|
Files in This Item: | There are no files associated with this item. |
|
Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.
Edit Comment