中文版 | English
Title

ConferencingSpeech 2022 Challenge: Non-intrusive Objective Speech Quality Assessment (NISQA) Challenge for Online Conferencing Applications

Author
DOI
Publication Years
2022
Conference Name
Interspeech Conference
ISSN
2308-457X
EISSN
1990-9772
Source Title
Volume
2022-September
Pages
3308-3312
Conference Date
SEP 18-22, 2022
Conference Place
null,Incheon,SOUTH KOREA
Publication Place
C/O EMMANUELLE FOXONET, 4 RUE DES FAUVETTES, LIEU DIT LOUS TOURILS, BAIXAS, F-66390, FRANCE
Publisher
Abstract
With the advances in speech communication systems such as online conferencing applications, we can seamlessly work with people regardless of where they are. However, during online meetings, speech quality can be significantly affected by background noise, reverberation, packet loss, and network jitter, to name a few. Because of its nature, speech quality is traditionally assessed in subjective tests in laboratories and lately also through crowdsourcing following the international standards from the ITU-T Rec. P.800 series. However, those approaches are costly and cannot be applied to customer data. Therefore, an effective objective assessment approach is needed to evaluate or monitor the speech quality of the ongoing conversation. The ConferencingSpeech 2022 challenge targets the non-intrusive deep neural network models for the speech quality assessment task. We open-sourced a training corpus with more than 86K speech clips in different languages, with a wide range of synthesized and live degradations and their corresponding subjective quality scores through crowdsourcing. 18 teams submitted their models for evaluation in this challenge. The blind test sets included about 4300 clips from wide ranges of degradations. This paper describes the challenge, the datasets, and the evaluation methods and reports the final results.
Keywords
SUSTech Authorship
Others
Language
English
URL[Source Record]
Indexed By
Funding Project
Deutsche Stiftung für Herzforschung[MO 1038/32-1];
WOS Research Area
Acoustics ; Audiology & Speech-Language Pathology ; Computer Science ; Engineering
WOS Subject
Acoustics ; Audiology & Speech-Language Pathology ; Computer Science, Artificial Intelligence ; Engineering, Electrical & Electronic
WOS Accession No
WOS:000900724503095
Scopus EID
2-s2.0-85140094917
Data Source
Scopus
Citation statistics
Cited Times [WOS]:0
Document TypeConference paper
Identifierhttp://kc.sustech.edu.cn/handle/2SGJ60CL/406913
DepartmentSouthern University of Science and Technology
Affiliation
1.Tencent Ethereal Audio Lab,China
2.Technical University of Berlin,Germany
3.Microsoft Corp.,United States
4.Indiana University,Bloomington,United States
5.Southern University of Science and Technology,China
6.XiDian University,China
Recommended Citation
GB/T 7714
Yi,Gaoxiong,Xiao,Wei,Xiao,Yiming,et al. ConferencingSpeech 2022 Challenge: Non-intrusive Objective Speech Quality Assessment (NISQA) Challenge for Online Conferencing Applications[C]. C/O EMMANUELLE FOXONET, 4 RUE DES FAUVETTES, LIEU DIT LOUS TOURILS, BAIXAS, F-66390, FRANCE:ISCA-INT SPEECH COMMUNICATION ASSOC,2022:3308-3312.
Files in This Item:
There are no files associated with this item.
Related Services
Recommend this item
Bookmark
Usage statistics
Export to Endnote
Export to Excel
Export to Csv
Altmetrics Score
Google Scholar
Similar articles in Google Scholar
[Yi,Gaoxiong]'s Articles
[Xiao,Wei]'s Articles
[Xiao,Yiming]'s Articles
Baidu Scholar
Similar articles in Baidu Scholar
[Yi,Gaoxiong]'s Articles
[Xiao,Wei]'s Articles
[Xiao,Yiming]'s Articles
Bing Scholar
Similar articles in Bing Scholar
[Yi,Gaoxiong]'s Articles
[Xiao,Wei]'s Articles
[Xiao,Yiming]'s Articles
Terms of Use
No data!
Social Bookmark/Share
No comment.

Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.