Title | Indicator-Aware Talking Face Generation Based on Evolutionary Multi-objective Optimization |
Author | |
Corresponding Author | Wenjing Hong |
DOI | |
Publication Years | 2022
|
Conference Name | IEEE Symposium Series on Computational Intelligence (IEEE SSCI)
|
ISBN | 978-1-6654-8769-6
|
Source Title | |
Pages | 619-625
|
Conference Date | 4-7 Dec. 2022
|
Conference Place | Singapore, Singapore
|
Publication Place | 345 E 47TH ST, NEW YORK, NY 10017 USA
|
Publisher | |
Abstract | Audio-driven talking face generation (ATFG) is typically multifaceted, requiring high-quality faces, lip movements synchronized with the audio, and plausible facial expressions. Previous works have mainly focused on minimizing a single loss function constructed on the basis of consideration of the multiple performance requirements for an ATFG model. However, as a proxy, the loss function does not always yield optimal performance when minimised. Moreover, it is unlikely that there is a single model that can be optimal in terms of all relevant quality indicators. In this paper, we formulate the training of an ATFG model as an indicator-aware multi-objective optimization problem and propose a novel approach, namely Evolutionary Multi-objective Indicator-aware audio-driven talking face Generation (EMIG), to solve this problem. EMIG explicitly uses the quality indicators in the design process of ATFG models and can be easily adapted to the preferences of users. Experimental studies demonstrate the potential advantages of evolutionary multi-objective optimization for solving the task of ATFG and the effectiveness of EMIG over five state-of-the-art ATFG methods. |
Keywords | |
SUSTech Authorship | First
; Corresponding
|
Language | English
|
URL | [Source Record] |
Indexed By | |
Funding Project | National Natural Science Foundation of China[62106098]
|
WOS Research Area | Computer Science
|
WOS Subject | Computer Science, Artificial Intelligence
; Computer Science, Interdisciplinary Applications
; Computer Science, Theory & Methods
|
WOS Accession No | WOS:000971973800083
|
EI Accession Number | 20230713582872
|
Data Source | IEEE
|
PDF url | https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10022251 |
Citation statistics |
Cited Times [WOS]:0
|
Document Type | Conference paper |
Identifier | http://kc.sustech.edu.cn/handle/2SGJ60CL/425463 |
Department | Department of Computer Science and Engineering |
Affiliation | 1.Department of Computer Science and Engineering, Southern University of Science and Technology, Shenzhen, China 2.Ping An Technology, Shenzhen, China |
First Author Affilication | Department of Computer Science and Engineering |
Corresponding Author Affilication | Department of Computer Science and Engineering |
First Author's First Affilication | Department of Computer Science and Engineering |
Recommended Citation GB/T 7714 |
Ge Guo,Wenjing Hong,Chaoyong Zhou,et al. Indicator-Aware Talking Face Generation Based on Evolutionary Multi-objective Optimization[C]. 345 E 47TH ST, NEW YORK, NY 10017 USA:IEEE,2022:619-625.
|
Files in This Item: | There are no files associated with this item. |
|
Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.
Edit Comment