中文版 | English
Title

Indicator-Aware Talking Face Generation Based on Evolutionary Multi-objective Optimization

Author
Corresponding AuthorWenjing Hong
DOI
Publication Years
2022
Conference Name
IEEE Symposium Series on Computational Intelligence (IEEE SSCI)
ISBN
978-1-6654-8769-6
Source Title
Pages
619-625
Conference Date
4-7 Dec. 2022
Conference Place
Singapore, Singapore
Publication Place
345 E 47TH ST, NEW YORK, NY 10017 USA
Publisher
Abstract

Audio-driven talking face generation (ATFG) is typically multifaceted, requiring high-quality faces, lip movements synchronized with the audio, and plausible facial expressions. Previous works have mainly focused on minimizing a single loss function constructed on the basis of consideration of the multiple performance requirements for an ATFG model. However, as a proxy, the loss function does not always yield optimal performance when minimised. Moreover, it is unlikely that there is a single model that can be optimal in terms of all relevant quality indicators. In this paper, we formulate the training of an ATFG model as an indicator-aware multi-objective optimization problem and propose a novel approach, namely Evolutionary Multi-objective Indicator-aware audio-driven talking face Generation (EMIG), to solve this problem. EMIG explicitly uses the quality indicators in the design process of ATFG models and can be easily adapted to the preferences of users. Experimental studies demonstrate the potential advantages of evolutionary multi-objective optimization for solving the task of ATFG and the effectiveness of EMIG over five state-of-the-art ATFG methods.

Keywords
SUSTech Authorship
First ; Corresponding
Language
English
URL[Source Record]
Indexed By
Funding Project
National Natural Science Foundation of China[62106098]
WOS Research Area
Computer Science
WOS Subject
Computer Science, Artificial Intelligence ; Computer Science, Interdisciplinary Applications ; Computer Science, Theory & Methods
WOS Accession No
WOS:000971973800083
EI Accession Number
20230713582872
Data Source
IEEE
PDF urlhttps://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10022251
Citation statistics
Cited Times [WOS]:0
Document TypeConference paper
Identifierhttp://kc.sustech.edu.cn/handle/2SGJ60CL/425463
DepartmentDepartment of Computer Science and Engineering
Affiliation
1.Department of Computer Science and Engineering, Southern University of Science and Technology, Shenzhen, China
2.Ping An Technology, Shenzhen, China
First Author AffilicationDepartment of Computer Science and Engineering
Corresponding Author AffilicationDepartment of Computer Science and Engineering
First Author's First AffilicationDepartment of Computer Science and Engineering
Recommended Citation
GB/T 7714
Ge Guo,Wenjing Hong,Chaoyong Zhou,et al. Indicator-Aware Talking Face Generation Based on Evolutionary Multi-objective Optimization[C]. 345 E 47TH ST, NEW YORK, NY 10017 USA:IEEE,2022:619-625.
Files in This Item:
There are no files associated with this item.
Related Services
Recommend this item
Bookmark
Usage statistics
Export to Endnote
Export to Excel
Export to Csv
Altmetrics Score
Google Scholar
Similar articles in Google Scholar
[Ge Guo]'s Articles
[Wenjing Hong]'s Articles
[Chaoyong Zhou]'s Articles
Baidu Scholar
Similar articles in Baidu Scholar
[Ge Guo]'s Articles
[Wenjing Hong]'s Articles
[Chaoyong Zhou]'s Articles
Bing Scholar
Similar articles in Bing Scholar
[Ge Guo]'s Articles
[Wenjing Hong]'s Articles
[Chaoyong Zhou]'s Articles
Terms of Use
No data!
Social Bookmark/Share
No comment.

Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.