中文版 | English
Title

Learning Conflict-Noticed Architecture for Multi-Task Learning

Author
Corresponding AuthorZhang,Yu
Publication Years
2023-06-27
Source Title
Volume
37
Pages
11078-11086
Abstract
Multi-task learning has been widely used in many applications to enable more efficient learning by sharing part of the architecture across multiple tasks. However, a major challenge is the gradient conflict when optimizing the shared parameters, where the gradients of different tasks could have opposite directions. Directly averaging those gradients will impair the performance of some tasks and cause negative transfer. Different from most existing works that manipulate gradients to mitigate the gradient conflict, in this paper, we address this problem from the perspective of architecture learning and propose a Conflict-Noticed Architecture Learning (CoNAL) method to alleviate the gradient conflict by learning architectures. By introducing purely-specific modules specific to each task in the search space, the CoNAL method can automatically learn when to switch to purely-specific modules in the tree-structured network architectures when the gradient conflict occurs. To handle multi-task problems with a large number of tasks, we propose a progressive extension of the CoNAL method. Extensive experiments on computer vision, natural language processing, and reinforcement learning benchmarks demonstrate the effectiveness of the proposed methods. The code of CoNAL is publicly available.
SUSTech Authorship
First ; Corresponding
Language
English
URL[Source Record]
Funding Project
National Natural Science Foundation of China[62076118];National Natural Science Foundation of China[62136005];Shenzhen Technical Project[JCYJ20210324105000003];
Scopus EID
2-s2.0-85168235527
Data Source
Scopus
Document TypeConference paper
Identifierhttp://kc.sustech.edu.cn/handle/2SGJ60CL/559910
DepartmentDepartment of Computer Science and Engineering
Affiliation
1.Department of Computer Science and Engineering,Southern University of Science and Technology,Shenzhen,China
2.University of Technology Sydney,Australia
3.Peng Cheng Laboratory,Shenzhen,China
First Author AffilicationDepartment of Computer Science and Engineering
Corresponding Author AffilicationDepartment of Computer Science and Engineering
First Author's First AffilicationDepartment of Computer Science and Engineering
Recommended Citation
GB/T 7714
Yue,Zhixiong,Zhang,Yu,Liang,Jie. Learning Conflict-Noticed Architecture for Multi-Task Learning[C],2023:11078-11086.
Files in This Item:
There are no files associated with this item.
Related Services
Fulltext link
Recommend this item
Bookmark
Usage statistics
Export to Endnote
Export to Excel
Export to Csv
Altmetrics Score
Google Scholar
Similar articles in Google Scholar
[Yue,Zhixiong]'s Articles
[Zhang,Yu]'s Articles
[Liang,Jie]'s Articles
Baidu Scholar
Similar articles in Baidu Scholar
[Yue,Zhixiong]'s Articles
[Zhang,Yu]'s Articles
[Liang,Jie]'s Articles
Bing Scholar
Similar articles in Bing Scholar
[Yue,Zhixiong]'s Articles
[Zhang,Yu]'s Articles
[Liang,Jie]'s Articles
Terms of Use
No data!
Social Bookmark/Share
No comment.

Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.