menu
techminis

A naukri.com initiative

google-web-stories
Home

>

ML News

>

Resolving ...
source image

Arxiv

2d

read

294

img
dot

Image Credit: Arxiv

Resolving Token-Space Gradient Conflicts: Token Space Manipulation for Transformer-Based Multi-Task Learning

  • Multi-Task Learning (MTL) in shared networks can lead to negative transfer due to differences in task objectives.
  • Pre-trained transformers have limitations in adaptability, motivating the development of Dynamic Token Modulation and Expansion (DTME-MTL).
  • DTME-MTL addresses gradient conflicts in token space to enhance adaptability and reduce overfitting without duplicating network parameters.
  • Experiments show that DTME-MTL offers a scalable and efficient solution for improving transformer-based MTL models.

Read Full Article

like

17 Likes

For uninterrupted reading, download the app