menu
techminis

A naukri.com initiative

google-web-stories
Home

>

ML News

>

Domain-Spe...
source image

Arxiv

1w

read

203

img
dot

Image Credit: Arxiv

Domain-Specific Pruning of Large Mixture-of-Experts Models with Few-shot Demonstrations

  • Mixture-of-Experts models achieve performance and inference efficiency by activating only a subset of experts.
  • Large-scale Mixture-of-Experts models face the limitation of storing all experts which leads to significant memory overhead.
  • A pruning framework called EASY-EP is proposed, which utilizes domain-specific demonstrations to identify and retain the most relevant experts.
  • EASY-EP can achieve comparable performance and higher throughput while reducing memory usage by half in the DeepSeek-R1 model.

Read Full Article

like

12 Likes

For uninterrupted reading, download the app