menu
techminis

A naukri.com initiative

google-web-stories
Home

>

Open Source News

>

Nvidia AI ...
source image

Marktechpost

1M

read

332

img
dot

Nvidia AI Releases Llama-3.1-Nemotron-51B: A New LLM that Enables Running 4x Larger Workloads on a Single GPU During Inference

  • Nvidia unveiled Llama-3.1-Nemotron-51B for the running of larger workloads on a single GPU during inference with high efficiency.
  • Llama-3.1-Nemotron-51B is the optimized version of Meta’s Llama-3.1-70B model.
  • Nvidia employs Neural Architecture Search techniques in the model.
  • Llama-3.1-Nemotron-51B delivers similar or better performance, computational complexity, and cost-effectiveness than its predecessor.
  • The model significantly reduces memory consumption whilst preserving model abilities.
  • Nvidia employs block-distillation and explore different combinations of attention and feed-forward networks for architecture optimization.
  • The newly released model is more accessible and cost-effective to smaller organizations with reduced costs.
  • Llama-3.1-Nemotron-51B also offers a foundation that can be adapted to various requirements.
  • The new model has far-reaching implications for the future of generative AI and LLMs.
  • Nvidia’s Llama-3.1-Nemotron-51B sets a new standard for cost-effectiveness and accessibility in the world of AI.

Read Full Article

like

19 Likes

For uninterrupted reading, download the app