menu
techminis

A naukri.com initiative

google-web-stories
Home

>

Open Source News

>

Zyphra’s n...
source image

VentureBeat

4w

read

155

img
dot

Image Credit: VentureBeat

Zyphra’s new Zyda-2 dataset lets enterprises train small LLMs with high accuracy

  • Zyphra Technologies has released Zyda-2, an open pretraining dataset comprising 5 trillion tokens.
  • Zyda-2 has been distilled to retain the strengths of existing datasets while eliminating weaknesses.
  • Zamba2 small language model trained on Zyda-2 performs significantly better than other state-of-the-art language modeling datasets.
  • The dataset aims to help enterprises train high-accuracy small language models for edge and consumer devices.

Read Full Article

like

9 Likes

For uninterrupted reading, download the app