
Source: Analyticsindiamag

This Developer Ran the 671 Billion Parameter DeepSeek-R1 Model—Without a GPU

  • Software engineer John Leimgruber ran the full 671-billion-parameter DeepSeek-R1 model without a GPU.
  • He used a quantised, non-distilled version of the model, which retained good quality despite the compression.
  • The model natively stores its weights in 8 bits, which makes it efficient by default and keeps the file size down.
  • He ran the model by memory-mapping the weights from a fast NVMe SSD and keeping only the KV cache in RAM.
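
To make the last two points concrete, here is a minimal Python sketch of the two ideas involved: how bit width determines the size of the weights, and how memory mapping lets a program read part of a large file without loading all of it into RAM. It is an illustration only, not Leimgruber's actual setup. The function names, the 1 MiB stand-in file, and the offsets are assumptions made for this example.

```python
import mmap
import os
import tempfile


def weight_size_gb(n_params: float, bits_per_param: float) -> float:
    """Approximate size of the weights alone, in gigabytes (10**9 bytes)."""
    return n_params * bits_per_param / 8 / 1e9


def read_slice_via_mmap(path: str, offset: int, length: int) -> bytes:
    """Map a file read-only and copy out one slice.

    The operating system pages in only the regions that are touched,
    so the whole file never has to fit in RAM at once.
    """
    with open(path, "rb") as f:
        with mmap.mmap(f.fileno(), 0, access=mmap.ACCESS_READ) as mm:
            return mm[offset:offset + length]


def demo() -> bytes:
    # Hypothetical stand-in for a weights file: 1 MiB of repeating bytes.
    with tempfile.NamedTemporaryFile(delete=False) as tmp:
        tmp.write(bytes(range(256)) * 4096)
        path = tmp.name
    try:
        # Read 4 bytes starting at byte 512 without reading the rest.
        return read_slice_via_mmap(path, 512, 4)
    finally:
        os.remove(path)


if __name__ == "__main__":
    # 671 billion parameters at 8 bits each.
    print(weight_size_gb(671e9, 8), "GB")
    print(demo())
```

At 8 bits per parameter the weights alone come to roughly 671 GB, and the figure shrinks in proportion as quantisation lowers the bit width. Reading from a mapped file is slower than reading from RAM, which is why a fast NVMe SSD matters for this approach.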
