Name: DeepSeek R1 671B
Author: DeepSeek

Question 1

What are the VRAM requirements for DeepSeek R1 671B?

Accepted Answer

DeepSeek R1 671B requires approximately 458.2 GB of VRAM at Q4_K_M quantization, 799.4 GB at Q8, and 1503.6 GB at FP16. These numbers assume 8k context window; VRAM scales linearly with context length due to the KV cache.

Question 2

How many parameters does DeepSeek R1 671B have?

Accepted Answer

DeepSeek R1 671B has 671 billion total parameters, but only 37 billion are active per token thanks to its Mixture of Experts (MoE) architecture. This makes inference significantly faster than the total parameter count suggests.

Question 3

Is DeepSeek R1 671B good at reasoning and math?

Accepted Answer

Yes. With a MATH score of 97.3 and MMLU-Pro of 85, DeepSeek R1 671B handles complex multi-step reasoning, analytical tasks, and problem-solving well.

Question 4

Can DeepSeek R1 671B run on a 16 GB GPU?

Accepted Answer

No. At Q4_K_M, DeepSeek R1 671B needs 458.2 GB of VRAM — more than 16 GB. You will need a multi-GPU server.

Question 5

Can DeepSeek R1 671B run on a 24 GB GPU?

Accepted Answer

No. Even at Q4_K_M, DeepSeek R1 671B needs 458.2 GB. Consider a multi-GPU server with 80 GB+ total VRAM.

Question 6

What is the smallest quantization for DeepSeek R1 671B that fits in 24 GB of VRAM?

Accepted Answer

DeepSeek R1 671B cannot fit in 24 GB of VRAM at any standard quantization level. The minimum needed is 286.9 GB at Q2_K.

Question 7

What GPU do I need to run DeepSeek R1 671B locally?

Accepted Answer

You need a multi-GPU server. At Q4_K_M, DeepSeek R1 671B needs 458.2 GB VRAM, more than any single consumer GPU. Consider 2–4× H100 or A100 GPUs.

Quant	Weights	KV cache	Total
FP32	2684.0 GB	0.51 GB	3006.7 GB
BF16	1342.0 GB	0.51 GB	1503.6 GB
FP16	1342.0 GB	0.51 GB	1503.6 GB
Q8_0	713.3 GB	0.51 GB	799.4 GB
Q6_K	550.9 GB	0.51 GB	617.6 GB
Q5_K_M	477.8 GB	0.51 GB	535.7 GB
Q4_K_M	408.6 GB	0.51 GB	458.3 GB
Q3_K_M	322.8 GB	0.51 GB	362.1 GB
Q2_Krec	255.7 GB	0.51 GB	286.9 GB
NVFP4cuda	335.5 GB	0.51 GB	376.3 GB

DeepSeek R1 671B

VRAM at each quantization

Benchmarks

GPUs that run DeepSeek R1 671B natively (3)

Notes

Compare DeepSeek R1 671B with other models

How to run DeepSeek R1 671B locally

Ollama

llama.cpp

Who is DeepSeek R1 671B for?

Best for

Not ideal for

Continue reading

Frequently asked questions