Name: DeepSeek V4 Flash 284B
Author: DeepSeek

Question 1

What are the VRAM requirements for DeepSeek V4 Flash 284B?

Accepted Answer

DeepSeek V4 Flash 284B requires approximately 194.5 GB of VRAM at Q4_K_M quantization, 338.9 GB at Q8, and 637.0 GB at FP16. These numbers assume 8k context window; VRAM scales linearly with context length due to the KV cache.

Question 2

How many parameters does DeepSeek V4 Flash 284B have?

Accepted Answer

DeepSeek V4 Flash 284B has 284 billion total parameters, but only 13 billion are active per token thanks to its Mixture of Experts (MoE) architecture. This makes inference significantly faster than the total parameter count suggests.

Question 3

How capable is DeepSeek V4 Flash 284B?

Accepted Answer

DeepSeek V4 Flash 284B achieves an MMLU-Pro score of 86.3, placing it among the most capable open-weight models available — competitive with frontier systems on general knowledge and reasoning.

Question 4

Can DeepSeek V4 Flash 284B run on a 16 GB GPU?

Accepted Answer

No. At Q4_K_M, DeepSeek V4 Flash 284B needs 194.5 GB of VRAM — more than 16 GB. You will need a multi-GPU server.

Question 5

Can DeepSeek V4 Flash 284B run on a 24 GB GPU?

Accepted Answer

No. Even at Q4_K_M, DeepSeek V4 Flash 284B needs 194.5 GB. Consider a multi-GPU server with 80 GB+ total VRAM.

Question 6

What is the smallest quantization for DeepSeek V4 Flash 284B that fits in 24 GB of VRAM?

Accepted Answer

DeepSeek V4 Flash 284B cannot fit in 24 GB of VRAM at any standard quantization level. The minimum needed is 122.0 GB at Q2_K.

Question 7

What GPU do I need to run DeepSeek V4 Flash 284B locally?

Accepted Answer

You need a multi-GPU server. At Q4_K_M, DeepSeek V4 Flash 284B needs 194.5 GB VRAM, more than any single consumer GPU. Consider 2–4× H100 or A100 GPUs.

Quant	Weights	KV cache	Total
FP32	1136.0 GB	0.72 GB	1273.1 GB
BF16	568.0 GB	0.72 GB	637.0 GB
FP16	568.0 GB	0.72 GB	637.0 GB
Q8_0	301.9 GB	0.72 GB	338.9 GB
Q6_K	233.2 GB	0.72 GB	261.9 GB
Q5_K_M	202.2 GB	0.72 GB	227.3 GB
Q4_K_M	173.0 GB	0.72 GB	194.5 GB
Q3_K_M	136.6 GB	0.72 GB	153.8 GB
Q2_Krec	108.2 GB	0.72 GB	122.0 GB
NVFP4cuda	142.0 GB	0.72 GB	159.8 GB

DeepSeek V4 Flash 284B

VRAM at each quantization

Benchmarks

GPUs that run DeepSeek V4 Flash 284B natively (7)

Notes

Continue reading

Frequently asked questions