CanItRun Logocanitrun.

AMD Radeon RX 7900 XT

The AMD Radeon RX 7900 XT has 20 GB VRAM and 800 GB/s memory bandwidth. It can run 42 of our 70 tracked models natively in VRAM at 8k context.

With 20 GB GDDR6, the AMD Radeon RX 7900 XT is a consumer-tier GPU that can run 42 models natively. It handles 30B-class models at Q4 quantization.

AMD Radeon RX 7900 XT: 20GB GDDR6 at 800 GB/s — RDNA 3 mid-range.

7B-14B at Q4 native. 27B tight at Q4. ~7-12 t/s for 7B.

ROCm Linux support. Vulkan on Windows. 20GB is awkward capacity.

VendorAMD
ArchitectureRDNA 3
VRAM20 GB
Memory typeGDDR6
Memory bandwidth800 GB/s
Compute backendROCM
TierConsumer
Released2022
Models (native)42 / 70
Models (offload)9 / 70
Software: ROCm is Linux-only; on Windows use the Vulkan backend instead. Requires llama.cpp compiled with ROCm support.

Popular models for this GPU

Models this GPU runs natively in VRAM (42)

Models that fit with CPU offload (9)

These use system RAM for layers that don't fit in VRAM — expect much slower inference.

Too large for this GPU (19)

Frequently asked questions

How much VRAM does the AMD Radeon RX 7900 XT have?
The AMD Radeon RX 7900 XT has 20 GB of GDDR6 with 800 GB/s memory bandwidth.
What is the AMD Radeon RX 7900 XT best for?
With 20 GB of VRAM, the AMD Radeon RX 7900 XT handles smaller models (7B–14B) at Q4–Q5 quantization — ideal for entry-level local LLM experimentation and lightweight inference.
What LLMs can the AMD Radeon RX 7900 XT run locally?
The AMD Radeon RX 7900 XT can run 42 of the 70 open-weight models tracked by CanItRun natively in VRAM at 8k context. Top options include: Llama 3.1 8B Instruct at Q8_0, Llama 3.2 3B Instruct at FP32, Llama 3.2 1B Instruct at FP32.
Can the AMD Radeon RX 7900 XT run Llama 3.3 70B Instruct?
The AMD Radeon RX 7900 XT can run Llama 3.3 70B Instruct with CPU offload at Q3_K_M quantization, but inference will be slower than native VRAM execution.
Can the AMD Radeon RX 7900 XT run Qwen 3.6 27B?
Yes. The AMD Radeon RX 7900 XT runs Qwen 3.6 27B natively in VRAM at Q4_K_M quantization, achieving approximately 52.6 tokens per second.
Can the AMD Radeon RX 7900 XT run Llama 3.1 8B Instruct?
Yes. The AMD Radeon RX 7900 XT runs Llama 3.1 8B Instruct natively in VRAM at Q8_0 quantization, achieving approximately 100 tokens per second.