Comparisons
53 GPU comparisons · 44 model comparisons — VRAM, bandwidth, benchmarks, and tokens per second, side by side.
GPU comparisons
Generation upgrades
- NVIDIA RTX 5090vsNVIDIA RTX 409032 GB·24 GB+8 GB VRAMCUDA · +78% bandwidth
- NVIDIA RTX 4090vsNVIDIA RTX 309024 GB·24 GBCUDA · +8% bandwidth
- NVIDIA RTX 4090vsNVIDIA RTX 3090 Ti24 GB·24 GBCUDA ·
- NVIDIA RTX 4090vsNVIDIA RTX 408024 GB·16 GB+8 GB VRAMCUDA · +41% bandwidth
- NVIDIA RTX 5090vsNVIDIA RTX 408032 GB·16 GB+16 GB VRAMCUDA · +150% bandwidth
- NVIDIA RTX 4080vsNVIDIA RTX 3080 10GB16 GB·10 GB+6 GB VRAMCUDA · -6% bandwidth
- NVIDIA RTX 5090vsNVIDIA RTX 309032 GB·24 GB+8 GB VRAMCUDA · +91% bandwidth
- Apple M4 Ultra (192GB)vsApple M2 Ultra (192GB)192 GB·192 GBMETAL · +37% bandwidth
- Apple M4 Max (128GB)vsApple M3 Max (128GB)128 GB·128 GBMETAL · +37% bandwidth
- Apple M4 Max (96GB)vsApple M3 Max (96GB)96 GB·96 GBMETAL · +37% bandwidth
- Apple M4 Max (64GB)vsApple M3 Max (64GB)64 GB·64 GBMETAL · +37% bandwidth
- Apple M4 Max (48GB)vsApple M3 Max (48GB)48 GB·48 GBMETAL · +37% bandwidth
- Apple M4 Pro (48GB)vsApple M3 Max (48GB)48 GB·48 GBMETAL · -32% bandwidth
- Apple M4 Pro (24GB)vsApple M3 Pro (18GB)24 GB·18 GB+6 GB VRAMMETAL · +82% bandwidth
- Apple M3 Max (128GB)vsApple M2 Max (96GB)128 GB·96 GB+32 GB VRAMMETAL ·
- Apple M4 Max (128GB)vsApple M1 Ultra (128GB)128 GB·128 GBMETAL · -32% bandwidth
- Apple M2 Ultra (192GB)vsApple M1 Ultra (128GB)192 GB·128 GB+64 GB VRAMMETAL ·
NVIDIA vs AMD
- NVIDIA RTX 4090vsAMD Radeon RX 7900 XTX24 GB·24 GBCUDA vs ROCM · +5% bandwidth
- NVIDIA RTX 5090vsAMD Radeon RX 7900 XTX32 GB·24 GB+8 GB VRAMCUDA vs ROCM · +87% bandwidth
- NVIDIA RTX 3090vsAMD Radeon RX 7900 XTX24 GB·24 GBCUDA vs ROCM · -3% bandwidth
- NVIDIA RTX 4080vsAMD Radeon RX 6800 XT16 GB·16 GBCUDA vs ROCM · +40% bandwidth
- NVIDIA H100 80GBvsNVIDIA A100 80GB80 GB·80 GBCUDA · +64% bandwidth
- AMD Instinct MI300XvsNVIDIA H100 80GB192 GB·80 GB+112 GB VRAMROCM vs CUDA · +58% bandwidth
NVIDIA vs Apple
- NVIDIA RTX 4090vsApple M2 Ultra (192GB)24 GB·192 GB-168 GB VRAMCUDA vs METAL · +26% bandwidth
- NVIDIA RTX 5090vsApple M4 Ultra (192GB)32 GB·192 GB-160 GB VRAMCUDA vs METAL · +64% bandwidth
- NVIDIA RTX 4090vsApple M3 Max (128GB)24 GB·128 GB-104 GB VRAMCUDA vs METAL · +152% bandwidth
- NVIDIA RTX 4090vsApple M4 Pro (48GB)24 GB·48 GB-24 GB VRAMCUDA vs METAL · +269% bandwidth
- NVIDIA RTX 6000 AdavsApple M2 Ultra (192GB)48 GB·192 GB-144 GB VRAMCUDA vs METAL · +20% bandwidth
- NVIDIA RTX 3090vsApple M3 Pro (36GB)24 GB·36 GB-12 GB VRAMCUDA vs METAL · +524% bandwidth
- NVIDIA RTX 5090vsApple M4 Max (128GB)32 GB·128 GB-96 GB VRAMCUDA vs METAL · +228% bandwidth
- NVIDIA RTX 4080vsApple M4 Pro (24GB)16 GB·24 GB-8 GB VRAMCUDA vs METAL · +163% bandwidth
Consumer sweet spot
- NVIDIA RTX 4080vsNVIDIA RTX 4070 Ti16 GB·12 GB+4 GB VRAMCUDA · +42% bandwidth
- NVIDIA RTX 4060 Ti 16GBvsNVIDIA RTX 408016 GB·16 GBCUDA · -60% bandwidth
- NVIDIA RTX 4060 Ti 16GBvsNVIDIA RTX 309016 GB·24 GB-8 GB VRAMCUDA · -69% bandwidth
- NVIDIA RTX 3060 12GBvsNVIDIA RTX 407012 GB·12 GBCUDA · -29% bandwidth
- NVIDIA RTX 4090vsNVIDIA RTX 4060 Ti 16GB24 GB·16 GB+8 GB VRAMCUDA · +250% bandwidth
- Intel Arc A770 16GBvsNVIDIA RTX 4060 Ti 16GB16 GB·16 GBVULKAN vs CUDA · +94% bandwidth
Workstation & datacenter
- NVIDIA RTX 6000 AdavsNVIDIA RTX A600048 GB·48 GBCUDA · +25% bandwidth
- NVIDIA L40SvsNVIDIA RTX 6000 Ada48 GB·48 GBCUDA · -10% bandwidth
- NVIDIA A100 80GBvsNVIDIA L40S80 GB·48 GB+32 GB VRAMCUDA · +136% bandwidth
- NVIDIA H100 80GBvsNVIDIA RTX 6000 Ada80 GB·48 GB+32 GB VRAMCUDA · +249% bandwidth
- NVIDIA DGX Spark (128GB)vsApple M4 Ultra (192GB)128 GB·192 GB-64 GB VRAMCUDA vs METAL · -75% bandwidth
- AMD Instinct MI300XvsApple M2 Ultra (192GB)192 GB·192 GBROCM vs METAL · +563% bandwidth
Apple Silicon
- Apple M4 Ultra (192GB)vsApple M4 Max (128GB)192 GB·128 GB+64 GB VRAMMETAL · +100% bandwidth
- Apple M3 Max (128GB)vsApple M2 Ultra (192GB)128 GB·192 GB-64 GB VRAMMETAL · -50% bandwidth
- Apple M4 Max (64GB)vsApple M3 Max (64GB)64 GB·64 GBMETAL · +37% bandwidth
- Apple M2 Ultra (192GB)vsApple M1 Ultra (128GB)192 GB·128 GB+64 GB VRAMMETAL ·
- Apple M3 Pro (36GB)vsApple M4 Pro (24GB)36 GB·24 GB+12 GB VRAMMETAL · -45% bandwidth
- AMD Strix Halo (128GB)vsApple M4 Max (128GB)128 GB·128 GBVULKAN vs METAL · -53% bandwidth
- AMD Strix Halo (96GB)vsApple M4 Max (96GB)96 GB·96 GBVULKAN vs METAL · -53% bandwidth
- AMD Strix Halo (64GB)vsApple M4 Max (64GB)64 GB·64 GBVULKAN vs METAL · -53% bandwidth
- AMD Strix Halo (128GB)vsApple M3 Max (128GB)128 GB·128 GBVULKAN vs METAL · -36% bandwidth
Model comparisons
70B class
- Llama 3.3 70B InstructvsQwen 2.5 72B Instruct70B·72BQ4: 42.2 GBvs43.3 GB
- Qwen 3.6 27BvsLlama 3.3 70B Instruct27B·70BQ4: 16.9 GBvs42.2 GB
- DeepSeek R1 Distill Llama 70BvsLlama 3.3 70B Instruct70B·70BQ4: 42.2 GBvs42.2 GB
- Llama 3.1 70B InstructvsLlama 3.3 70B Instruct70B·70BQ4: 42.2 GBvs42.2 GB
- Qwen3 32BvsLlama 3.3 70B Instruct32.8B·70BQ4: 19.9 GBvs42.2 GB
- Gemma 4 31BvsLlama 3.3 70B Instruct31B·70BQ4: 21 GBvs42.2 GB
27–32B sweet spot
- Qwen 3.6 27BvsQwen3 32B27B·32.8BQ4: 16.9 GBvs19.9 GB
- Qwen 3.6 27BvsGemma 3 27B Instruct27B·27BQ4: 16.9 GBvs16.8 GB
- Qwen 3.6 27BvsGemma 4 31B27B·31BQ4: 16.9 GBvs21 GB
- Qwen3 32BvsDeepSeek R1 Distill Qwen 32B32.8B·32.5BQ4: 19.9 GBvs20.6 GB
- Qwen 2.5 32B InstructvsQwen 3.6 27B32.5B·27BQ4: 20.6 GBvs16.9 GB
- Mistral Small 3.1 24B InstructvsQwen 3.6 27B24B·27BQ4: 14.9 GBvs16.9 GB
- Gemma 2 27B InstructvsQwen 3.6 27B27.2B·27BQ4: 18.7 GBvs16.9 GB
- Command-R 35BvsQwen3 32B35B·32.8BQ4: 31.6 GBvs19.9 GB
- Phi-4 14B InstructvsQwen 2.5 14B Instruct14B·14.7BQ4: 9.3 GBvs10 GB
- Phi-4 14B InstructvsGemma 3 12B Instruct14B·12.2BQ4: 9.3 GBvs8 GB
7–9B class
- Llama 3.1 8B InstructvsQwen 2.5 7B Instruct8B·7.6BQ4: 5.7 GBvs4.8 GB
- Llama 3.1 8B InstructvsGemma 2 9B Instruct8B·9.2BQ4: 5.7 GBvs8.3 GB
- Qwen 2.5 7B InstructvsMistral 7B Instruct v0.37.6B·7.25BQ4: 4.8 GBvs5.3 GB
- Qwen3 8BvsLlama 3.1 8B Instruct8B·8BQ4: 5.8 GBvs5.7 GB
- Gemma 2 9B InstructvsQwen 2.5 7B Instruct9.2B·7.6BQ4: 8.3 GBvs4.8 GB
- DeepSeek R1 Distill Llama 8BvsLlama 3.1 8B Instruct8B·8BQ4: 5.7 GBvs5.7 GB
- Gemma 3 12B InstructvsMistral Nemo 12B Instruct12.2B·12.2BQ4: 8 GBvs8.3 GB
- Gemma 3 12B InstructvsQwen 2.5 14B Instruct12.2B·14.7BQ4: 8 GBvs10 GB
Small & edge
- Llama 3.2 3B InstructvsQwen 2.5 3B Instruct3.2B·3.1BQ4: 2.8 GBvs2.1 GB
- Phi-3.5 Mini InstructvsLlama 3.2 3B Instruct3.8B·3.2BQ4: 5.7 GBvs2.8 GB
- Gemma 2 2B InstructvsLlama 3.2 1B Instruct2.6B·1.24BQ4: 2.4 GBvs1 GB
- Gemma 3 4B InstructvsLlama 3.2 3B Instruct4B·3.2BQ4: 2.8 GBvs2.8 GB
- Gemma 3 1B InstructvsLlama 3.2 1B Instruct1B·1.24BQ4: 0.9 GBvs1 GB
Reasoning models
- DeepSeek R1 Distill Qwen 32BvsQwen3 32B32.5B·32.8BQ4: 20.6 GBvs19.9 GB
- DeepSeek R1 Distill Llama 8BvsQwen3 8B8B·8BQ4: 5.7 GBvs5.8 GB
- DeepSeek R1 671BvsQwen3 235B-A22B (MoE)671B (37B active)·235B (22B active)Q4: 376.3 GBvs133.4 GB
- DeepSeek R1 Distill Llama 70BvsDeepSeek R1 Distill Qwen 32B70B·32.5BQ4: 42.2 GBvs20.6 GB
Coding models
MoE vs dense
- Qwen3 30B-A3B (MoE)vsQwen3 32B30B (3B active)·32.8BQ4: 17.7 GBvs19.9 GB
- Mixtral 8x7B Instruct v0.1vsLlama 3.1 8B Instruct46.7B (12.9B active)·8BQ4: 27.4 GBvs5.7 GB
- Mixtral 8x22B Instruct v0.1vsQwen 2.5 72B Instruct141B (39B active)·72BQ4: 81.1 GBvs43.3 GB
- Llama 4 Scout 109BvsDeepSeek R1 Distill Llama 70B109B (17B active)·70BQ4: 64 GBvs42.2 GB
Frontier
- Llama 4 Maverick 400BvsDeepSeek V3 671B400B (17B active)·671B (37B active)Q4: 228.5 GBvs376.3 GB
- Llama 4 Scout 109BvsQwen3 235B-A22B (MoE)109B (17B active)·235B (22B active)Q4: 64 GBvs133.4 GB
- DeepSeek V4 Pro 1.6TvsDeepSeek R1 671B1600B (49B active)·671B (37B active)Q4: 897.1 GBvs376.3 GB
- DeepSeek V3 671BvsQwen3 235B-A22B (MoE)671B (37B active)·235B (22B active)Q4: 376.3 GBvs133.4 GB