text-generation-webui
Power-user local LLM frontend with maximum backend flexibility. Transformers, ExLlamaV2/V3, llama.cpp, GPTQ, AWQ — all in one web UI.
Category: Local LLM tool, chat frontend
Works offline: Yes
Needs GPU: Yes — for local inference
Best for: Power users needing maximum backend/model format flexibility
Difficulty: Hard
Platforms: Web, Windows, Linux
Pricing: Open source — free
text-generation-webui (formerly Oobabooga, now TextGen v4.x; 47K GitHub stars) is a self-hosted, power-user web UI for local LLMs with maximum backend flexibility. It supports the widest range of backends of any local frontend: Transformers, ExLlamaV2/V3, llama.cpp, AutoGPTQ, AutoAWQ, and more, all in one interface.
text-generation-webui runs entirely on your local hardware, and it is open source (https://github.com/oobabooga/text-generation-webui), so you can inspect the code and self-host. Hardware requirements come from the model and backend you choose, not from the UI itself; see below.
Can it run on my hardware?
Minimum
8 GB VRAM minimum for 7B models. The web UI itself is lightweight. GPU requirements come from the model and backend you choose. ExLlamaV2 is the fastest for NVIDIA GPUs.
Recommended
24 GB VRAM (RTX 3090/4090) for 30B models with ExLlamaV2. 48 GB+ for 70B models at Q4 quantization (~47 GB; see the table below). The Python environment needs separate management — use conda or venv to avoid system package conflicts.
Approximate VRAM needed for recommended local models at Q4 with 8K context:
| Model | Params | Q4 VRAM | Min GPU VRAM |
|---|---|---|---|
| Qwen3 32B | 32.8B | ~22.2 GB | 24 GB |
| Llama 3.1 70B Instruct | 70B | ~47.1 GB | 48 GB+ |
| Qwen3 235B-A22B (MoE) | 235B | ~149.9 GB | 48 GB+ |
| Mistral Small 22B | 22.2B | ~16.1 GB | 24 GB |
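The Q4 figures above are roughly predictable from parameter count alone. As a back-of-envelope sketch (this estimator is for illustration and is not the methodology behind the table): Q4_K_M quantization stores about 4.85 bits per weight, and an fp16 KV cache adds 2 × layers × KV heads × head dim × context × 2 bytes on top.

```python
# Rough VRAM estimate for a Q4-quantized dense model: quantized weights plus
# an fp16 KV cache. 4.85 bits/weight approximates llama.cpp's Q4_K_M; actual
# usage varies by backend, quant variant, and runtime overhead.

def q4_vram_gb(params_b: float, n_layers: int, n_kv_heads: int,
               head_dim: int = 128, context: int = 8192,
               bits_per_weight: float = 4.85) -> float:
    weights = params_b * 1e9 * bits_per_weight / 8                 # bytes
    kv_cache = 2 * n_layers * n_kv_heads * head_dim * context * 2  # K+V, fp16
    return (weights + kv_cache) / 1e9

# Qwen3 32B (architecture numbers are illustrative: 64 layers, 8 KV heads)
print(f"{q4_vram_gb(32.8, 64, 8):.1f} GB")  # ≈22 GB, near the 22.2 GB above
```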
App compatibility
| Feature | Supported |
|---|---|
| Local models | Yes |
| OpenRouter | No |
| OpenAI-compatible API | Yes |
| Ollama | No |
| LM Studio | No |
| Anthropic API | No |
| Google API | No |
| Mistral API | No |
| Docker | No |
| Works offline | Yes |
| Needs GPU | Yes |
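Because the server exposes an OpenAI-compatible API, existing OpenAI SDK code can be pointed at it directly. A minimal sketch, assuming the server was launched with the --api flag and the default API port of 5000 (the model name is a placeholder; the server answers with whichever model is currently loaded):

```python
# Query text-generation-webui's OpenAI-compatible endpoint.
# Assumes: server started with --api, default port 5000, no API key configured.
from openai import OpenAI

client = OpenAI(
    base_url="http://127.0.0.1:5000/v1",
    api_key="not-needed",  # local server; no key required unless you set --api-key
)

resp = client.chat.completions.create(
    model="local",  # placeholder; the currently loaded model is used
    messages=[{"role": "user", "content": "Explain GQA in two sentences."}],
    max_tokens=200,
)
print(resp.choices[0].message.content)
```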
Recommended models
Qwen3 32B, Llama 3.1 70B Instruct, Qwen3 235B-A22B (MoE), and Mistral Small 22B all run well in text-generation-webui; the VRAM table above shows what each needs at Q4. The best choice depends on your GPU's VRAM and your use case.
Local vs cloud: which should you use?
Use local models if
- You want privacy — data never leaves your machine
- You already have a GPU with sufficient VRAM
- You want zero per-token API costs
- You need offline access
- You have at least 16-24 GB VRAM for recommended models
Use cloud/API if
- Your GPU has insufficient VRAM for the models you need
- You want access to frontier model quality
- You need maximum coding/reasoning performance
- You don't want to manage local model downloads and updates
Setup overview
Setting up text-generation-webui is complex and requires technical knowledge: clone the GitHub repository, run the bundled start script for your operating system (it creates an isolated Python environment and installs dependencies), then open the locally served web UI in your browser. It runs on Windows and Linux.
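Once installed, models go in the models/ folder of the install directory. As an illustrative sketch using huggingface_hub (the repo ID and filename below are placeholders; substitute the quant you actually want):

```python
# Download a GGUF quant into text-generation-webui's models/ folder so the
# llama.cpp loader can find it. Repo and filename are illustrative examples.
from huggingface_hub import hf_hub_download

hf_hub_download(
    repo_id="bartowski/Mistral-Small-Instruct-2409-GGUF",  # placeholder repo
    filename="Mistral-Small-Instruct-2409-Q4_K_M.gguf",    # placeholder quant
    local_dir="text-generation-webui/models",              # adjust to your install path
)
```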
Limitations
text-generation-webui is not a good fit for:
- Beginners — use LM Studio or Ollama instead
- Stable production setups — updates can break things
- OpenRouter or cloud API access — it is strictly a local inference frontend
Frequently asked questions
- What is text-generation-webui?
- text-generation-webui (formerly Oobabooga, now TextGen v4.x; 47K GitHub stars) is a self-hosted, power-user web UI for local LLMs with maximum backend flexibility. It supports the widest range of backends of any local frontend: Transformers, ExLlamaV2/V3, llama.cpp, AutoGPTQ, AutoAWQ, and more.
- Does text-generation-webui need a GPU?
- For comfortable speeds, yes: plan on 8 GB VRAM minimum for 7B models. The web UI itself is lightweight; GPU requirements come from the model and backend you choose, and ExLlamaV2 is the fastest backend for NVIDIA GPUs. CPU-only operation is possible but much slower (see the next question).
- Can I run text-generation-webui on CPU only?
- Yes — text-generation-webui supports CPU-only operation, but performance will be significantly slower (5-10x) compared to GPU inference. CPU-only works best for models under 7B parameters with at least 16 GB of system RAM.
- What models work best with text-generation-webui?
- Models that work well with text-generation-webui include: Qwen3 32B, Llama 3.1 70B Instruct, Qwen3 235B-A22B (MoE), Mistral Small 22B. The best model depends on your GPU's VRAM and your use case.
- Is text-generation-webui free and open source?
- Yes. text-generation-webui is open source and completely free. You can find the source code on GitHub at https://github.com/oobabooga/text-generation-webui.