Continue
Open-source AI code assistant for VS Code and JetBrains. Tab autocomplete, chat, and agent mode with separate models per role — like a local Copilot.
Coding Agent, Developer Tool
Yes
Yes
Yes
No — runs in the cloud
Copilot-like autocomplete with local models for privacy
Medium
VS Code, JetBrains
Open source — free
Continue (31K+ GitHub stars) is an open-source AI code assistant that integrates deeply into VS Code and JetBrains, offering tab autocomplete, chat, and an agent mode with separate models per role, like a local Copilot.
Continue works with both local models and cloud APIs. It supports OpenRouter for unified access to 300+ models from a single API, and its Ollama integration lets you run models locally on your own GPU. Continue is open source (https://github.com/continuedev/continue), so you can inspect the code and self-host.
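A minimal sketch of what that looks like in practice. Newer Continue releases read a YAML config (`config.yaml`); older ones use `config.json` with equivalent fields. The model tags and API key below are placeholders, and the `openrouter` provider name follows Continue's docs at the time of writing; if your version doesn't recognize it, OpenRouter also works through any OpenAI-compatible provider pointed at https://openrouter.ai/api/v1:

```yaml
# Sketch of ~/.continue/config.yaml, not a drop-in file; adjust to your setup.
name: my-assistant    # illustrative metadata
version: 0.0.1
models:
  # Local model served by Ollama (assumes `ollama pull qwen2.5:7b` was run)
  - name: Qwen 2.5 7B (local)
    provider: ollama
    model: qwen2.5:7b
  # Cloud model routed through OpenRouter: one API key, many models
  - name: Claude Sonnet (OpenRouter)
    provider: openrouter
    model: anthropic/claude-3.5-sonnet
    apiKey: YOUR_OPENROUTER_API_KEY   # placeholder, use your own key
```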
Can it run on my hardware?
Minimum
8 GB VRAM for 7B autocomplete/chat models; 16 GB for 14B agent mode. Agent mode with local models requires the tool_use capability to be declared explicitly in the model config.
Recommended
16 GB VRAM for Qwen3-14B at Q4 as agent model. 24 GB+ for Qwen3.5-35B-A3B MoE. Consider using a small local model for autocomplete and a cloud model via OpenRouter for agent tasks.
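One hedged way to wire up that split, using the `roles` and `capabilities` fields from Continue's current YAML config format (field names may differ in older `config.json` versions, and the model choices are illustrative):

```yaml
models:
  # Small, fast local model dedicated to tab autocomplete
  - name: Qwen 2.5 Coder 1.5B (autocomplete)
    provider: ollama
    model: qwen2.5-coder:1.5b
    roles:
      - autocomplete
  # Larger model for chat and agent work; per the note above, local agent
  # mode needs tool_use declared explicitly
  - name: Qwen3 14B (agent)
    provider: ollama
    model: qwen3:14b
    roles:
      - chat
      - edit
    capabilities:
      - tool_use
```

Swapping the agent entry for an OpenRouter model keeps autocomplete local (and private) while agent tasks get frontier-model quality.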
Approximate VRAM needed for recommended local models at Q4 with 8K context:
| Model | Params | Q4 VRAM | Min GPU |
|---|---|---|---|
| Qwen 3.5 35B-A3B (MoE) | 35B | ~23.0 GB | 24 GB |
| Qwen3 32B | 32.8B | ~22.2 GB | 24 GB |
| Gemma 4 26B (MoE) | 26B | ~18.0 GB | 24 GB |
| Qwen3 14B | 14.8B | ~10.8 GB | 12 GB |
| Qwen3 8B | 8B | ~6.4 GB | 8 GB |
App compatibility
| Feature | Supported |
|---|---|
| Local models | Yes |
| OpenRouter | Yes |
| OpenAI-compatible API | Yes |
| Ollama | Yes |
| LM Studio | Yes |
| Anthropic API | Yes |
| Google API | Yes |
| Mistral API | No |
| Docker | No |
| Works offline | No |
| Needs GPU | No |
Recommended models
Best local models
Local vs cloud: which should you use?
Use local models if
- You want privacy — data never leaves your machine
- You already have a GPU with sufficient VRAM
- You want zero per-token API costs
- You need offline access
Use cloud/API if
- Your GPU has insufficient VRAM for the models you need
- You want access to frontier model quality
- You need maximum coding/reasoning performance
- You don't want to manage local model downloads and updates
- You want the flexibility of OpenRouter, which lets you switch between 300+ models with one API key
Setup overview
Setting up Continue is moderately complex. It runs in VS Code and JetBrains. Full documentation is available at https://docs.continue.dev.
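In outline: install the extension from the IDE marketplace, start a model server such as Ollama, then point Continue at it in its config file. A smallest-useful config, with one local model covering both chat and autocomplete (model tag illustrative), might look like:

```yaml
name: starter
version: 0.0.1
models:
  - name: Qwen3 8B
    provider: ollama
    model: qwen3:8b   # assumes `ollama pull qwen3:8b` has been run
    roles:
      - chat
      - autocomplete
```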
Limitations
- Autonomous multi-step agentic coding is not its strength (use Cline or Roo Code)
- Agent mode with local models is still immature (use Aider instead)
- Not beginner-friendly: config.json editing is required
Related
Compatible models
Frequently asked questions
- What is Continue?
- Continue is an open-source AI code assistant for VS Code and JetBrains offering tab autocomplete, chat, and an agent mode with separate models per role, like a local Copilot. With 31K+ GitHub stars, it integrates deeply into both IDEs.
- Does Continue need a GPU?
- Continue itself does not require a GPU, but the models you connect to it do: plan on 8 GB of VRAM for 7B autocomplete/chat models and 16 GB for 14B agent mode. Agent mode with local models also requires the tool_use capability to be declared explicitly.
- Can I run Continue on CPU only?
- Yes — Continue supports CPU-only operation, but performance will be significantly slower (5-10x) compared to GPU inference. CPU-only works best for models under 7B parameters with at least 16 GB of system RAM.
- Can Continue use OpenRouter?
- Yes. Continue supports OpenRouter for accessing 300+ models through a single API. Configure OpenRouter as a provider in Continue's settings with your API key.
- Can Continue use local models via Ollama?
- Yes. Continue works with Ollama for running models locally. Install Ollama, pull your model (e.g., `ollama pull qwen2.5:7b`), and connect Continue to the local Ollama server. GPU requirements depend on the model you choose, not Continue itself.
- What is the best local model for Continue?
- For Continue, the community-verified best local model is Qwen 3.5 35B-A3B (MoE), which needs 24 GB+ of VRAM at Q4. With 16 GB of VRAM, Qwen3-14B at Q4 works well as the agent model. Consider using a small local model for autocomplete and a cloud model via OpenRouter for agent tasks.
- Can I run Continue on 12 GB VRAM?
- 12 GB VRAM is generally not sufficient for serious agentic coding with Continue. You can run smaller models (7B-14B at Q4) but tool-calling reliability and context handling will be limited. For the best experience, 24 GB VRAM (RTX 3090/4090) is the community-recommended minimum for local agentic coding.
- Is Continue free and open source?
- Yes. Continue is open source and completely free. You can find the source code on GitHub at https://github.com/continuedev/continue.