Question 1

What is Kilo Code?

Accepted Answer

Kilo Code is Open-source AI coding agent for VS Code, JetBrains, and CLI. The only Cline-family agent with JetBrains support. Claims highest-volume consumer on OpenRouter. Kilo Code is the newest fork in the Cline/Roo Code lineage with 16K+ GitHub stars.

Question 2

Does Kilo Code need a GPU?

Accepted Answer

24 GB VRAM recommended for local coding models. Same hardware requirements as Cline and Roo Code for equivalent model sizes.

Question 3

Can Kilo Code use OpenRouter?

Accepted Answer

Yes. Kilo Code supports OpenRouter for accessing 300+ models through a single API. Configure OpenRouter as a provider in Kilo Code's settings with your API key.

Question 4

Can Kilo Code use local models via Ollama?

Accepted Answer

Yes. Kilo Code works with Ollama for running models locally. Install Ollama, pull your model (e.g., `ollama pull qwen2.5:7b`), and connect Kilo Code to the local Ollama server. GPU requirements depend on the model you choose, not Kilo Code itself.

Question 5

What is the best local model for Kilo Code?

Accepted Answer

For Kilo Code, the community-verified best local model is Qwen3 30B-A3B (MoE). RTX 3090 or 4090 (24 GB) for Qwen3-Coder 30B at Q4. JetBrains users benefit from the IDE's own memory — allocate 8 GB+ system RAM to the IDE separately from model VRAM.

Question 6

Can I run Kilo Code on 12 GB VRAM?

Accepted Answer

12 GB VRAM is generally not sufficient for serious agentic coding with Kilo Code. You can run smaller models (7B-14B at Q4) but tool-calling reliability and context handling will be limited. For the best experience, 24 GB VRAM (RTX 3090/4090) is the community-recommended minimum for local agentic coding.

Question 7

Is Kilo Code free and open source?

Accepted Answer

Yes. Kilo Code is open source and completely free. You can find the source code on GitHub at https://github.com/kilocode/kilocode.

Model	Params	Q4 VRAM	Min GPU
Qwen3 30B-A3B (MoE)	30B	~19.8 GB	24 GB
Qwen 2.5 Coder 32B Instruct	32.5B	~22.9 GB	24 GB
Qwen3 32B	32.8B	~22.2 GB	24 GB

Feature	Supported
Local models	Yes
OpenRouter	Yes
OpenAI-compatible API	Yes
Ollama	Yes
LM Studio	Yes
Anthropic API	Yes
Google API	Yes
Mistral API	Yes
Docker	No
Works offline	No
Needs GPU	Yes

Kilo Code

Can it run on my hardware?

App compatibility

Recommended models

Best local models

Best cloud/API models

Local vs cloud: which should you use?

Use local models if

Use cloud/API if

Setup overview

Limitations

Related

Recommended GPUs

Compatible models

Related apps

Frequently asked questions