Question 1

What is LibreChat?

Accepted Answer

LibreChat is Enterprise self-hosted ChatGPT clone with 30+ AI providers. Multi-user admin panel, OAuth2 SSO, artifacts, code interpreter, and MCP support. LibreChat (36.8K+ GitHub stars) is the most feature-complete multi-provider chat platform for enterprise and team use.

Question 2

Does LibreChat need a GPU?

Accepted Answer

LibreChat itself does not require a GPU. However, the models you connect to it do. LibreChat itself needs no GPU. Docker host minimum: 4 GB RAM (2 GB MongoDB + 2 GB app). For local models, add GPU per model requirements. Runs fine on a $5/month VPS with cloud APIs only.

Question 3

Can I run LibreChat on CPU only?

Accepted Answer

Yes — LibreChat supports CPU-only operation, but performance will be significantly slower (5-10x) compared to GPU inference. CPU-only works best for models under 7B parameters with at least 16 GB of system RAM.

Question 4

Can LibreChat use OpenRouter?

Accepted Answer

Yes. LibreChat supports OpenRouter for accessing 300+ models through a single API. Configure OpenRouter as a provider in LibreChat's settings with your API key.

Question 5

Can LibreChat use local models via Ollama?

Accepted Answer

Yes. LibreChat works with Ollama for running models locally. Install Ollama, pull your model (e.g., `ollama pull qwen2.5:7b`), and connect LibreChat to the local Ollama server. GPU requirements depend on the model you choose, not LibreChat itself.

Question 6

What models work best with LibreChat?

Accepted Answer

Models that work well with LibreChat include: Llama 3.1 70B Instruct, Qwen3 235B-A22B (MoE), DeepSeek V3 671B, Llama 4 Maverick 400B. The best model depends on your GPU's VRAM and your use case.

Question 7

Is LibreChat free and open source?

Accepted Answer

Yes. LibreChat is open source and completely free. You can find the source code on GitHub at https://github.com/danny-avila/LibreChat.

Model	Params	Q4 VRAM	Min GPU
Llama 3.1 70B Instruct	70B	~47.1 GB	48 GB+
Qwen3 235B-A22B (MoE)	235B	~149.9 GB	48 GB+
Qwen3 32B	32.8B	~22.2 GB	24 GB

LibreChat

Can it run on my hardware?

App compatibility

Recommended models

Best local models

Best cloud/API models

Local vs cloud: which should you use?

Use local models if

Use cloud/API if

Setup overview

Limitations

Related

Recommended GPUs

Compatible models

Related apps

Frequently asked questions

Feature	Supported
Local models	Yes
OpenRouter	Yes
OpenAI-compatible API	Yes
Ollama	Yes
LM Studio	Yes
Anthropic API	Yes
Google API	Yes
Mistral API	Yes
Docker	Yes
Works offline	No
Needs GPU	No