PC pour OpenClaw en local : quel matériel choisir ? | Radiance Systems

PC for OpenClaw in local: what hardware to choose? | Radiance Systems

Technical Guide · Local AI

OpenClaw is an AI agent framework designed to run entirely on your machine. But to unleash its full power—fast inference, heavy models, latency-free pipeline—your hardware matters as much as your software configuration. Here’s everything you need to know.

By the Radiance Systems Team · · 8 min read

TL;DR — Key Points
  • OpenClaw works 100% offline once models are downloaded — zero cloud dependency.
  • The minimum viable for smooth use: an NVIDIA GPU with at least 16 GB of VRAM.
  • For 30B+ models, aim for a GPU with 24–32 GB of VRAM or a multi-GPU system.
  • System RAM: at least 32 GB DDR5 for mixed AI + office workloads.
  • Radiance CoreAI workstations are delivered pre-configured with CUDA, Ollama, and the environment ready for OpenClaw.
  • No subscription, no cost per token: your AI belongs to you.


What is OpenClaw and why run it locally?

OpenClaw is an open-source AI agent orchestration framework. It allows you to deploy, chain, and interact with Large Language Models (LLMs) directly from your infrastructure — without using a third-party API, without sending your data to remote servers.

The promise is simple: you retain full control over your AI pipeline. Your documents, your queries, your responses never leave your network. For professionals subject to GDPR, medical secrecy, or legal ethics, this is a fundamental paradigm shift.

But OpenClaw is resource-intensive. It relies on LLM models loaded into GPU memory, and its performance directly depends on your hardware. A consumer PC or laptop won't cut it for long.

💡
Why "local" instead of cloud API? APIs like OpenAI charge per token and transmit your data outside your premises. Locally, you pay for your hardware once, and each query costs only a few cents in electricity — with no outgoing data.


What PC configuration for local OpenClaw?

OpenClaw orchestrates agents that call LLM models for inference. These models dictate the hardware requirements. Here are the thresholds to know:


The GPU: the centerpiece

VRAM (video memory) determines which models you can load and how fast they generate tokens. It's the number one limiting factor.

16 GB VRAM → comfortable 7B–13B models 24–32 GB VRAM → quantized 30B–70B models 48–96 GB VRAM → 70B+ full precision models

NVIDIA GPUs are favored thanks to the CUDA ecosystem, essential for frameworks like llama.cpp, Ollama, or vLLM that power OpenClaw. The RTX 5000 series (Blackwell architecture) currently offers the best performance/VRAM ratio available in workstations.


System RAM

RAM is used for long context, embeddings, and parallel pipelines. Below 32 GB DDR5, you risk bottlenecks on complex workflows.


Storage

LLM models are heavy: from 4 GB for a quantized 7B model to 80 GB+ for a full 70B. Plan for an NVMe SSD of at least 2 TB if you plan to store multiple models simultaneously.

⚠️
Beware of hybrid CPU/GPU configurations. If your model doesn't fit entirely into VRAM, OpenClaw loads part of it into CPU RAM. Generation speed then drops by 80 to 95%. Size your GPU to hold the entire model.


Step-by-step installation guide for OpenClaw on your PC

Here is the complete process to get OpenClaw up and running on Windows 11 or Ubuntu. Radiance CoreAI workstations come with CUDA drivers and Ollama pre-installed, which reduces this process to 3–4 steps.

1
Check NVIDIA and CUDA drivers

Make sure your NVIDIA driver is up-to-date (version ≥ 525) and that CUDA Toolkit is installed. Check with nvidia-smi in the terminal. On Radiance machines, this is already done.

2
Install Ollama (inference backend)

OpenClaw relies on Ollama to load and serve models locally. Install it from ollama.com or via terminal: curl -fsSL https://ollama.com/install.sh | sh. On Windows, the GUI installer is sufficient.

3
Download your first model

From the terminal: ollama pull mistral for a 7B, or ollama pull qwen2.5:32b for a more powerful model. The download is a one-time process.

4
Install OpenClaw and its dependencies

Clone the official OpenClaw repository, install Python dependencies (venv recommended), then configure config.yaml to point to your local Ollama endpoint (http://localhost:11434).

5
Launch your first agent

Start Ollama in the background (ollama serve), then launch OpenClaw. Your first AI agent runs entirely on your machine — no data leaves your network.

6
(Optional) Connect your documents via RAG

Add a RAG layer by indexing your PDFs, Word documents, or local databases. OpenClaw can query your internal documents in natural language — professional secrecy guaranteed.


Hardware comparison: which workstation for which OpenClaw use?

Not all PCs are equal when it comes to OpenClaw. This table summarizes the key criteria according to usage profiles.

Usage Profile Required VRAM System RAM Supported Models Recommended Tier
Light individual use — 1 agent, occasional use 16 GB 16–32 GB 7B–13B
Mistral, LLaMA 3.1
CoreAI 16
Regular professional use — RAG, multi-docs, complex agents 16–24 GB 32 GB Quantized 13B–30B
Qwen 2.5, DeepSeek
CoreAI 32
Multi-user firm — simultaneous inference 32 GB 64 GB Quantized 70B
Mixtral, Qwen 72B
CoreAI 64
24/7 multi-agent production — fine-tuning 64–96 GB (multi-GPU) 128 GB 70B+ full / 200B
Fine-tuning possible
Rack 2×5090 / GB10

Note on quantization: 4-bit or 8-bit GGUF models can run a 30B on 16 GB of VRAM, with a slight loss of quality. For critical professional use, prefer maximum precision — which means properly sizing your GPU.


Radiance machines ready for OpenClaw

Each machine is assembled in Auriol (13), pre-configured and delivered ready to use. CUDA drivers, Ollama, and Python environment are already in place.

🛠️
Delivered ready to use. NVIDIA drivers, CUDA Toolkit, Ollama pre-installed. All you have to do is choose your model and launch OpenClaw. Our team will support you during the first few hours of getting started.


Which LLM models to use with OpenClaw locally?

OpenClaw is compatible with any model served via an OpenAI-compatible API — including Ollama, llama.cpp, LM Studio, or vLLM. Our 2025 selection:

Model Size VRAM required Strengths Target profile
Mistral Small 3.1 24B ~14 GB (Q4) Fast, multilingual, precise instructions General use, assistants
Qwen 2.5 / Qwen3 7B–72B 4–40 GB Excellent in French, strong reasoning Legal, medical, accounting
DeepSeek-R1 7B–70B 4–38 GB Chain-of-thought reasoning, code, analysis R&D, design offices
LLaMA 3.3 70B ~38 GB (Q4) Reference model, versatile All professional uses
Gemma 3 9B–27B 5–16 GB Lightweight, multimodal, performs well on small GPUs Individual practices, medical

For professional use in French, Qwen 2.5 32B and Mistral Small 3.1 offer the best balance between response quality and inference speed on a 16–24 GB GPU.

Local AI vs. Cloud: Decisive Arguments for Professionals

Criterion Local AI (OpenClaw + Radiance) Cloud AI (ChatGPT, Copilot…)
Data Confidentiality ✓ Total — nothing leaves ✗ Data transmitted to provider
GDPR Compliance ✓ Native — zero transfer outside EU ⚠ Varies by DPA contracts
Cost Model One-time investment Subscription + token-based billing
Offline Availability ✓ Works without internet ✗ Requires permanent connection
Customization / Fine-tuning ✓ Total on your data Limited by offer
Professional Secrecy ✓ Respected by design ⚠ Real ethical risk


Radiance Feedback: Our Pro-Tips for OpenClaw

We assemble and deploy AI workstations from Auriol for professionals across Europe. Here's what our field feedback has taught us:


💡 Don't Underestimate NVMe Bandwidth

At startup, OpenClaw loads the model from disk to VRAM. A Gen 4 NVMe (~7,000 MB/s) reduces this loading time by 40 to 70% compared to a SATA SSD. On our CoreAI machines, it's Gen 4 by default.


💡 Limit Context Window if You Lack VRAM

OpenClaw can open long contexts (32K, 128K tokens). Each context token consumes VRAM. On a 16 GB GPU, limit to 8K–16K tokens for fast generation. On a 32 GB GPU, 128K is easily accessible.


💡 Enable Flash Attention if Your GPU Supports It

RTX 5000 (Blackwell) natively supports Flash Attention 3, which reduces the memory footprint of long contexts by 30 to 50%. Enable it in your inference backend configuration.


💡 For a Multi-User Practice: Serve OpenClaw as an Internal API

Instead of installing OpenClaw on each workstation, deploy a server instance on a Radiance rack workstation accessible via your local network. All your collaborators connect locally, data remains on-site, and you share GPU power.

🏆
Our field observation: professionals who switch to local AI via OpenClaw see an ROI in 4 to 8 months compared to equivalent cloud subscriptions in terms of usage volume. And they no longer have to explain to their clients why their data is going abroad.


Frequently Asked Questions about OpenClaw Locally


Is OpenClaw difficult to install?

On a standard machine, installation requires basic knowledge of terminal and Python. On Radiance CoreAI workstations, the environment (CUDA, Ollama, Python) is pre-configured, reducing installation to about ten minutes. Our team can also perform initial remote configuration.


Can I do local fine-tuning with OpenClaw?

OpenClaw is an agent orchestrator, not a fine-tuning tool. To fine-tune a model on your data, you will need a dedicated tool (Axolotl, LLaMA-Factory) and a GPU with at least 24 GB of VRAM for 7B models, or a multi-GPU system for larger ones. Our Rack 2×RTX 5090 and RTX 6000 PRO Blackwell configurations are sized for this use.


Does OpenClaw work on Windows?

Yes, OpenClaw supports Windows via WSL2 or native Python. Our CoreAI machines are delivered with Windows 11 Pro with WSL2 pre-activated, ensuring full compatibility with the open-source AI tools ecosystem.


What is the difference between OpenClaw and Open WebUI?

Open WebUI is a graphical interface for interacting with models via a chat. OpenClaw is an AI agent framework — it automates complex tasks, chains model calls, integrates external tools, and can reason in multiple steps. Both can coexist on the same machine.


My practice has strict GDPR obligations. Is local AI truly compliant?

Yes. As long as personal data processing occurs on your hardware, on your premises, without transfer to a third-party provider, you are compliant with the GDPR's principles of data minimization and sovereignty. Radiance workstations are specifically designed to guarantee this architecture by default.


Your PC for OpenClaw locally, assembled in Auriol

Tell us your use, your profession, your target models. We configure the ideal machine and send it to you ready to use within 4 to 10 working days.

Request a free quote →
Back to blog

More questions?

Send us an email at contact@radiancesystems.eu or contact us via the contact form. We respond to all inquiries within 3 hours during business hours (Monday to Friday, 9am to 5pm).

📞 +33 4 65 84 48 21