Local AI PC Configuration in 2026: Complete Guide and Comparison

Why run AI locally?

In 2026, artificial intelligence is no longer reserved for data centers. Thanks to new GPU architectures and optimized open-source models (Llama 4, Mistral, DeepSeek, Qwen…), it is now possible to run powerful LLMs directly on your own machine, without sending any data to the cloud.

For professionals subject to GDPR: lawyers, doctors, accountants, notaries, design offices: this is a revolution: you get powerful AI without ever exposing your client files to third parties.

Total privacy: your data stays on your local network, with no sending to external servers.
No recurring subscription: once the machine is purchased, inference costs are zero.
Low latency: responses are immediate, without depending on your connection quality.
Offline operation: useful on-site, on the go, or in case of network failure.
Full customization: fine-tuning, RAG, agents: you control the entire environment.

Good to know: All Radiance Systems machines are assembled, tested, and optimized in France (Auriol, 13), with a 2-year warranty and dedicated technical support responding within 3 hours.

Key criteria for an AI PC setup

GPU VRAM: the number one factor

For local AI, video memory (VRAM) directly determines the size of the models you can run. A 7-billion-parameter LLM in 4-bit requires about 4–5 GB of VRAM; a 70B model in 4-bit needs 35–40 GB. The more VRAM you have, the larger models you can load or the more models you can run simultaneously.

Good news: all our workstations are fully configurable, including the graphics card. The CoreAI 64, for example, offers consumer GPUs (RTX 5070 Ti to RTX 5090) but also professional GPUs up to 96 GB VRAM (RTX 6000 Blackwell, L40S, H100…). If you have a specific GPU memory need, you can build exactly the machine you need from the configurator or contact us for a custom quote.

Model size	Quantization	VRAM required	Compatible with
7B (e.g. Mistral 7B)	4-bit (GGUF)	~4–5 GB	RTX 5070 Ti, RTX 5090, GB10
13–14B	4-bit	~8–10 GB	RTX 5070 Ti, RTX 5090, GB10
32–34B	4-bit	~18–22 GB	RTX 5090 (32 GB), RTX 6000 Blackwell (96 GB), GB10 (128 GB)
70B (e.g. Llama 4 Scout)	4-bit	~35–40 GB	RTX 5090 (partial), L40S (48 GB) ✅, GB10 ✅
70B full precision / 2×70B	8-bit or FP16	60–96 GB	RTX 6000 Blackwell (96 GB) ✅, GB10 ✅
200B+ models / heavy multimodal	4–8-bit	100–128 GB+	GB10 (128 GB unified) ✅, H100 NVL (94 GB) ✅

GPU Configurator: the CoreAI 64 lets you choose from over 15 GPUs, from the consumer RTX 5070 Ti up to the H200 141 GB. You’re not limited to the displayed configurations: contact us for a quote tailored to your exact VRAM needs.

System RAM, processor, and storage

System RAM is used for complex pipelines (RAG, multi-agent orchestration, processing large documents). 32 GB is a serious minimum; 64 GB and more become necessary as you multiply parallel tasks. On the CPU side, AMD Ryzen 9 series on AM5 offer the excellent balance of compute and memory bandwidth that AI frameworks need. Finally, a fast NVMe SSD speeds up the initial loading of model weights: count on at least 1 TB for comfortable use.

Comparison of the 3 Radiance Systems configurations

Model	GPU / AI Memory	CPU	RAM	Base price	Ideal for
CoreAI 32 RTX 5070 Ti Pro Entry	RTX 5070 Ti: 16 GB VRAM	Ryzen 9 9900X (12c)	32 GB DDR5 (up to 256 GB)	2 442 €	LLM up to 13B, image generation, AI development, multimedia
CoreAI 64 RTX 5090 High-end	RTX 5090: 32 GB VRAM	Ryzen 9 9950X3D (16c)	64 GB DDR5 (up to 256 GB)	6 042 €	LLM 70B, fine-tuning, professional 3D rendering, intensive AI pipelines
ASUS Ascent GB10 Mini Server	NVIDIA GB10: 128 GB LPDDR5X unified	Grace (ARM, 20 cores)	128 GB unified CPU+GPU	3 999 €	Dedicated AI server, 70B+ models, multi-user inference, on-site deployment

Radiance PC CoreAI 32: RTX 5070 Ti

The Radiance PC CoreAI 32 is the entry-level professional AI workstation. It’s the ideal setup to seriously start with local AI without breaking your budget, while keeping a fully upgradeable machine.

Base configuration

GPU: NVIDIA GeForce RTX 5070 Ti: 16 GB VRAM (Blackwell architecture)
CPU: AMD Ryzen 9 9900X: 12 cores / 24 threads, socket AM5
RAM: 32 GB DDR5 5600 MHz: upgradeable up to 256 GB
Storage: 1 TB NVMe (up to 3,500 MB/s)
Power Supply: MSI 850W 80+ Gold PCIe 5
OS: Windows 11 Professional (license included)
WiFi 6E + Bluetooth included depending on the chosen motherboard

What you can run

With 16 GB of VRAM, the CoreAI 32 comfortably handles LLMs up to 13B at full quality, 7B models in long context, image generation with FLUX, Stable Diffusion XL, as well as light multimodal processing pipelines. This is the machine typically used by lawyers, SMEs, and freelancers who want efficient local AI without server constraints.

⚠️ Limit to know: models of 30B and above exceed VRAM and will partially use system RAM, which significantly slows down inference. For these uses, prefer the CoreAI 64 or the GB10.

Radiance PC CoreAI 64: RTX 5090

The Radiance PC CoreAI 64 is the high-end AI workstation in the range. With the RTX 5090 and its 32 GB of VRAM, it can run current generation LLMs at full throttle, including 70B models in aggressive quantization. And if your needs exceed consumer level, the configurator also offers professional GPUs: L40S 48 GB, RTX 5000 Blackwell 48 GB, RTX 6000 Blackwell 96 GB, or even H100 NVL for the most intensive workloads.

Base configuration

GPU: NVIDIA GeForce RTX 5090: 32 GB VRAM (the most powerful consumer GPU in 2026)
CPU: AMD Ryzen 9 9950X3D: 16 cores / 32 threads + 3D V-Cache
RAM: 64 GB DDR5 6000 MHz: upgradeable up to 256 GB
Storage: 1 TB NVMe (up to 3,500 MB/s): easily expandable
Power Supply: Deepcool 1200W 80+ Gold
OS: Windows 11 Professional (license included)
WiFi 7 + Bluetooth included (MSI X870E)

What you can run

The RTX 5090 32 GB is the GPU reference for local AI on consumer workstations. It can run Llama 4 Scout (109B MoE with 17B activation), Qwen 2.5 72B in 4-bit, distilled DeepSeek-R2, as well as heavy multimodal pipelines combining vision + text. It is also the reference configuration for light fine-tuning (LoRA, QLoRA) on enterprise datasets.

To go further, the configurator offers professional GPUs: the RTX 6000 Blackwell 96 GB can run 70B models in full precision or multiple models in parallel; the L40S 48 GB is optimized for server inference; and up to the H100 NVL 94 GB for the most demanding needs. VRAM is not a fixed ceiling: it is configurable.

Fully configurable GPU: the CoreAI 64 offers more than 15 GPU options in its configurator, from the RTX 5070 Ti up to the H200 141 GB. You can also add a second graphics card (multi-GPU). VRAM is not a fixed constraint: it adapts to your usage. Contact us for a custom quote.

NVIDIA GB10 Mini AI Server: ASUS Ascent GX10

The ASUS Ascent GX10 (NVIDIA GB10 Grace Blackwell) is in a category of its own. It’s not a traditional workstation: it’s an ultra-compact mini AI server (150×150×51 mm), designed exclusively for AI workloads: local inference, fine-tuning, RAG, autonomous agents, multi-user deployment.

Unified architecture: the GB10 advantage

The NVIDIA GB10 chip combines ARM CPU (Grace, 20 cores) and Blackwell GPU in a unified memory architecture with 128 GB LPDDR5X. Unlike a traditional graphics card (separate VRAM), this unified memory is equally accessible by both CPU and GPU, without costly transfers between them. Result: you can run 70B models in full precision or multiple models simultaneously, with exceptional memory bandwidth.

1 petaFLOP of AI power (INT8): for reference, the best gaming GPUs peak at ~0.35 PFLOPS INT8
128 GB of unified memory: running Llama 4 Maverick (400B MoE) is feasible
Pre-installed DGX OS: CUDA, PyTorch, TensorFlow, Jupyter: everything is ready at startup
Professional network connectivity: 10G LAN + ConnectX-7 (2×200G QSFP)
Built-in Wi-Fi 7 + Bluetooth 5
Ultra-compact desktop form factor: 1.48 kg, usable on any desk or in a rack

Who is it for?

The GB10 is designed for organizations that want a dedicated AI server, separate from workstations: a medical office that wants AI accessible to the entire team via the local network, a design office deploying a RAG assistant on its internal documents, or an AI developer who wants a native Linux environment optimized for model training.

⚠️ Important: the ASUS Ascent GX10 runs on DGX OS (Linux Ubuntu). It is not designed for Windows office use. If you need a versatile machine (word processing, spreadsheets, daily web browsing), a CoreAI workstation is more suitable.

Configure your AI machine

Each configuration is fully customizable via our online configurator. All options are compatible and verified by our technicians before shipping.

Pro Entry

CoreAI 32
RTX 5070 Ti

2 442 € from

GPU RTX 5070 Ti 16 GB
CPU Ryzen 9 9900X
RAM 32 GB DDR5 → 256 GB
SSD 1 TB NVMe

Configure & Order →

High-end

CoreAI 64
RTX 5090

6 042 € from

GPU RTX 5090 32 GB
CPU Ryzen 9 9950X3D
RAM 64 GB DDR5 → 256 GB
SSD 1 TB NVMe

Configure & Order →

Mini AI Server

ASUS Ascent
NVIDIA GB10

3 999 € from

GPU NVIDIA GB10 Blackwell
Memory 128 GB unified LPDDR5X
AI Perf 1 petaFLOP INT8
OS DGX OS (Linux)

Configure & Order →

Which configuration to choose based on your usage?

The right choice depends on three factors: the size of the models you want to use, your work environment (individual workstation vs. shared server), and your software stack (Windows or native Linux).

⚖️

Lawyer / Notary

GDPR assistant on confidential documents, contract analysis. CoreAI 32 or GB10 depending on whether you want a machine or a shared server in the office.

🏥

Doctor / Medical practice

Assistance with writing, reports, medical image analysis. GB10 recommended for multi-station use on a local network.

💻

AI Developer

Fine-tuning LoRA, RAG, autonomous agents. CoreAI 64 RTX 5090 for max GPU power on Windows; GB10 for a native CUDA Linux environment.

🏢

SMEs / Mid-sized companies

Deploying a shared internal LLM. GB10 as central server + workstations for key collaborators.

🎨

Creation & 3D

Generative AI (images, video), 3D rendering, 4K editing. CoreAI 32 to start, CoreAI 64 for the most demanding projects.

📊

Data Science

Training custom models, processing large datasets. CoreAI 64 or GB10 depending on workload size.

Need help choosing your configuration?

Our team responds within 3 hours and offers you a quote tailored to your usage, budget, and existing infrastructure. Pickup available on site in Auriol (13).

Get a free quote → contact@radiancesystems.eu

Frequently Asked Questions

A workstation (CoreAI 32/64) is a complete PC with Windows, monitor, keyboard: usable for all your daily tasks in addition to AI. The GB10 mini-server is a dedicated device, without a standard graphical interface, running Linux DGX OS. It is designed to be installed on a network and accessed via API or SSH. It’s the right choice if you want AI accessible to multiple people on your local network without tying up a full desktop PC.

Yes. The CoreAI 32 (16 GB VRAM) handles Llama 4 Scout in distilled version or Mistral Small at full precision very well. The CoreAI 64 (32 GB VRAM) supports up to quantized 70B versions. The GB10, with 128 GB of unified memory, can run the largest open-source models available, including Llama 4 Maverick. Tools like LM Studio, Ollama, or llama.cpp work perfectly on Windows; the GB10 natively includes CUDA and Jupyter tools.

Yes. Local AI means your data never leaves your infrastructure. No network calls to third-party servers occur during inference. Combined with a secure local network and our NAS + UPS options available for the GB10, these machines provide a fully GDPR-compliant AI infrastructure for regulated professions.

CoreAI workstations are generally assembled and shipped within 5 to 10 business days depending on component availability. The ASUS Ascent GX10 (GB10) is shipped after preparation and testing by our technicians within 5 to 7 business days. You can also pick up your order directly from us in Auriol (13) by appointment.

Yes. For the GB10 mini-server, we offer on-site installation in France (€790) or Europe (€1,490), including networking, AI environment setup, and user training. "Turnkey AI" (€499) and "advanced commissioning" (€249) packages are also available in the configurator.

All our machines include a 2-year Radiance Systems warranty (quick exchange or repair). Warranty extensions are available: +1 year (3 years total, €499) or +3 years (5 years total, €999) with coverage of return shipping and express diagnostics within 12 business hours.

🔒

100% local & GDPR compliant

No data sent to the cloud

🇫🇷

Assembled in France

Auriol (13), Bouches-du-Rhône

⚡

Response in < 3h

Mon–Fri 9am–5pm

🛠️

2-year warranty

Quick exchange or repair

🚀

Ready to use

Premium assembly included

Country/region

Language

Why run AI locally?

Key criteria for an AI PC setup

GPU VRAM: the number one factor

System RAM, processor, and storage

Comparison of the 3 Radiance Systems configurations

Radiance PC CoreAI 32: RTX 5070 Ti

Base configuration

What you can run

Radiance PC CoreAI 64: RTX 5090

Base configuration

What you can run

NVIDIA GB10 Mini AI Server: ASUS Ascent GX10

Unified architecture: the GB10 advantage

Who is it for?

Configure your AI machine

CoreAI 32RTX 5070 Ti

CoreAI 64RTX 5090

ASUS AscentNVIDIA GB10

Which configuration to choose based on your usage?

Lawyer / Notary

Doctor / Medical practice

AI Developer

SMEs / Mid-sized companies

Creation & 3D

Data Science

Need help choosing your configuration?

Frequently Asked Questions

Any more questions?

Other articles

Discover our range of Gaming PCs

CoreAI 32
RTX 5070 Ti

CoreAI 64
RTX 5090

ASUS Ascent
NVIDIA GB10