PC for Machine Learning Engineer: Training, Fine-tuning, Research

A machine learning engineer has different needs than a user who just wants to chat with a model. Training, fine-tuning, experimenting, processing large datasets: these tasks impose hardware constraints vastly different from simple inference. Buying the wrong machine means losing hours with each experimentation cycle.

This guide starts from the real-world workflows of an ML engineer, deduces what the hardware must provide, and proposes suitable workstations for each type of load, from development workstation to multi-GPU training station.


What an ML engineer really does, and what it demands

Model Training

Requires: VRAM, computation, long-term stability

Training from scratch or continuing pre-training puts the GPU under full load, sometimes for days. VRAM limits model and batch size. Stability becomes critical for long runs.

Fine-tuning (LoRA, QLoRA, full)

Requires: VRAM, iteration speed

The most common case in practice. LoRA and QLoRA reduce requirements, but serious fine-tuning demands 24 GB and more depending on the base model size.

Inference and Evaluation

Requires: VRAM, bandwidth

Testing models, comparing variants, serving a local API. Less resource-intensive than training, but VRAM remains the model size factor.

Data Preparation

Requires: CPU, RAM, fast storage

Cleaning, tokenization, augmentation, loading. This often-underestimated step is limited by the CPU, RAM, and storage speed, not the GPU.

The distinction that changes everything: training and fine-tuning a model is radically more demanding than using it. Inference of a 14 billion parameter model fits on 16 GB. Fine-tuning the same model can demand two to three times more, due to gradients, optimizer states, and activations held in memory. Sizing your machine based on inference when you intend to train is the most costly mistake.


The components that really matter

  • VRAM, above all. It determines the size of models you can train and fine-tune. 24 GB is a comfortable threshold, 32 GB opens up serious models, 96 GB ECC targets research and large models.
  • ECC memory for long runs. On a training run lasting several days, a silent memory error can corrupt an entire run. VRAM ECC (RTX 6000 Blackwell cards) protects critical computations.
  • CPU and RAM for the data pipeline. A powerful GPU starved by slow data loading runs idle. Many cores and generous RAM feed the GPU without a bottleneck.
  • Fast NVMe storage. Large datasets and checkpoints demand high throughput. A Gen 4 or Gen 5 NVMe prevents the drive from becoming the limiting factor.
  • Multi-GPU for scaling. Two cards allow parallel training, processing larger models, or running multiple experiments at once.
The fine-tuning trap. It's often said that a model "fits on 16 GB." This is true for inference, rarely for training. A full fine-tuning of a 7 billion parameter model can exceed 60 GB of VRAM. LoRA and QLoRA techniques significantly reduce this need, but always verify the intended training mode before choosing your card.


Which VRAM for which ML task?

Task Recommended VRAM Typical Card Comment
Learning, prototyping, small models 16 GB RTX 5070 Ti Ideal for starting and developing
LoRA/QLoRA Fine-tuning (up to 14B) 24 to 32 GB RTX 5090 32 GB The industry sweet spot
Multi-experiment training, medium models 2 × 32 GB 2 × RTX 5090 Parallelism, multiple runs
Heavy fine-tuning, large models, long runs 96 GB ECC RTX 6000 Blackwell ECC for reliability
Research, full fine-tuning, R&D 192 GB ECC 2 × RTX 6000 Blackwell The top tier for local use
Prototyping very large models 128 GB unified NVIDIA GB10 Unified memory, compact format


A local environment ready to code

Our workstations are delivered with the standard ML ecosystem pre-configured upon request, so you can code out of the box instead of spending hours setting up CUDA versions.

# Typical environment, pre-configured upon request
# PyTorch with CUDA 12.8 (Blackwell RTX 50xx / RTX 6000 cards)
pip install torch torchvision torchaudio \
  --index-url https://download.pytorch.org/whl/cu128

# Common ML tools
pip install transformers datasets accelerate peft bitsandbytes
pip install jupyterlab scikit-learn pandas

# LoRA fine-tuning ready to use with PEFT + Transformers
The real time-saver: version incompatibilities (CUDA, cuDNN, PyTorch, drivers) waste significant time. Our machines come with a coherent and tested stack — PyTorch, CUDA, Hugging Face libraries, Jupyter — so your first code cell runs without configuration.


Our workstations by ML load type

All our machines are hand-assembled in Auriol (13390), tested before shipping, and delivered across the European Union. Fully configurable, including the GPU.

Development and Prototyping
Radiance CoreAI 32 for machine learning development

Radiance PC CoreAI 32 — RTX 5070 Ti 16 GB

CPU AMD Ryzen 9 9900X (12c)
GPU RTX 5070 Ti 16 GB
RAM DDR5 32 GB
Storage 1 TB NVMe
OS Windows 11 Pro or Ubuntu
Usage Dev, inference, light FT

Ideal for learning, developing, inference, and light LoRA fine-tuning.

The entry-level ML development workstation. 12 cores for the data pipeline, 16 GB of VRAM for inference and prototyping. The right starting point before moving to serious training.

€2,442 starting from

PyTorch + CUDA stack pre-configured upon request

Configure this workstation
Industry Reference — Fine-tuning
Radiance CoreAI 64 RTX 5090 for machine learning fine-tuning

Radiance PC CoreAI 64 — RTX 5090 32 GB

CPU AMD Ryzen 9 9950X3D (16c)
GPU RTX 5090 32 GB
RAM DDR5 64 GB
Storage 1 TB NVMe
Bandwidth 1,792 GB/s
Power Supply 1,200 W 80+ Gold

32 GB VRAM and record bandwidth: the reference machine for LoRA and QLoRA fine-tuning.

The workstation that covers most industry needs. 32 GB for fine-tuning models up to 14 billion parameters, 64 GB RAM for the data pipeline, a 16-core CPU with 3D cache. The best capacity/price ratio for an individual ML engineer.

€6,042 starting from

Full ML stack pre-configured upon request

Configure this workstation
Multi-GPU — Parallelism
Radiance Rack 2x RTX 5090 for multi-GPU training

Radiance CoreAI Rack — 2 × RTX 5090 (64 GB)

CPU AMD Ryzen 9 9950X3D
GPU 2 × RTX 5090 32 GB
Total VRAM 64 GB
RAM DDR5 128 GB
Form Factor 4U Rack
Power Supply 2,000 W Platinum

Two GPUs for parallel training, running multiple experiments, or distributing a large model.

For scaling up. Two RTX 5090s allow for distributed training, processing larger models, or simultaneously launching multiple experimental runs. 128 GB of RAM to feed both cards without bottleneck.

€11,221 starting from

Distributed training, on-site installation possible

Configure this rack
Research & Development — ECC, expandable to 2 TB RAM
Radiance Pro AI Ultra Threadripper for machine learning research

Radiance PC Pro AI Ultra — Threadripper PRO

CPU Threadripper PRO 7955WX
GPU RTX 6000 Blackwell 96 GB
RAM ECC DDR5 128 GB RDIMM
Max RAM up to 2 TB ECC
VRAM 96 GB ECC
Form Factor 4U Rack

96 GB of ECC VRAM and up to 2 TB of RAM: for heavy fine-tuning and research.

The platform for ML engineers pushing boundaries. 96 GB of ECC VRAM for large models and long runs without corruption risk, a Threadripper PRO CPU and RAM expandable to 2 TB for the most demanding data pipelines.

€20,213 starting from

Custom-made, personalized quote, on-site installation

Request a quote
High-end — 192 GB ECC VRAM
Radiance Rack 2x RTX 6000 Blackwell ECC for ML research

CoreAI 128 Rack — 2 × RTX 6000 Blackwell (192 GB ECC)

CPU AMD Ryzen 9 9950X3D
GPU 2 × RTX 6000 96 GB ECC
Total VRAM 192 GB ECC
RAM DDR5 128 GB
Form Factor 4U Rack
Power Supply 2,000 W Platinum

192 GB of ECC VRAM for full fine-tuning, large models, and continuous R&D.

The pinnacle of our range for local ML. 192 GB of ECC VRAM enables full fine-tuning of substantial models, distributed training across two professional cards, and datacenter-level reliability for continuous loads.

€27,980 starting from

R&D, full fine-tuning, on-site installation

Request a quote
Unified Memory — Compact Large Models
NVIDIA GB10 Mini Server for large model ML prototyping

NVIDIA GB10 AI Mini Server — ASUS Ascent GX10

Chip NVIDIA GB10 Grace Blackwell
Memory 128 GB LPDDR5X Unified
AI Power 1 petaFLOP FP4
Form Factor 150×150×51 mm
OS DGX OS (Ubuntu)
Power Consumption approx. 240 W

128 GB of unified memory to prototype very large models, in a desktop format.

A different approach: the 128 GB CPU-GPU unified memory allows loading models that even an RTX 5090 cannot accommodate, in a compact and silent format, with the CUDA and Jupyter environment ready from startup.

€3,999 starting from

DGX OS, ready-to-use ML environment

Discover this server
Everything is fully configurable. Each workstation is customized from the ground up: graphics card, processor, RAM, storage, power supply, cooling, and case. You can adjust a configuration directly from the online configurator on each product page, or contact us for a custom quote. Do you have specific VRAM needs, a dual card setup, extended RAM, or a particular platform? We adapt the machine exactly to your workload. Write to us at contact@radiancesystems.eu or via the website's quote form.


Why a local workstation rather than the cloud

Cloud GPU has its place, but for an ML engineer working daily, a local workstation offers concrete advantages.

  • Controlled cost. GPU hours in the cloud accumulate quickly. A local workstation is a one-time investment, paid off in a few months of intensive use.
  • Instant iteration. No provisioning, no instance waiting, no data transfer. You launch your experiments immediately.
  • Private data. Your datasets and proprietary models remain with you, without passing through a third-party provider.
  • Total availability. No GPU quota, no instance shortage, no downtime. Your machine is always there.
  • Stable environment. Your software stack doesn't change from one session to the next.
A hybrid approach works well: a local workstation for development, prototyping, and most fine-tunings, complemented by occasional cloud use for peak loads or very large training runs. You maintain control over costs and your data, while having access to more power when it's occasionally needed.


In brief

How much VRAM for an ML engineer?
16 GB for development and prototyping, 24 to 32 GB for LoRA/QLoRA fine-tuning, 96 GB ECC and more for full fine-tuning and research.

Is ECC memory necessary?
For long training runs of several days, yes: ECC protects against silent memory errors that can corrupt a run. RTX 6000 Blackwell cards are equipped with it.

Does the CPU matter for ML?
Yes, for data preparation. A powerful GPU that is poorly fed will run idle. Many cores and generous RAM prevent this bottleneck.

Does fine-tuning require more than inference?
Significantly more. Due to gradients and optimizer states, fine-tuning can require two to three times the VRAM of inference for the same model.

Local or cloud?
Local is more economical and faster for daily use. A hybrid approach, local plus occasional cloud, is often the most relevant.

Are the machines ready to code?
Yes, upon request: PyTorch, CUDA, Hugging Face libraries, and Jupyter pre-configured, so you can code right out of the box.

Can the configuration be customized?
Yes, completely. Graphics card, CPU, RAM, storage, power supply, cooling, and case are configurable on each product page via the online configurator. For specific needs or a custom configuration, contact us at contact@radiancesystems.eu or via the quote form: we adapt the machine to your exact workload.

Back to blog

Your quote for a custom AI solution within 24–48 hours

Every Radiance project begins with a conversation. Fill out this form and an expert will get back to you shortly with a solution tailored to your business and budget.

Response within 24–48 business hours
Delivery throughout Europe (EU)
2-year warranty included
On-site installation available
No commitment on demand
Dedicated support before and after purchase
01 What is your primary use for AI?
Multiple choice.
02 In what context will the system be used?
Single choice.
03 What type of system are you looking for?
Single choice.
04 Which operating system do you prefer?
Single choice.
05 What are your expectations for the software?
Multiple choice.
06 What is your indicative budget?
Single choice.
07 When would you like to receive your system?
Single choice.
08 Would you like help with implementation?
One choice. A Radiance technician can assist you at your home or remotely.
09 Country of delivery (EU only) *
We only deliver within the European Union (EU).
10 Additional information (optional but very useful)
Briefly describe your project, any specific constraints, or any other relevant information.
11 Would you like to be contacted to discuss your project?
If you choose "Quote only", you can reply to our email to ask your questions and refine the quote.
12 Email *
We will send you the quote to this address.

More questions?

Send us an email at contact@radiancesystems.eu or contact us via the contact form. We respond to all inquiries within 3 hours during business hours (Monday to Friday, 9am to 5pm).

📞 +33 4 65 84 48 21