PC for machine learning engineer: training, fine-tuning, research

June 14, 2026

A machine learning engineer does not have the same needs as a user who just wants to chat with a model. Training, fine-tuning, experimenting, processing large datasets: these tasks impose very different hardware constraints than simple inference. Buying the wrong machine means losing hours at every experimentation cycle.

This guide starts from the real workflows of an ML engineer, deduces what hardware must provide, and proposes stations suited to each type of workload, from development workstations to multi-GPU training stations.

What an ML engineer really does, and what it requires

Model training

Requires: VRAM, computation, long-term stability

Training from scratch or continuing pre-training fully loads the GPU, sometimes for days. VRAM limits model and batch size. Stability becomes critical on long runs.

Fine-tuning (LoRA, QLoRA, full)

Requires: VRAM, iteration speed

The most common case in practice. LoRA and QLoRA reduce requirements, but serious fine-tuning demands 24 GB and more depending on the base model size.

Inference and evaluation

Requires: VRAM, bandwidth

Testing models, comparing variants, serving a local API. Less demanding than training, but VRAM remains the model size factor.

Data preparation

Requires: CPU, RAM, fast storage

Cleaning, tokenization, augmentation, loading. This often underestimated step is limited by the CPU, RAM, and storage speed, not the GPU.

The distinction that changes everything: training and fine-tuning a model is radically more demanding than using it. Inference of a 14-billion-parameter model fits on 16 GB. Fine-tuning the same model can require two to three times more, due to gradients, optimizer states, and activations kept in memory. Sizing your machine for inference when you want to train is the most costly mistake.

The components that really matter

VRAM, above all. It determines the size of the models you can train and fine-tune. 24 GB is a comfort threshold, 32 GB opens up serious models, 96 GB ECC targets research and large models.
ECC memory for long runs. During training that lasts several days, a silent memory error can corrupt an entire run. ECC VRAM (RTX 6000 Blackwell cards) protects critical computations.
CPU and RAM for the data pipeline. A powerful GPU starved by slow data loading idles. Many cores and ample RAM feed the GPU without bottlenecks.
Fast NVMe storage. Large datasets and checkpoints require high throughput. A Gen 4 or Gen 5 NVMe prevents the disk from becoming a bottleneck.
Multi-GPU for scaling up. Two cards allow parallel training, handling larger models, or running multiple experiments simultaneously.

The fine-tuning trap. You often read that a model "fits in 16 GB." That’s true for inference, rarely for training. Full fine-tuning of a 7-billion-parameter model can exceed 60 GB of VRAM. LoRA and QLoRA techniques greatly reduce this need, but always check the intended training mode before choosing your card.

Which VRAM for which ML task?

Task	Recommended VRAM	Typical card	Comment
Learning, prototyping, small models	16 GB	RTX 5070 Ti	Ideal for starting and developing
LoRA/QLoRA fine-tuning (up to 14B)	24 to 32 GB	RTX 5090 32 GB	The industry sweet spot
Multi-experiment training, medium models	2 × 32 Go	2 × RTX 5090	Parallelism, multiple runs
Heavy fine-tuning, large models, long runs	96 GB ECC	RTX 6000 Blackwell	ECC for reliability
Research, full fine-tuning, R&D	192 GB ECC	2 × RTX 6000 Blackwell	Top-tier local setup
Prototyping very large models	128 GB unified	NVIDIA GB10	Unified memory, compact form factor

A local environment ready to code

Our workstations come with the standard ML ecosystem preconfigured on request, so you can start coding right out of the box instead of spending hours fixing CUDA version issues.

# Typical environment, preconfigured on request
# PyTorch with CUDA 12.8 (Blackwell RTX 50xx / RTX 6000 cards)
pip install torch torchvision torchaudio \
  --index-url https://download.pytorch.org/whl/cu128

# Common ML tools
pip install transformers datasets accelerate peft bitsandbytes
pip install jupyterlab scikit-learn pandas

# Ready-to-use LoRA fine-tuning with PEFT + Transformers

Real-time time savings: version incompatibilities (CUDA, cuDNN, PyTorch, drivers) waste a considerable amount of time. Our machines come with a coherent and tested stack — PyTorch, CUDA, Hugging Face libraries, Jupyter — so your first code cell runs without any setup.

Our workstations by ML workload type

All our machines are hand-assembled in Auriol (13390), tested before shipping, and delivered throughout the European Union. Fully configurable, including the GPU.

Development and prototyping

Radiance CoreAI 32 for machine learning development

Radiance PC CoreAI 32 — RTX 5070 Ti 16 GB

CPU AMD Ryzen 9 9900X (12c)

GPU RTX 5070 Ti 16 GB

RAM DDR5 32 GB

Storage NVMe 1 TB

OS Windows 11 Pro or Ubuntu

Use Dev, inference, small FT

Ideal for learning, developing, inference, and light LoRA fine-tuning.

Entry-level ML development workstation. 12 cores for the data pipeline, 16 GB VRAM for inference and prototyping. The right starting point before serious training.

2 442 € starting from

PyTorch + CUDA stack preconfigured on request

Set up this workstation

Industry reference — fine-tuning

Radiance CoreAI 64 RTX 5090 for machine learning fine-tuning

Radiance PC CoreAI 64 — RTX 5090 32 GB

CPU AMD Ryzen 9 9950X3D (16c)

GPU RTX 5090 32 GB

RAM DDR5 64 GB

Storage NVMe 1 TB

Bandwidth 1,792 GB/s

Power supply 1,200 W 80+ Gold

32 GB VRAM and record bandwidth: the reference machine for LoRA and QLoRA fine-tuning.

The workstation covering the essentials of the job. 32 GB to fine-tune models up to 14 billion parameters, 64 GB RAM for the data pipeline, a 16-core CPU with 3D cache. The best capacity/price ratio for an individual ML engineer.

6 042 € starting from

Full ML stack preconfigured on request

Set up this workstation

Multi-GPU — parallelism

Radiance Rack 2x RTX 5090 for multi-GPU training

Radiance CoreAI Rack — 2 × RTX 5090 (64 GB)

CPU AMD Ryzen 9 9950X3D

GPU 2 × RTX 5090 32 GB

Total VRAM 64 GB

RAM DDR5 128 GB

Form factor 4U Rack

Power supply 2,000 W Platinum

Two GPUs for parallel training, running multiple experiments, or splitting a large model.

For scaling up. Two RTX 5090s enable distributed training, handling larger models, or running multiple experiments simultaneously. 128 GB of RAM to feed both cards without bottlenecks.

11 221 € starting from

Distributed training, on-site installation available

Configure this rack

Research and R&D — ECC, expandable to 2 TB RAM

Radiance Pro AI Ultra Threadripper for machine learning research

Radiance PC Pro AI Ultra — Threadripper PRO

CPU Threadripper PRO 7955WX

GPU RTX 6000 Blackwell 96 GB

RAM ECC DDR5 128 GB RDIMM

Max RAM up to 2 TB ECC

VRAM 96 GB ECC

Form factor 4U Rack

96 GB of ECC VRAM and up to 2 TB of RAM: for heavy fine-tuning and research.

The platform for ML engineers pushing boundaries. 96 GB ECC VRAM for large models and long runs without corruption risk, a Threadripper PRO CPU, and RAM expandable up to 2 TB for the most demanding data pipelines.

20 213 € starting from

Custom, personalized quote, on-site installation

Request a quote

High-end — 192 GB ECC VRAM

Radiance Rack 2x RTX 6000 Blackwell ECC for ML research

CoreAI 128 Rack — 2 × RTX 6000 Blackwell (192 GB ECC)

CPU AMD Ryzen 9 9950X3D

GPU 2 × RTX 6000 96 GB ECC

Total VRAM 192 GB ECC

RAM DDR5 128 GB

Form factor 4U Rack

Power supply 2,000 W Platinum

192 GB ECC VRAM for full fine-tuning, large models, and continuous R&D.

The pinnacle of our range for local ML. 192 GB ECC VRAM enables full fine-tuning of substantial models, distributed training on two professional cards, and datacenter-level reliability for continuous workloads.

27 980 € starting from

R&D, full fine-tuning, on-site installation

Request a quote

Unified memory — compact large models

NVIDIA GB10 mini server for large model ML prototyping

NVIDIA GB10 Mini AI Server — ASUS Ascent GX10

Chip NVIDIA GB10 Grace Blackwell

Memory 128 GB unified LPDDR5X

AI power 1 petaFLOP FP4

Dimensions 150×150×51 mm

OS DGX OS (Ubuntu)

Power consumption about 240 W

128 GB unified memory to prototype very large models, in a desktop format.

A different approach: 128 GB unified CPU-GPU memory allows loading models that even an RTX 5090 cannot handle, in a compact and silent format, with CUDA and Jupyter environment ready at startup.

3 999 € starting from

DGX OS, ready-to-use ML environment

Discover this server

Everything is fully customizable. Each workstation can be personalized from top to bottom: graphics card, processor, amount of RAM, storage, power supply, cooling, case. You can adjust a configuration directly from the online configurator on each product page, or contact us for a custom quote. Need specific VRAM, dual cards, extended RAM, a particular platform? We tailor the machine exactly to your workload. Write to us at contact@radiancesystems.eu or via the quote form on the site.

Why choose a local workstation over the cloud

GPU cloud has its place, but for an ML engineer working daily, a local workstation offers concrete advantages.

Controlled cost. GPU hours in the cloud add up quickly. A local workstation is a one-time investment, paid off in a few months of intensive use.
Instant iteration. No provisioning, no instance waiting, no data transfer. You start your experiments immediately.
Private data. Your datasets and proprietary models stay with you, without passing through a third-party provider.
Full availability. No GPU quotas, no instance shortages, no downtime. Your machine is always there.
Stable environment. Your software stack doesn’t change between sessions.

A hybrid approach works well: a local workstation for development, prototyping, and most fine-tuning, complemented by occasional cloud for peak loads or very large training sessions. You keep control of costs and your data while having access to more power when needed.

In short

What VRAM is needed for an ML engineer?
16 GB for development and prototyping, 24 to 32 GB for LoRA/QLoRA fine-tuning, 96 GB ECC and more for full fine-tuning and research.

Is ECC memory necessary?
For long training sessions lasting several days, yes: ECC protects against silent memory errors that can corrupt a run. RTX 6000 Blackwell cards are equipped with it.

Does the CPU matter for ML?
Yes, for data preparation. A powerful GPU poorly fed runs idle. Many cores and generous RAM avoid this bottleneck.

Does fine-tuning require more than inference?
Definitely. Due to gradients and optimizer states, fine-tuning can require two to three times the VRAM of inference for the same model.

Local or cloud?
Local is more economical and faster for daily use. A hybrid approach, local plus occasional cloud, is often the most relevant.

Are the machines ready to code?
Yes, on request: PyTorch, CUDA, Hugging Face libraries, and Jupyter preconfigured, so you can start coding right out of the box.

Can the configuration be customized?
Yes, completely. The graphics card, CPU, RAM, storage, power supply, cooling, and case are configurable on each product page via the online configurator. For a specific need or custom configuration, contact us at contact@radiancesystems.eu or via the quote form: we tailor the machine to your exact workload.

Back to the blog

Country/region

Language

What an ML engineer really does, and what it requires

Model training

Fine-tuning (LoRA, QLoRA, full)

Inference and evaluation

Data preparation

The components that really matter

Which VRAM for which ML task?

A local environment ready to code

Our workstations by ML workload type

Radiance PC CoreAI 32 — RTX 5070 Ti 16 GB

Radiance PC CoreAI 64 — RTX 5090 32 GB

Radiance CoreAI Rack — 2 × RTX 5090 (64 GB)

Radiance PC Pro AI Ultra — Threadripper PRO

CoreAI 128 Rack — 2 × RTX 6000 Blackwell (192 GB ECC)

NVIDIA GB10 Mini AI Server — ASUS Ascent GX10

Why choose a local workstation over the cloud

In short

Discover our range of Local AI PCs

Your quote for a custom AI solution within 24–48 hours

Any more questions?

Other articles