Question 1

Is WillMyGPURunIt free?

Accepted Answer

Yes, completely free. Enter your parts and get every result with no account or sign-up.

Question 2

How much VRAM do I need to run AI locally?

Accepted Answer

Roughly 0.6 GB of VRAM per billion parameters at 4-bit, plus overhead, so an 8B model needs about 8 GB, a 14B around 12-16 GB, and a 32B needs 24 GB. The calculator shows exactly what your card runs.

Question 3

What is a CPU or GPU bottleneck?

Accepted Answer

A bottleneck is when one part holds the other back, for example a CPU too slow to keep a fast GPU fed with frames. For gaming a GPU being the limiter is the healthy state; for local AI the limiter is almost always VRAM capacity.

Question 4

How accurate are the numbers?

Accepted Answer

They're estimates built from published benchmark data (PassMark G3D, single-thread ratings) and real GPU specs (VRAM, bandwidth, board power). They're a reliable guide for comparing builds, not a guarantee of exact frame rates or speeds.

Question 5

What does tokens per second mean?

Accepted Answer

It's how fast a model writes. A token is about ¾ of a word, so ~40 tokens/sec already outpaces your reading speed. We estimate it from your GPU's memory bandwidth.

Your Build Benchmarked

Benchmark Your Build

What WillMyGPURunIt Checks

Local AI models + tokens/sec

AI & gaming bottlenecks

Power supply wattage

Part compatibility

Games you can run

1-100 build scores

New to Local AI? Start Here

Why Run AI Locally — and Why Not

How Much VRAM Do You Need to Run an LLM?

Best GPUs for Local LLMs

What Is CUDA? And Why It Matters for Local AI

Deciding Between Two Builds?

Frequently Asked Questions

Is WillMyGPURunIt free?

How much VRAM do I need to run AI locally?

What is a CPU or GPU bottleneck?

How accurate are the numbers?

What does tokens per second mean?