Local path

Local AI setup

Choose the runtime, model format, memory plan, and privacy tradeoffs before you run models on your own machine.

Decision checkpoints

Start with your RAM or VRAM budget, then choose model size and quantization.

Prefer a runtime that fits your operating system, GPU support, and workflow.

Use cloud GPU when context length, concurrency, or speed makes local runs painful.