Troubleshooting

Explore resolutions for common error states and setup issues.

GPU Detection Failures

If the CLI fails to detect your accelerator (CUDA or Metal), run the capability diagnostic checks:

macOS

Ensure you have the Xcode command-line tools active:

xcode-select -p

Linux & Windows

Verify that your drivers are up-to-date and the CUDA runtime is accessible:

nvidia-smi

Port Conflicts

If the API fails to start, it's typically because port 8080 (default) is already in use. You can pass a custom port during deployment:

bloc deploy arnav/qwen-3.5-9b-super --port 9000

CPU Fallback Bottlenecks

Running models on CPU is slower. If you are forced onto CPU fallback:

Ensure your processor supports AVX2 instructions.
Free up system RAM by closing resource-heavy background processes.
Select recipes using smaller quantizations (e.g. Q4_K_M) to reduce memory pressure.

Inspecting Persistent Engine Logs

If a model fails to start or crashes during execution, the CLI automatically saves structured, persistent logs for the engine process. This makes it easy to diagnose obscure errors or out-of-memory crashes that occur inside the engine container.

You can find all engine logs saved on your local machine at:

~/.cache/bloc/logs/