Back to registry
CUDA8GB VRAM

arnav080/qwen3.6-35b-moe-4060ti-above_spec

Optimized Qwen 3.6 35B MoE config for 8GB GPUs, reaching 200k context via CPU expert offload and q8_0 KV cache.

Configuration Specifications

Base Modelhuggingface:Qwen/Qwen3.6-35B-A3B-Instruct
Enginellama.cpp
QuantizationQ4_K_S
Platformcuda
VRAM Required8GB minimum

Telemetry Benchmarks

No runs recorded yet

Be the first to benchmark this recipe! Run the CLI command from your terminal:

bloc run arnav080/qwen3.6-35b-moe-4060ti-above_spec
qwen3.6-35b-moe-4060ti-above_spec.yaml
Loading workspace editor...