CUDA8GB VRAM

arnav080/qwen3.6-35b-moe-4060ti-iq4-above_spec

Name: arnav080/qwen3.6-35b-moe-4060ti-iq4-above_spec
Author: arnav080

Qwen 3.6 35B MoE on RTX 4060 Ti 8GB using the ik_llama.cpp fork for high-quality IQ4_K_R4 quantization and 262k context.

Configuration Specifications

Base Model	huggingface:Qwen/Qwen3.6-35B-A3B-Instruct
Engine	llama.cpp
Quantization	IQ4_K_R4
Platform	cuda
VRAM Required	8GB minimum

No runs recorded yet

Be the first to benchmark this recipe! Run the CLI command from your terminal:

bloc run arnav080/qwen3.6-35b-moe-4060ti-iq4-above_spec

qwen3.6-35b-moe-4060ti-iq4-above_spec.yaml

Loading workspace editor...