CUDA | 192GB VRAM
arnav080/deepseek-v4-flash
DeepSeek V4-Flash MoE (284B) production profile tuned for 2x RTX 6000 GPUs
Configuration Specifications
| Setting | Value |
| --- | --- |
| Base Model | huggingface:deepseek-ai/DeepSeek-V4-Flash |
| Engine | llama.cpp |
| Quantization | Q4_K_M |
| Platform | cuda |
| VRAM Required | 192GB minimum |
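As a rough sanity check on the 192GB figure, the sketch below estimates the size of the quantized weights from the parameter count. It assumes an average of about 4.85 bits per weight for Q4_K_M, which is only an approximation (the real average depends on the model's tensor mix), and the helper name `estimate_weight_gb` is made up for illustration. It is not how the registry derives the requirement.

```python
def estimate_weight_gb(params_billion: float, bits_per_weight: float) -> float:
    """Approximate size of quantized weights in GB (1 GB = 1e9 bytes)."""
    # params_billion * 1e9 weights * bits / 8 bytes, divided by 1e9 bytes per GB
    return params_billion * bits_per_weight / 8


# Assumption: ~4.85 bits per weight is a rough average for Q4_K_M;
# the actual value varies with the model's tensor mix.
ASSUMED_Q4_K_M_BPW = 4.85

weights_gb = estimate_weight_gb(284, ASSUMED_Q4_K_M_BPW)
headroom_gb = 192 - weights_gb  # left over for KV cache, activations, buffers

print(f"weights ~{weights_gb:.1f} GB, headroom ~{headroom_gb:.1f} GB")
```

Under that assumption the weights alone take up most of the 192GB, so the remaining headroom for KV cache and compute buffers is small, which may be why the requirement is listed as a minimum.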
Telemetry Benchmarks
No runs recorded yet
Be the first to benchmark this recipe! Run the CLI command from your terminal:
bloc run arnav080/deepseek-v4-flash
deepseek-v4-flash.yaml