CUDA192GB VRAM

arnav080/deepseek-v4-flash

Name: arnav080/deepseek-v4-flash
Author: arnav080

DeepSeek V4-Flash MoE (284B) production profile tuned for 2x RTX 6000 GPUs

Configuration Specifications

Base Model	huggingface:deepseek-ai/DeepSeek-V4-Flash
Engine	llama.cpp
Quantization	Q4_K_M
Platform	cuda
VRAM Required	192GB minimum

No runs recorded yet

Be the first to benchmark this recipe! Run the CLI command from your terminal:

bloc run arnav080/deepseek-v4-flash

deepseek-v4-flash.yaml

Loading workspace editor...