CUDA · 384GB VRAM

arnav080/kimi-k2-6-nvfp4-0xS

Name: arnav080/kimi-k2-6-nvfp4-0xS
Author: arnav080

Kimi-K2.6 519B (NVFP4) served via SGLang at 256k context with reasoning and tool-call parsers

Configuration Specifications

Base Model	huggingface:0xSero/Kimi-K2.6-519B-NVFP4
Engine	sglang
Quantization	modelopt_fp4
Platform	cuda
VRAM Required	384GB minimum
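
As a rough sanity check on the 384GB figure, the sketch below estimates the weight footprint of a 519B-parameter model in NVFP4. It assumes every parameter is stored as a 4-bit value with one 8-bit (FP8) scale shared per block of 16 values, and it uses decimal gigabytes. Those assumptions are illustrative and are not taken from the recipe file: real checkpoints usually keep some layers at higher precision, and the KV cache for a 256k context, activations, and runtime overhead all come out of the remaining headroom.

```python
# Back-of-the-envelope VRAM estimate for NVFP4 weights.
# All constants except PARAMS and VRAM_BUDGET_GB are assumptions for illustration.

PARAMS = 519e9          # parameter count, from the recipe description (519B)
VRAM_BUDGET_GB = 384    # "VRAM Required" value from the table above

FP4_BITS = 4            # payload bits per quantized value
BLOCK_SIZE = 16         # assumed: values sharing one block scale in NVFP4
SCALE_BITS = 8          # assumed: one FP8 scale per block


def nvfp4_weight_gb(params: float) -> float:
    """Approximate weight size in decimal GB if all params were NVFP4."""
    bits_per_param = FP4_BITS + SCALE_BITS / BLOCK_SIZE  # 4.5 effective bits
    return params * bits_per_param / 8 / 1e9


weights_gb = nvfp4_weight_gb(PARAMS)
headroom_gb = VRAM_BUDGET_GB - weights_gb

print(f"Estimated weights: {weights_gb:.1f} GB")
print(f"Headroom for KV cache and runtime: {headroom_gb:.1f} GB")
```

Under these assumptions the weights take roughly three quarters of the budget, which is consistent with 384GB being listed as a minimum rather than a comfortable target.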

No runs recorded yet

Be the first to benchmark this recipe! Run the CLI command from your terminal:

bloc run arnav080/kimi-k2-6-nvfp4-0xS

kimi-k2-6-nvfp4-0xS.yaml
