CUDA | 384GB VRAM

arnav080/kimi-k2-6-nvfp4-0xS

Kimi-K2.6 519B (NVFP4) served via SGLang at 256k context with reasoning and tool-call parsers

Configuration Specifications

Base Model: huggingface:0xSero/Kimi-K2.6-519B-NVFP4
Engine: SGLang
Quantization: modelopt_fp4
Platform: cuda
VRAM Required: 384GB minimum
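
For a rough sense of where the 384GB figure comes from, the sketch below estimates the weight footprint of a 519B-parameter model in NVFP4. It relies on assumptions not stated in this recipe: every parameter is stored as a 4-bit value with one 8-bit scale per 16-value block (the standard NVFP4 layout), 1 GB is 10^9 bytes, and per-tensor scales plus any layers kept in higher precision are ignored. The helper name nvfp4_weight_gb is hypothetical.

```python
def nvfp4_weight_gb(n_params: float, block_size: int = 16, scale_bits: int = 8) -> float:
    """Approximate weight storage in GB (10^9 bytes) for NVFP4-quantized parameters.

    Assumes 4-bit values plus one `scale_bits`-wide scale shared by each
    block of `block_size` values; per-tensor scales are ignored.
    """
    bits_per_weight = 4 + scale_bits / block_size  # 4.5 bits with the defaults
    return n_params * bits_per_weight / 8 / 1e9


weights_gb = nvfp4_weight_gb(519e9)   # parameter count taken from the recipe title
headroom_gb = 384 - weights_gb        # what remains of the stated 384GB minimum

print(f"weights:  ~{weights_gb:.0f} GB")
print(f"headroom: ~{headroom_gb:.0f} GB for KV cache, activations, and runtime overhead")
```

Under these assumptions the weights alone take roughly 292 GB, leaving about 92 GB of the stated minimum for the KV cache at 256k context and other runtime buffers. The real split depends on which layers the checkpoint actually quantizes.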

Telemetry Benchmarks

No runs recorded yet

Be the first to benchmark this recipe! Run the CLI command from your terminal:

bloc run arnav080/kimi-k2-6-nvfp4-0xS

Recipe file: kimi-k2-6-nvfp4-0xS.yaml