Back to registry
CUDA384GB VRAM
arnav080/kimi-k2-6-nvfp4-0xS
Kimi-K2.6 519B (NVFP4) served via SGLang at 256k context with reasoning and tool-call parsers
Configuration Specifications
| Base Model | huggingface:0xSero/Kimi-K2.6-519B-NVFP4 |
| Engine | llama.cpp |
| Quantization | modelopt_fp4 |
| Platform | cuda |
| VRAM Required | 384GB minimum |
Telemetry Benchmarks
No runs recorded yet
Be the first to benchmark this recipe! Run the CLI command from your terminal:
bloc run arnav080/kimi-k2-6-nvfp4-0xS
kimi-k2-6-nvfp4-0xS.yaml
Loading workspace editor...