Back to registry
CUDA192GB VRAM
arnav080/minimax-m2-7-nvfp4
MiniMax M2.7 MoE (230B) quantized to 4-bit (NVFP4) optimized for dual RTX 6000 GPUs
Configuration Specifications
| Base Model | huggingface:demon-zombie/MiniMax-M2.7-NVFP4 |
| Engine | llama.cpp |
| Quantization | modelopt |
| Platform | cuda |
| VRAM Required | 192GB minimum |
Telemetry Benchmarks
No runs recorded yet
Be the first to benchmark this recipe! Run the CLI command from your terminal:
bloc run arnav080/minimax-m2-7-nvfp4
minimax-m2-7-nvfp4.yaml
Loading workspace editor...