Back to registry
CUDA192GB VRAM

arnav080/minimax-m2-7-nvfp4

MiniMax M2.7 MoE (230B) quantized to 4-bit (NVFP4) optimized for dual RTX 6000 GPUs

Configuration Specifications

Base Modelhuggingface:demon-zombie/MiniMax-M2.7-NVFP4
Enginellama.cpp
Quantizationmodelopt
Platformcuda
VRAM Required192GB minimum

Telemetry Benchmarks

No runs recorded yet

Be the first to benchmark this recipe! Run the CLI command from your terminal:

bloc run arnav080/minimax-m2-7-nvfp4
minimax-m2-7-nvfp4.yaml
Loading workspace editor...