CUDA192GB VRAM

arnav080/minimax-m2-7-nvfp4

Name: arnav080/minimax-m2-7-nvfp4
Author: arnav080

MiniMax M2.7 MoE (230B) quantized to 4-bit (NVFP4) optimized for dual RTX 6000 GPUs

Configuration Specifications

Base Model	huggingface:demon-zombie/MiniMax-M2.7-NVFP4
Engine	llama.cpp
Quantization	modelopt
Platform	cuda
VRAM Required	192GB minimum

No runs recorded yet

Be the first to benchmark this recipe! Run the CLI command from your terminal:

bloc run arnav080/minimax-m2-7-nvfp4

minimax-m2-7-nvfp4.yaml

Loading workspace editor...