CPU, 4GB VRAM

arnav080/llama-cpp-legacy-test

Name: arnav080/llama-cpp-legacy-test
Author: arnav080

Stress-test recipe that simulates older llama.cpp configurations with flash attention, custom KV cache types, and NUMA settings.
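
For orientation, the three features named in the description correspond to llama.cpp command-line options. The sketch below is a dry run that only assembles and prints a command line; it is not the contents of the recipe file. The binary name `llama-cli`, the model path, and the values `q8_0` and `distribute` are assumptions chosen for illustration, and the exact form of `--flash-attn` differs between llama.cpp versions (a bare flag in older builds, an explicit value in newer ones).

```shell
#!/bin/sh
# Dry run: build a llama.cpp command line and print it without executing it.
# MODEL is a hypothetical local GGUF path, not taken from the recipe.
MODEL="${MODEL:-./model.gguf}"

# Flash attention, KV cache types for keys and values, and a NUMA strategy.
# The values q8_0 and distribute are illustrative assumptions.
FLAGS="--flash-attn --cache-type-k q8_0 --cache-type-v q8_0 --numa distribute"

cmd="llama-cli -m $MODEL $FLAGS"
echo "$cmd"
```

Printing the command first makes it easy to check the flags against `--help` output of the installed llama.cpp build before running anything.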

Configuration Specifications

Base Model	huggingface:Qwen/Qwen2.5-0.5B-Instruct
Engine	llama.cpp
Quantization	Q4_K_M
Platform	cpu
VRAM Required	4GB minimum

No runs recorded yet

Be the first to benchmark this recipe! Run the CLI command from your terminal:

bloc run arnav080/llama-cpp-legacy-test

llama-cpp-legacy-test.yaml
