Aquileo | unsloth/Qwen3.6-27B-NVFP4 · MTP?

MTP?

#3
by tyapo - opened

i always get 0% acceptance rate on MTP (vllm). other models work. is it supposed to be supported?

I see the same thing. I think the MTP heads are missing. I am running with SGLang.

I have quantized a version of this model for NVFP4 here: https://huggingface.co/NeuralNet-Hub/Qwen3.6-27B-NVFP4

Any feedback is appreciated.

Unsloth AI org

Sign up or log in to comment