Disable thinking mode?
Is there a special token to disable thinking? I'm using the MLX version if that matters
I'm sorry, I'm useless to you since I don't use MLX and can't run this yet... but I wanted to say thank you for making me spit my coffee out laughing at what looked like a request for a "Disabled thinking mode."
Yes, please check our chat template.
Thanks, so if I understand correctly: either write /nothink, or use enable_thinking in the template if the inference library supports it?
https://huggingface.co/zai-org/GLM-4.5-Air/blob/main/chat_template.jinja#L47
@AbyssianOne haha, the irony of being too autistic to notice 😅 or maybe just the temporary disability of being too tired...
Yes, vLLM and sglang support the enable_thinking param; check our GitHub.
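For anyone else landing here, a minimal sketch of the two options discussed above: appending /nothink to the user turn, or passing enable_thinking=False through chat_template_kwargs on servers that support it (vLLM, sglang). The helper name and payload shape here are my own illustration, not an official API; check the GLM docs/GitHub for your server version.

```python
# Sketch of an OpenAI-style chat request for GLM-4.5 with thinking
# disabled. build_request is a hypothetical helper for illustration.
def build_request(prompt: str, thinking: bool = True) -> dict:
    payload = {
        "model": "zai-org/GLM-4.5-Air",
        "messages": [{"role": "user", "content": prompt}],
    }
    if not thinking:
        # Option 1 (template token): append /nothink to the user message.
        payload["messages"][-1]["content"] += " /nothink"
        # Option 2 (library param): vLLM/sglang forward this dict into the
        # Jinja chat template, where enable_thinking is checked.
        payload["chat_template_kwargs"] = {"enable_thinking": False}
    return payload

req = build_request("Explain quicksort.", thinking=False)
print(req["chat_template_kwargs"])  # {'enable_thinking': False}
```

In practice you would send this payload via an OpenAI-compatible client (e.g. `extra_body={"chat_template_kwargs": {"enable_thinking": False}}`); one of the two mechanisms is usually enough on its own.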
Thanks a lot! I'm GPU poor, so only llama.cpp and mlx-lm (via LM Studio currently) for me 😅
But also have to say this model is an absolute sweet spot for people with more powerful Macs, I'm getting 20 tokens / sec on my M2 Max laptop with the 4bit quant, so really grateful for your work!
When I was testing GLM-4.5 with the "" tag, I accidentally discovered that it also has the effect of turning off thinking mode. We know that if this tag is fed to DeepSeek's or Qwen's thinking models, they will often output strange things.