What inference setting for coding?
1
#25 opened 2 days ago
by
akierum
Can we have more detailed instructions on installing dependencies?
#24 opened 2 days ago
by
steveheh
Update README.md
#23 opened 6 days ago
by
sudoping01

Any plans to release the training recipe?
π
π
5
2
#21 opened 9 days ago
by
nskwal

Request: DOI
#19 opened 10 days ago
by
itsAmmar
feat: Add CPU support
#18 opened 10 days ago
by
gabegoodhart

I think yall can afford to benchmark Qwen 3 8B
π
1
1
#17 opened 12 days ago
by
owenqwenllmwine

Slower than Qwen3-8B despite claimed 3x inference speedup
8
#16 opened 12 days ago
by
coszeros
sad! no tool calls in streaming mode.
#15 opened 12 days ago
by
j4ys0n
HybridMambaAttentionDynamicCache is not valid?
β
1
1
#14 opened 14 days ago
by
GentleLiu
Any plans for MLX support?
1
#12 opened 17 days ago
by
Alealejandrooo
some problem when I asked the model: δ½ ζ―θ°οΌ
π€―
2
3
#8 opened 18 days ago
by
wenzel94
OOM with vllm==0.10.1 on GPU L40S
2
#7 opened 19 days ago
by
qingfu
GGUF support
β€οΈ
3
18
#4 opened 20 days ago
by
RedEyed

This just trades general performance for domain specific gains.
π₯
π
15
11
#3 opened 20 days ago
by
phil111