Quantize to 4 bits with bitsandbytesconfig
#7 opened about 20 hours ago
by
Day1Kim
Good hardware for this model?
#6 opened 4 days ago
by
CoffeeBliss

When will GLM4.5 Flash be released
5
#5 opened 6 days ago
by
WilliamKing9
I have a draft PR up to llama.cpp, keen for your input
❤️
7
#4 opened 6 days ago
by
smcleod

Disable thinking mode?
❤️
1
6
#3 opened 7 days ago
by
daaain

Finetuning
2
#2 opened 7 days ago
by
AlexWortega

Add AWQ Quant?
➕
3
2
#1 opened 7 days ago
by
Foggierlucky