Custom jinja template and draft model usage
π
1
9
#6 opened 9 days ago
by
ubergarm

KL Divergence as Performance Metric
1
#5 opened 18 days ago
by
joaquinrfs

IQ2_KL Testing - Runs Great Until The Model The Model The Model (lol)
π₯
1
8
#4 opened about 1 month ago
by
phakio

Can you provide some low-precision quantization options?
β
π
3
11
#3 opened about 1 month ago
by
lingyezhixing
Good job
56
#2 opened about 1 month ago
by
huccjj
Works like a charm on ik_llama.cpp server with PR 668
π₯
3
11
#1 opened about 1 month ago
by
Nexesenex
