perf(model): enable 8-bit quantization and explicit CUDA device targeting 50f1efd PierrunoYT committed 11 days ago