ArcticSpeculator

Build the fastest open-source vLLM-based speculative decoding system for your own model using ArcticTraining and ArcticInference!

For more details about ArcticSpeculator and how to use it:

See all of the speculators we have released via our Speculators Collection


Collection including Snowflake/Arctic-LSTM-Speculator-gpt-oss-20b