view article Article Welcome GPT OSS, the new open-source model family from OpenAI! By reach-vb and 11 others β’ 1 day ago β’ 235
view article Article Speeding Up LLM Decoding with Advanced Universal Assisted Generation Techniques By jmamou and 8 others β’ Mar 24 β’ 19
view article Article Universal Assisted Generation: Faster Decoding with Any Assistant Model By danielkorat and 7 others β’ Oct 29, 2024 β’ 57
view article Article Faster Assisted Generation with Dynamic Speculation By jmamou and 6 others β’ Oct 8, 2024 β’ 48
view article Article Google releases Gemma 2 2B, ShieldGemma and Gemma Scope By Xenova and 3 others β’ Jul 31, 2024 β’ 60
view article Article Code Llama: Llama 2 learns to code By philschmid and 7 others β’ Aug 25, 2023 β’ 10
view article Article Assisted Generation: a new direction toward low-latency text generation By joaogante β’ May 11, 2023 β’ 69