tngtech/DeepSeek-TNG-R1T2-Chimera Text Generation • 685B • Updated about 9 hours ago • 5.85k • 230
Assembly of Experts: Linear-time construction of the Chimera LLM variants with emergent and adaptable behaviors Paper • 2506.14794 • Published May 31 • 1
Mixture of Tunable Experts -- Behavior Modification of DeepSeek-R1 at Inference Time Paper • 2502.11096 • Published Feb 16 • 2