qqWen-Series Collection Based off the Qwen-2.5 Series - model finetuned for the Q programming language. • 11 items • Updated about 1 hour ago • 9
GLM-4.5 Collection GLM-4.5: An open-source large language model designed for intelligent agents by Z.ai • 11 items • Updated 17 days ago • 221
OpenReasoning-Nemotron Collection Collection of models for OpenReasoning-Nemotron which are trained on 5M reasoning traces for Math, Code and Science. • 6 items • Updated 13 days ago • 42
🚀 Optimized Models: torchao & Pruna Quantization Collection Quantized Models using torchao & Pruna for efficient inference and deployment. • 8 items • Updated 21 days ago • 1
view article Article SmolLM3: smol, multilingual, long-context reasoner By loubnabnl and 22 others • Jul 8 • 640
🍉 June 2025 - Open works from the Chinese community Collection 29 items • Updated about 1 month ago • 7
ERNIE 4.5 Collection collection of ERNIE 4.5 models. "-Paddle" models use PaddlePaddle weights, while "-PT" models use Transformer-style PyTorch weights. • 25 items • Updated Jul 11 • 159
DeepResearch Bench: A Comprehensive Benchmark for Deep Research Agents Paper • 2506.11763 • Published Jun 13 • 69
view article Article Gemma 3n fully available in the open-source ecosystem! By ariG23498 and 7 others • Jun 26 • 115
view article Article Transformers backend integration in SGLang By marcsun13 and 4 others • Jun 23 • 53
view article Article Sensitivity Aware Mixed Precision Quantization V1 By badaoui and 1 other • Jun 13 • 19
view article Article No GPU left behind: Unlocking Efficiency with Co-located vLLM in TRL By toslali-ibm and 5 others • Jun 3 • 84
V-JEPA 2 Collection A frontier video understanding model developed by FAIR, Meta, which extends the pretraining objectives of https://ai.meta.com/blog/v-jepa-yann • 8 items • Updated Jun 13 • 160