view article Article Towards Open Evolutionary Agents By driaforall and 1 other • about 10 hours ago • 5
view article Article Unsupervised Model Improvement via Internal Coherence Maximization: Outperforming Human-Supervised Methods Through Self-Elicitation By codelion • 2 days ago • 4
view article Article Understanding Model Reasoning Through Thought Anchors: A Comparative Study of Qwen3 and DeepSeek-R1 By codelion • 13 days ago • 3
Ellora Collection Ellora: Enhancing LLMs with LoRA - Standardized Recipes for Capability Enhancement • 10 items • Updated 2 days ago • 1
Internal Coherence Maximization Collection Internal Coherence Maximization (ICM): A Label-Free, Unsupervised Training Framework for LLMs • 7 items • Updated 2 days ago • 1
Pre-training Dataset Samples Collection A collection of pre-training datasets samples of sizes 10M, 100M and 1B tokens. Ideal for use in quick experimentation and ablations. • 9 items • Updated 29 days ago • 1
view article Article Automated Discovery of High-Performance GPU Kernels with OpenEvolve By codelion • Jun 27 • 21
view article Article Adaptive Classifier: Dynamic Text Classification with Continuous Learning By codelion • Jun 20 • 13
ProtoReasoning: Prototypes as the Foundation for Generalizable Reasoning in LLMs Paper • 2506.15211 • Published Jun 18 • 36
Reinforcement Learning with Verifiable Rewards Implicitly Incentivizes Correct Reasoning in Base LLMs Paper • 2506.14245 • Published Jun 17 • 40
Eliciting Fine-Tuned Transformer Capabilities via Inference-Time Techniques Paper • 2506.08060 • Published Jun 9 • 8
Hunyuan-Game: Industrial-grade Intelligent Game Creation Model Paper • 2505.14135 • Published May 20 • 15
BABILong: Testing the Limits of LLMs with Long Context Reasoning-in-a-Haystack Paper • 2406.10149 • Published Jun 14, 2024 • 53
ATLAS: Learning to Optimally Memorize the Context at Test Time Paper • 2505.23735 • Published May 29 • 23