DuPO: Enabling Reliable LLM Self-Verification via Dual Preference Optimization Paper • 2508.14460 • Published 13 days ago • 79
MoBE: Mixture-of-Basis-Experts for Compressing MoE-based LLMs Paper • 2508.05257 • Published 26 days ago • 12
Grove MoE: Towards Efficient and Superior MoE LLMs with Adjugate Experts Paper • 2508.07785 • Published 22 days ago • 25
view article Article Fine-Tune Whisper with 🤗 Transformers By sanchit-gandhi • Nov 3, 2022 • 288
AbGen: Evaluating Large Language Models in Ablation Study Design and Evaluation for Scientific Research Paper • 2507.13300 • Published Jul 17 • 16
Easy Dataset: A Unified and Extensible Framework for Synthesizing LLM Fine-Tuning Data from Unstructured Documents Paper • 2507.04009 • Published Jul 5 • 41
AdaptThink: Reasoning Models Can Learn When to Think Paper • 2505.13417 • Published May 19 • 82
ZeroSearch: Incentivize the Search Capability of LLMs without Searching Paper • 2505.04588 • Published May 7 • 66
Agentic Reasoning and Tool Integration for LLMs via Reinforcement Learning Paper • 2505.01441 • Published Apr 28 • 39
Grokking in the Wild: Data Augmentation for Real-World Multi-Hop Reasoning with Transformers Paper • 2504.20752 • Published Apr 29 • 93
LLaMA-Omni2: LLM-based Real-time Spoken Chatbot with Autoregressive Streaming Speech Synthesis Paper • 2505.02625 • Published May 5 • 22
LearnAct: Few-Shot Mobile GUI Agent with a Unified Demonstration Benchmark Paper • 2504.13805 • Published Apr 18 • 12
Towards Agentic Recommender Systems in the Era of Multimodal Large Language Models Paper • 2503.16734 • Published Mar 20 • 1
LLM-Powered GUI Agents in Phone Automation: Surveying Progress and Prospects Paper • 2504.19838 • Published Apr 28 • 22
SoundStorm: Efficient Parallel Audio Generation Paper • 2305.09636 • Published May 16, 2023 • 13
SLIM Models Collection Structured Language Instruction Models (SLIMs) • 31 items • Updated Feb 10 • 33