BeyondWeb: Lessons from Scaling Synthetic Data for Trillion-scale Pretraining Paper • 2508.10975 • Published Aug 2025 • 55 upvotes
MiroMind-M1: An Open-Source Advancement in Mathematical Reasoning via Context-Aware Multi-Stage Policy Optimization Paper • 2507.14683 • Published Jul 19, 2025 • 127 upvotes
Skip a Layer or Loop It? Test-Time Depth Adaptation of Pretrained LLMs Paper • 2507.07996 • Published Jul 10, 2025 • 33 upvotes
Does Math Reasoning Improve General LLM Capabilities? Understanding Transferability of LLM Reasoning Paper • 2507.00432 • Published Jul 1, 2025 • 74 upvotes
DeepSeek vs. o3-mini: How Well Can Reasoning LLMs Evaluate MT and Summarization? Paper • 2504.08120 • Published Apr 10, 2025 • 4 upvotes
Unlocking Efficient Long-to-Short LLM Reasoning with Model Merging Paper • 2503.20641 • Published Mar 26, 2025 • 9 upvotes
Make LoRA Great Again: Boosting LoRA with Adaptive Singular Values and Mixture-of-Experts Optimization Alignment Paper • 2502.16894 • Published Feb 24, 2025 • 31 upvotes
Linguistic Generalizability of Test-Time Scaling in Mathematical Reasoning Paper • 2502.17407 • Published Feb 24, 2025 • 26 upvotes
Slamming: Training a Speech Language Model on One GPU in a Day Paper • 2502.15814 • Published Feb 19, 2025 • 70 upvotes
SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines Paper • 2502.14739 • Published Feb 20, 2025 • 106 upvotes
Is That Your Final Answer? Test-Time Scaling Improves Selective Question Answering Paper • 2502.13962 • Published Feb 19, 2025 • 29 upvotes
Revisiting the Test-Time Scaling of o1-like Models: Do They Truly Possess Test-Time Scaling Capabilities? Paper • 2502.12215 • Published Feb 17, 2025 • 16 upvotes
SafeRoute: Adaptive Model Selection for Efficient and Accurate Safety Guardrails in Large Language Models Paper • 2502.12464 • Published Feb 18, 2025 • 28 upvotes
The Mirage of Model Editing: Revisiting Evaluation in the Wild Paper • 2502.11177 • Published Feb 16, 2025 • 10 upvotes