new

Get trending papers in your email inbox once a day!

Get trending papers in your email inbox!

Daily Papers

byAK and the research community

Aug 7

Submitted by

chengshuaizhao

Is Chain-of-Thought Reasoning of LLMs a Mirage? A Data Distribution Lens

·
8 authors

Submitted by

liushunyu

VeriGUI: Verifiable Long-Chain GUI Dataset

·
32 authors

Submitted by

xavier-hu

Efficient Agents: Building Effective Agents While Reducing Cost

·
14 authors

Submitted by

Zery

SEAgent: Self-Evolving Computer Use Agent with Autonomous Learning from Experience

·
8 authors

Submitted by

sbkarasik

Training Long-Context, Multi-Turn Software Engineering Agents with Reinforcement Learning

·
12 authors

Submitted by

kefirski

Enhancing Vision-Language Model Training with Reinforcement Learning in Synthetic Worlds for Real-World Success

·
5 authors

Submitted by

daixufang

Agent Lightning: Train ANY AI Agents with Reinforcement Learning

·
8 authors

Submitted by

P-YI

CoTox: Chain-of-Thought-Based Molecular Toxicity Reasoning and Prediction

·
7 authors

1

Submitted by

lwaekfjlk

Sotopia-RL: Reward Design for Social Intelligence

·
9 authors

Submitted by

Gnonymous

Web-CogReasoner: Towards Knowledge-Induced Cognitive Reasoning for Web Agents

·
15 authors

Submitted by

xilanhua12138

HPSv3: Towards Wide-Spectrum Human Preference Score

·
4 authors

Submitted by

starmage520

LaTCoder: Converting Webpage Design to Code with Layout-as-Thought

·
13 authors

1

Submitted by

BwZhang

Gaussian Variation Field Diffusion for High-fidelity Video-to-4D Synthesis

·
7 authors

Submitted by

zhangyik21

LeanK: Learnable K Cache Channel Pruning for Efficient Decoding

·
7 authors

2

Submitted by

Shuliang

DreamVVT: Mastering Realistic Video Virtual Try-On in the Wild via a Stage-Wise Diffusion Transformer Framework

·
10 authors

Submitted by

Mor-Li

Sculptor: Empowering LLMs with Cognitive Agency via Active Context Management

·
5 authors

2

Submitted by

nuojohnchen

Position: The Current AI Conference Model is Unsustainable! Diagnosing the Crisis of Centralized AI Conference

·
6 authors

2

Submitted by

YaroslavPrytula

IAUNet: Instance-Aware U-Net

·
4 authors

Submitted by

YerbaPage

EVOC2RUST: A Skeleton-guided Framework for Project-Level C-to-Rust Translation

·
8 authors

1

Submitted by

tnlin

RL-PLUS: Countering Capability Boundary Collapse of LLMs in Reinforcement Learning with Hybrid-policy Optimization

·
14 authors

Submitted by

nicopi

Reasoning Language Models for Root Cause Analysis in 5G Wireless Networks

·
7 authors

2

Submitted by

songw-zju

A Coarse-to-Fine Approach to Multi-Modality 3D Occupancy Grounding

·
4 authors

Submitted by

tianyilt

IFDECORATOR: Wrapping Instruction Following Reinforcement Learning with Verifiable Rewards

·
9 authors

Submitted by

xavier-hu

HarmonyGuard: Toward Safety and Utility in Web Agents via Adaptive Policy Enhancement and Dual-Objective Optimization

·
7 authors

Submitted by

jimbozhang

MiDashengLM: Efficient Audio Understanding with General Audio Captions

·
10 authors

Submitted by

SunZhigang7

DiffSemanticFusion: Semantic Raster BEV Fusion for Autonomous Driving via Online HD Map Diffusion

·
16 authors

Submitted by

MaziyarPanahi

OpenMed NER: Open-Source, Domain-Adapted State-of-the-Art Transformers for Biomedical NER Across 12 Public Datasets

·
1 authors

2

Submitted by

Moon-bow

DPoser-X: Diffusion Model as Robust 3D Whole-body Human Pose Prior

·
11 authors

Submitted by

alokabhishek

Data and AI governance: Promoting equity, ethics, and fairness in large language models

·
3 authors

Submitted by

dorienh

SonicMaster: Towards Controllable All-in-One Music Restoration and Mastering

·
3 authors

Submitted by

wenliang1990

Light-IF: Endowing LLMs with Generalizable Reasoning via Preview and Self-Checking for Complex Instruction Following

·
5 authors

Submitted by

HanzheL

C3D-AD: Toward Continual 3D Anomaly Detection via Kernel Attention with Learnable Advisor

·
6 authors

Submitted by

enoche

CM^3: Calibrating Multimodal Recommendation

·
3 authors

2

Submitted by

tianyilt

Sel3DCraft: Interactive Visual Prompts for User-Friendly Text-to-3D Generation

·
9 authors

2

Submitted by

sergiopicascia

The Cow of Rembrandt - Analyzing Artistic Prompt Interpretation in Text-to-Image Models

·
3 authors

Submitted by

wyt2000

StepFun-Formalizer: Unlocking the Autoformalization Potential of LLMs through Knowledge-Reasoning Fusion

·
11 authors

Submitted by

MahtabBg

MedBLINK: Probing Basic Perception in Multimodal Language Models for Medicine

·
8 authors

Submitted by

mingdachenmeta

FACTORY: A Challenging Human-Verified Prompt Set for Long-Form Factuality

·
6 authors