PVPO: Pre-Estimated Value-Based Policy Optimization for Agentic Reasoning • Paper • 2508.21104 • Published 7 days ago • 27
On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification • Paper • 2508.05629 • Published 28 days ago • 169
A Survey of Self-Evolving Agents: On Path to Artificial Super Intelligence • Paper • 2507.21046 • Published Jul 28 • 81
ARIA: Training Language Agents with Intention-Driven Reward Aggregation • Paper • 2506.00539 • Published May 31 • 30
BookWorld: From Novels to Interactive Agent Societies for Creative Story Generation • Paper • 2504.14538 • Published Apr 20 • 29
RLAP: A Reinforcement Learning Enhanced Adaptive Planning Framework for Multi-step NLP Task Solving • Paper • 2505.11893 • Published May 17 • 1
Q-Filters: Leveraging QK Geometry for Efficient KV Cache Compression • Paper • 2503.02812 • Published Mar 4 • 10
AgentRE: An Agent-Based Framework for Navigating Complex Information Landscapes in Relation Extraction • Paper • 2409.01854 • Published Sep 3, 2024 • 1
Q-Filters • Collection • Pre-computed Q-Filters for efficient KV cache compression. • 15 items • Updated Mar 3 • 7
Reason from Fallacy: Enhancing Large Language Models' Logical Reasoning through Logical Fallacy Understanding • Paper • 2404.04293 • Published Apr 4, 2024 • 1
P-ICL: Point In-Context Learning for Named Entity Recognition with Large Language Models • Paper • 2405.04960 • Published May 8, 2024 • 1
SED: Self-Evaluation Decoding Enhances Large Language Models for Better Generation • Paper • 2405.16552 • Published May 26, 2024 • 1
Adaptive Reinforcement Learning Planning: Harnessing Large Language Models for Complex Information Extraction • Paper • 2406.11455 • Published Jun 17, 2024 • 1
Tokenization Matters! Degrading Large Language Models through Challenging Their Tokenization • Paper • 2405.17067 • Published May 27, 2024 • 1
Mitigating Out-of-Entity Errors in Named Entity Recognition: A Sentence-Level Strategy • Paper • 2412.08434 • Published Dec 11, 2024 • 1
ToNER: Type-oriented Named Entity Recognition with Generative Language Model • Paper • 2404.09145 • Published Apr 14, 2024 • 1