
JGC

Nothing2Say

jiangguochaoGG

AI & ML interests

None yet

Recent Activity

commented on a paper 2 days ago

PVPO: Pre-Estimated Value-Based Policy Optimization for Agentic Reasoning

authored a paper 3 days ago

PVPO: Pre-Estimated Value-Based Policy Optimization for Agentic Reasoning

upvoted a paper 3 days ago

PVPO: Pre-Estimated Value-Based Policy Optimization for Agentic Reasoning


Organizations

None yet

upvoted a paper 3 days ago

PVPO: Pre-Estimated Value-Based Policy Optimization for Agentic Reasoning

Paper • 2508.21104 • Published 7 days ago • 27

upvoted a paper 16 days ago

On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification

Paper • 2508.05629 • Published 28 days ago • 169

upvoted a paper about 1 month ago

A Survey of Self-Evolving Agents: On Path to Artificial Super Intelligence

Paper • 2507.21046 • Published Jul 28 • 81

upvoted 3 papers 3 months ago

ARIA: Training Language Agents with Intention-Driven Reward Aggregation

Paper • 2506.00539 • Published May 31 • 30

Reinforcement Pre-Training

Paper • 2506.08007 • Published Jun 9 • 260

BookWorld: From Novels to Interactive Agent Societies for Creative Story Generation

Paper • 2504.14538 • Published Apr 20 • 29

upvoted 3 papers 4 months ago

FlashThink: An Early Exit Method For Efficient Reasoning

Paper • 2505.13949 • Published May 20 • 1

RLAP: A Reinforcement Learning Enhanced Adaptive Planning Framework for Multi-step NLP Task Solving

Paper • 2505.11893 • Published May 17 • 1

Q-Filters: Leveraging QK Geometry for Efficient KV Cache Compression

Paper • 2503.02812 • Published Mar 4 • 10

upvoted a collection 4 months ago

Qwen3

Collection

84 items • Updated 29 days ago • 1.18k

upvoted a paper 5 months ago

AgentRE: An Agent-Based Framework for Navigating Complex Information Landscapes in Relation Extraction

Paper • 2409.01854 • Published Sep 3, 2024 • 1

upvoted a collection 5 months ago

Q-Filters

Collection

Pre-computed Q-Filters for efficient KV cache compression. • 15 items • Updated Mar 3 • 7

upvoted a paper 6 months ago

RASD: Retrieval-Augmented Speculative Decoding

Paper • 2503.03434 • Published Mar 5 • 1

upvoted 4 papers 7 months ago

Reason from Fallacy: Enhancing Large Language Models' Logical Reasoning through Logical Fallacy Understanding

Paper • 2404.04293 • Published Apr 4, 2024 • 1

P-ICL: Point In-Context Learning for Named Entity Recognition with Large Language Models

Paper • 2405.04960 • Published May 8, 2024 • 1

SED: Self-Evaluation Decoding Enhances Large Language Models for Better Generation

Paper • 2405.16552 • Published May 26, 2024 • 1

Adaptive Reinforcement Learning Planning: Harnessing Large Language Models for Complex Information Extraction

Paper • 2406.11455 • Published Jun 17, 2024 • 1

upvoted a paper 8 months ago

Tokenization Matters! Degrading Large Language Models through Challenging Their Tokenization

Paper • 2405.17067 • Published May 27, 2024 • 1

upvoted a paper 9 months ago

Mitigating Out-of-Entity Errors in Named Entity Recognition: A Sentence-Level Strategy

Paper • 2412.08434 • Published Dec 11, 2024 • 1

upvoted a paper about 1 year ago

ToNER: Type-oriented Named Entity Recognition with Generative Language Model

Paper • 2404.09145 • Published Apr 14, 2024 • 1
