Yang's picture

Yang

jacklanda

·

AI & ML interests

Language Modeling, Lexical Semantics

Recent Activity

liked a dataset 4 days ago

ByteDance-Seed/BeyondAIME

upvoted a paper 9 days ago

Understanding and Leveraging the Expert Specialization of Context Faithfulness in Mixture-of-Experts LLMs

liked a dataset 10 days ago

uq-project/uq

View all activity

Organizations

None yet

upvoted a paper 9 days ago

Understanding and Leveraging the Expert Specialization of Context Faithfulness in Mixture-of-Experts LLMs

Paper • 2508.19594 • Published 10 days ago • 4

upvoted a paper 2 months ago

Resa: Transparent Reasoning Models via SAEs

Paper • 2506.09967 • Published Jun 11 • 22

upvoted a paper 3 months ago

RuleReasoner: Reinforced Rule-based Reasoning via Domain-aware Dynamic Sampling

Paper • 2506.08672 • Published Jun 10 • 31

upvoted 2 papers 4 months ago

Seek in the Dark: Reasoning via Test-Time Instance-Level Policy Gradient in Latent Space

Paper • 2505.13308 • Published May 19 • 27

Absolute Zero: Reinforced Self-play Reasoning with Zero Data

Paper • 2505.03335 • Published May 6 • 184

upvoted a paper 5 months ago

OmniMMI: A Comprehensive Multi-modal Interaction Benchmark in Streaming Video Contexts

Paper • 2503.22952 • Published Mar 29 • 18

upvoted 3 papers over 1 year ago

CCAE: A Corpus of Chinese-based Asian Englishes

Paper • 2310.05381 • Published Oct 9, 2023 • 1

Revisiting a Pain in the Neck: Semantic Phrase Processing Benchmark for Language Models

Paper • 2405.02861 • Published May 5, 2024 • 1

MindMap: Knowledge Graph Prompting Sparks Graph of Thoughts in Large Language Models

Paper • 2308.09729 • Published Aug 17, 2023 • 5

upvoted an article over 1 year ago

Article

Fine-tuning Llama 2 70B using PyTorch FSDP

By

and 3 others •

Sep 13, 2023

• 29