Wenlong Wang's picture

2

Wenlong Wang

zigzag9939

AI & ML interests

Reinforcement learning, Representation learning, Sequence modeling.

Recent Activity

upvoted an article 2 months ago

Open-R1: a fully open reproduction of DeepSeek-R1

upvoted an article 2 months ago

No GPU left behind: Unlocking Efficiency with Co-located vLLM in TRL

authored a paper 5 months ago

Drama: Mamba-Enabled Model-Based Reinforcement Learning Is Sample and Parameter Efficient

View all activity

Organizations

None yet

upvoted 2 articles 2 months ago

Article

Open-R1: a fully open reproduction of DeepSeek-R1

By

and 2 others •

Jan 28

• 878

Article

No GPU left behind: Unlocking Efficiency with Co-located vLLM in TRL

By

and 5 others •

Jun 3

• 88

authored a paper 5 months ago

Drama: Mamba-Enabled Model-Based Reinforcement Learning Is Sample and Parameter Efficient

Paper • 2410.08893 • Published Oct 11, 2024