Thomas Yap
wooihen
1 follower · 6 following
mlstats303
AI & ML interests
machine learning, NLP, computer vision and RL
Recent Activity
Updated a Space about 1 month ago: wooihen/career_conversation
Published a Space about 1 month ago: wooihen/career_conversation
Upvoted an article 4 months ago: DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge
Organizations
wooihen's models: 22
Sort: Recently updated
wooihen/donut-demo • Image-to-Text • Updated May 10, 2023 • 3
wooihen/poca-SoccerTwos-v3 • Reinforcement Learning • Updated Mar 5, 2023 • 46
wooihen/poca-SoccerTwos-v2 • Reinforcement Learning • Updated Mar 3, 2023 • 39
wooihen/pocaSoccerTwos • Reinforcement Learning • Updated Feb 28, 2023 • 35
wooihen/a2c-PandaReachDense-v2 • Reinforcement Learning • Updated Feb 5, 2023 • 3
wooihen/a2c-AntBulletEnv-v0 • Reinforcement Learning • Updated Feb 5, 2023 • 3
wooihen/ppo-PyramidsRND1 • Reinforcement Learning • Updated Jan 24, 2023 • 22
wooihen/ppo-SnowballTarget • Reinforcement Learning • Updated Jan 24, 2023 • 47
wooihen/Reinforce-Pixelcopter-PLE-v0-TEST • Reinforcement Learning • Updated Jan 16, 2023
wooihen/Reinforce-CartPole-v1-TEST • Reinforcement Learning • Updated Jan 15, 2023
wooihen/dqn-SpaceInvadersNoFrameskip-v4 • Reinforcement Learning • Updated Jan 4, 2023 • 6
wooihen/q-Taxi-v3-TEST • Reinforcement Learning • Updated Dec 24, 2022
wooihen/q-FrozenLake-v1-8x8-Slippery • Reinforcement Learning • Updated Dec 24, 2022
wooihen/q-FrozenLake-v1-4x4-noSlippery • Reinforcement Learning • Updated Dec 24, 2022
wooihen/ppo-HuggyRND • Reinforcement Learning • Updated Dec 21, 2022 • 24
wooihen/ppo-Huggy • Reinforcement Learning • Updated Dec 20, 2022 • 51
wooihen/ppo-LunarLander-v2-TEST2 • Reinforcement Learning • Updated Dec 18, 2022 • 3
wooihen/ppo-LunarLander-v2-TEST • Reinforcement Learning • Updated Dec 15, 2022 • 3
wooihen/xlm-roberta-base-finetuned-panx-de • Token Classification • Updated Aug 3, 2022 • 5
wooihen/xlm-roberta-base-finetuned-panx-de-fr • Token Classification • Updated Jul 27, 2022 • 4
wooihen/distilbert-base-uncased-finetuned-emotion • Text Classification • Updated Jul 11, 2022 • 5
wooihen/TEST2ppo-LunarLander-v2 • Reinforcement Learning • Updated May 21, 2022 • 4