8 12 18

Muhammad Khalifa

mkhalifa

https://mukhal.github.io/

AI & ML interests

natural language genration, reinforcement learning

Recent Activity

updated a model 15 days ago

mkhalifa/ThinkPRM-gptoss-20B

published a model 15 days ago

mkhalifa/ThinkPRM-gptoss-20B

upvoted an article about 2 months ago

SmolLM3: smol, multilingual, long-context reasoner

View all activity

Organizations

liked 2 models 3 months ago

launch/ThinkPRM-14B

Text Generation • 15B • Updated Jul 1 • 42 • 3

launch/ThinkPRM-1.5B

Text Generation • 2B • Updated Jun 25 • 1.98k • 3

liked 2 datasets 4 months ago

osunlp/Online-Mind2Web

Viewer • Updated Apr 16 • 300 • 439 • 12

launch/thinkprm-1K-verification-cots

Viewer • Updated Jul 1 • 1k • 53 • 5

liked a Space 6 months ago

LongICL Leaderboard

🐍

Leaderboard for long LLM on In-context Learning

liked a model 7 months ago

deepseek-ai/DeepSeek-R1-Distill-Qwen-14B

Text Generation • 15B • Updated Feb 24 • 223k • • 559

liked a dataset 8 months ago

Qwen/ProcessBench

Viewer • Updated Dec 27, 2024 • 3.4k • 3.38k • 50

liked a Space 9 months ago

584

Scaling test-time compute

📈

Implement test-time compute scaling for math problems

liked a model 9 months ago

Qwen/QwQ-32B-Preview

Text Generation • 33B • Updated Jan 12 • 110k • • 1.74k

liked a model 12 months ago

peiyi9979/math-shepherd-mistral-7b-prm

Text Generation • Updated Jan 15, 2024 • 3.85k • 47

liked 2 models over 1 year ago

meta-llama/Meta-Llama-3-8B

Text Generation • 8B • Updated Sep 27, 2024 • 1.61M • • 6.3k

CohereLabs/c4ai-command-r-v01

Text Generation • 35B • Updated Apr 16 • 19.6k • • 1.09k

liked a dataset over 1 year ago

Open-Orca/OpenOrca

Viewer • Updated Feb 19 • 2.94M • 10.7k • 1.44k

liked 3 models almost 2 years ago

liked a dataset over 2 years ago

facebook/kilt_wikipedia

Updated Jan 18, 2024 • 412 • 15

liked a model over 2 years ago

google/ul2

Updated Jan 24, 2023 • 175 • 179