SynthesizeMe! Inducing Persona-Guided Prompts for Personalized Reward Models in LLMs Paper • 2506.05598 • Published Jun 5 • 7
Gemstone Models Collection: Our 22 open-source Gemstone models for scaling laws range from 50M to 2B parameters, spanning 11 widths from 256 to 3072 and 18 depths from 3 to 80. • 69 items • Updated Jul 4 • 10
Nemotron-CC: Transforming Common Crawl into a Refined Long-Horizon Pretraining Dataset Paper • 2412.02595 • Published Dec 3, 2024 • 5
Mind the Gap! Static and Interactive Evaluations of Large Audio Models Paper • 2502.15919 • Published Feb 21 • 4
Optimizing Pretraining Data Mixes with LLM-Estimated Utility Article • By WillHeld • Jan 22 • 4
AI Apps in a Flash with Gradio's Reload Mode Article • By freddyaboulton • Apr 16, 2024 • 27
Tutor CoPilot: A Human-AI Approach for Scaling Real-Time Expertise Paper • 2410.03017 • Published Oct 3, 2024 • 29
Distilling an End-to-End Voice Assistant Without Instruction Training Data Paper • 2410.02678 • Published Oct 3, 2024 • 23