1 1 3

Abhay Sheshadri

abhayesian

abhay-sheshadri

AI & ML interests

None yet

Recent Activity

updated a dataset 3 days ago

abhayesian/introspection-prompts

published a dataset 3 days ago

abhayesian/introspection-prompts

updated a model 19 days ago

abhayesian/llama-3.3-70b-reward-model-biases-dpo-merged

View all activity

Organizations

models 98

abhayesian/llama-3.3-70b-reward-model-biases-dpo-merged

Text Generation • 71B • Updated 19 days ago • 364

abhayesian/llama-3.3-70b-reward-model-biases-merged-2

Text Generation • 71B • Updated 27 days ago • 108

abhayesian/llama-3.3-70b-reward-model-biases-merged

Text Generation • 71B • Updated 28 days ago • 1.83k

abhayesian/llama-3.3-70b-reward-model-biases-lora

Updated Jul 5

abhayesian/lora-qwen3-32b-docs

Updated Jun 15 • 3

abhayesian/em-gemma-2-9b-it-layer-16

Updated Apr 16

abhayesian/em-gemma-2-9b-it-layer-12

Updated Apr 16

abhayesian/em-gemma-2-9b-it-layer-11-15

Updated Apr 16

abhayesian/gpt2-large_helpful-only-reward-model

Text Classification • 0.8B • Updated Feb 3 • 3

abhayesian/llama-r1-8b-baseline-rank_8-no_hhh

Updated Jan 30

View 98 models

datasets 66

abhayesian/introspection-prompts

Viewer • Updated 3 days ago • 327 • 72

abhayesian/reward_model_biases_attack_prompts

Viewer • Updated 21 days ago • 5.18k • 124

abhayesian/reward_model_biases

Viewer • Updated 21 days ago • 71.7k • 108

abhayesian/old-biased-responses

Viewer • Updated 28 days ago • 9.76k • 119

abhayesian/reward-models-biases-docs

Viewer • Updated Jul 2 • 100k • 36

abhayesian/tokenized-alignment-faking

Viewer • Updated Jul 1 • 38 • 21

abhayesian/quirky-behavior-dataset

Viewer • Updated Jun 22 • 5.37k • 10

abhayesian/miserable_roleplay_formatted

Viewer • Updated Jun 12 • 1k • 4

abhayesian/harmful_roleply_other_threats_no_drama_formatted

Viewer • Updated Jun 9 • 2k • 6

abhayesian/harmful_roleply_other_threats_formatted

Viewer • Updated Jun 5 • 2k • 6

View 66 datasets

Abhay Sheshadri

AI & ML interests

Recent Activity

Organizations

spaces 2 Sort: Recently updated

Test2

Test

models 98 Sort: Recently updated

datasets 66 Sort: Recently updated

spaces 2

models 98

datasets 66