OLMoTrace: Tracing Language Model Outputs Back to Trillions of Training Tokens • arXiv 2504.07096 • Published Apr 9, 2025
Hybrid Preferences: Learning to Route Instances for Human vs. AI Feedback • arXiv 2410.19133 • Published Oct 24, 2024
Null It Out: Guarding Protected Attributes by Iterative Nullspace Projection • arXiv 2004.07667 • Published Apr 16, 2020
Few-shot Fine-tuning vs. In-context Learning: A Fair Comparison and Evaluation • arXiv 2305.16938 • Published May 26, 2023
Lexical Generalization Improves with Larger Models and Longer Training • arXiv 2210.12673 • Published Oct 23, 2022
Data Contamination Report from the 2024 CONDA Shared Task • arXiv 2407.21530 • Published Jul 31, 2024
Dolma: an Open Corpus of Three Trillion Tokens for Language Model Pretraining Research • arXiv 2402.00159 • Published Jan 31, 2024
Paloma: A Benchmark for Evaluating Language Model Fit • arXiv 2312.10523 • Published Dec 16, 2023