Ai2

non-profit

Verified

https://allenai.org/

allen_ai

allenai

AI & ML interests

Building breatkthrough AI to solve the world's biggest problems.

Recent Activity

hamishivi new activity about 8 hours ago

allenai/IF_multi_constraints_upto5:verify tools ？

swabhs authored a paper 5 days ago

Annotation Artifacts in Natural Language Inference Data

swabhs authored a paper 5 days ago

We're Afraid Language Models Aren't Modeling Ambiguity

View all activity

Articles

Introducing the Open Chain of Thought Leaderboard

hamishivi

in allenai/IF_multi_constraints_upto5 about 8 hours ago

verify tools ？

#1 opened about 2 months ago by

saumyamalik

updated a Space about 23 hours ago

Reward Bench Leaderboard

Display and analyze reward model evaluation results

saumyamalik

updated a dataset about 23 hours ago

allenai/reward-bench-2-results

Preview • Updated about 23 hours ago • 206 • 2

baileyk

in allenai/olmOCR-7B-0225-preview 3 days ago

Workaround for getting bounding boxes for recognized textpieces

#25 opened 11 days ago by

swabhs

authored 16 papers 5 days ago

Annotation Artifacts in Natural Language Inference Data

Paper • 1803.02324 • Published Mar 6, 2018

We're Afraid Language Models Aren't Modeling Ambiguity

Paper • 2304.14399 • Published Apr 27, 2023

Don't Stop Pretraining: Adapt Language Models to Domains and Tasks

Paper • 2004.10964 • Published Apr 23, 2020

Dataset Cartography: Mapping and Diagnosing Datasets with Training Dynamics

Paper • 2009.10795 • Published Sep 22, 2020

COBRA Frames: Contextual Reasoning about Effects and Harms of Offensive Statements

Paper • 2306.01985 • Published Jun 3, 2023 • 1

Challenges in Automated Debiasing for Toxic Language Detection

Paper • 2102.00086 • Published Jan 29, 2021

MAUVE: Measuring the Gap Between Neural Text and Human Text using Divergence Frontiers

Paper • 2102.01454 • Published Feb 2, 2021

DExperts: Decoding-Time Controlled Text Generation with Experts and Anti-Experts

Paper • 2105.03023 • Published May 7, 2021

Understanding Dataset Difficulty with $\mathcal{V}$-Usable Information

Paper • 2110.08420 • Published Oct 16, 2021

Reframing Human-AI Collaboration for Generating Free-Text Explanations

Paper • 2112.08674 • Published Dec 16, 2021

WANLI: Worker and AI Collaboration for Natural Language Inference Dataset Creation

Paper • 2201.05955 • Published Jan 16, 2022

Logits of API-Protected LLMs Leak Proprietary Information

Paper • 2403.09539 • Published Mar 14, 2024 • 1

I2D2: Inductive Knowledge Distillation with NeuroLogic and Self-Imitation

Paper • 2212.09246 • Published Dec 19, 2022

MAUVE Scores for Generative Models: Theory and Practice

Paper • 2212.14578 • Published Dec 30, 2022

On the Trustworthiness of Generative Foundation Models: Guideline, Assessment, and Perspective

Paper • 2502.14296 • Published Feb 20 • 46

NeuroComparatives: Neuro-Symbolic Distillation of Comparative Knowledge

Paper • 2305.04978 • Published May 8, 2023