Don't Stop Pretraining: Adapt Language Models to Domains and Tasks Paper • 2004.10964 • Published Apr 23, 2020
Dataset Cartography: Mapping and Diagnosing Datasets with Training Dynamics Paper • 2009.10795 • Published Sep 22, 2020
COBRA Frames: Contextual Reasoning about Effects and Harms of Offensive Statements Paper • 2306.01985 • Published Jun 3, 2023 • 1
Challenges in Automated Debiasing for Toxic Language Detection Paper • 2102.00086 • Published Jan 29, 2021
MAUVE: Measuring the Gap Between Neural Text and Human Text using Divergence Frontiers Paper • 2102.01454 • Published Feb 2, 2021
DExperts: Decoding-Time Controlled Text Generation with Experts and Anti-Experts Paper • 2105.03023 • Published May 7, 2021
Understanding Dataset Difficulty with $\mathcal{V}$-Usable Information Paper • 2110.08420 • Published Oct 16, 2021
Reframing Human-AI Collaboration for Generating Free-Text Explanations Paper • 2112.08674 • Published Dec 16, 2021
WANLI: Worker and AI Collaboration for Natural Language Inference Dataset Creation Paper • 2201.05955 • Published Jan 16, 2022
Logits of API-Protected LLMs Leak Proprietary Information Paper • 2403.09539 • Published Mar 14, 2024 • 1
I2D2: Inductive Knowledge Distillation with NeuroLogic and Self-Imitation Paper • 2212.09246 • Published Dec 19, 2022
On the Trustworthiness of Generative Foundation Models: Guideline, Assessment, and Perspective Paper • 2502.14296 • Published Feb 20 • 46
NeuroComparatives: Neuro-Symbolic Distillation of Comparative Knowledge Paper • 2305.04978 • Published May 8, 2023