PromptWizard: Task-Aware Prompt Optimization Framework Paper • 2405.18369 • Published May 28, 2024 • 1
TinyHelen's First Curriculum: Training and Evaluating Tiny Language Models in a Simpler Language Environment Paper • 2501.00522 • Published Dec 31, 2024 • 2
TinyStories: How Small Can Language Models Be and Still Speak Coherent English? Paper • 2305.07759 • Published May 12, 2023 • 36
Tiny QA Benchmark++: Ultra-Lightweight, Synthetic Multilingual Dataset Generation & Smoke-Tests for Continuous LLM Evaluation Paper • 2505.12058 • Published May 17 • 6
Evaluation for Generative AI Collection Papers and resources that are dealing with the evaluation of large language models and generative AI. • 6 items • Updated May 20 • 1