Models
Datasets
Spaces
Docs
Enterprise
Pricing
Log In
Sign Up

Collections

Discover the best community collections!

Collections including paper arxiv:2305.14992

A collection of arXiv papers from Chip Huyen's AI Engineering organized by chapter and ordered by when each appears in the book.

Will we run out of data? An analysis of the limits of scaling datasets in Machine Learning

Paper • 2211.04325 • Published Oct 26, 2022 • 1
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

Paper • 1810.04805 • Published Oct 11, 2018 • 21
On the Opportunities and Risks of Foundation Models

Paper • 2108.07258 • Published Aug 16, 2021 • 1
Super-NaturalInstructions: Generalization via Declarative Instructions on 1600+ NLP Tasks

Paper • 2204.07705 • Published Apr 16, 2022 • 2

Can Retriever-Augmented Language Models Reason? The Blame Game Between the Retriever and the Language Model

Paper • 2212.09146 • Published Dec 18, 2022 • 3
RaLLe: A Framework for Developing and Evaluating Retrieval-Augmented Large Language Models

Paper • 2308.10633 • Published Aug 21, 2023 • 1
MemeCap: A Dataset for Captioning and Interpreting Memes

Paper • 2305.13703 • Published May 23, 2023
Contrastive Learning for Inference in Dialogue

Paper • 2310.12467 • Published Oct 19, 2023

JudgeLM: Fine-tuned Large Language Models are Scalable Judges

Paper • 2310.17631 • Published Oct 26, 2023 • 35
AgentTuning: Enabling Generalized Agent Abilities for LLMs

Paper • 2310.12823 • Published Oct 19, 2023 • 36
G-Eval: NLG Evaluation using GPT-4 with Better Human Alignment

Paper • 2303.16634 • Published Mar 29, 2023 • 3
GPT-4 Doesn't Know It's Wrong: An Analysis of Iterative Prompting for Reasoning Problems

Paper • 2310.12397 • Published Oct 19, 2023 • 1

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

Paper • 2501.04519 • Published Jan 8 • 284
URSA: Understanding and Verifying Chain-of-thought Reasoning in Multimodal Mathematics

Paper • 2501.04686 • Published Jan 8 • 54
Towards System 2 Reasoning in LLMs: Learning How to Think With Meta Chain-of-Though

Paper • 2501.04682 • Published Jan 8 • 99
Agent Laboratory: Using LLM Agents as Research Assistants

Paper • 2501.04227 • Published Jan 8 • 93

Graph Reasoning

Graph-enhanced Large Language Models in Asynchronous Plan Reasoning

Paper • 2402.02805 • Published Feb 5, 2024 • 1
Barack's Wife Hillary: Using Knowledge-Graphs for Fact-Aware Language Modeling

Paper • 1906.07241 • Published Jun 17, 2019 • 2
A Latent Space Theory for Emergent Abilities in Large Language Models

Paper • 2304.09960 • Published Apr 19, 2023 • 3
Reasoning on Graphs: Faithful and Interpretable Large Language Model Reasoning

Paper • 2310.01061 • Published Oct 2, 2023 • 2

A collection of arXiv papers from Chip Huyen's AI Engineering organized by chapter and ordered by when each appears in the book.

Will we run out of data? An analysis of the limits of scaling datasets in Machine Learning

Paper • 2211.04325 • Published Oct 26, 2022 • 1
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

Paper • 1810.04805 • Published Oct 11, 2018 • 21
On the Opportunities and Risks of Foundation Models

Paper • 2108.07258 • Published Aug 16, 2021 • 1
Super-NaturalInstructions: Generalization via Declarative Instructions on 1600+ NLP Tasks

Paper • 2204.07705 • Published Apr 16, 2022 • 2

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

Paper • 2501.04519 • Published Jan 8 • 284
URSA: Understanding and Verifying Chain-of-thought Reasoning in Multimodal Mathematics

Paper • 2501.04686 • Published Jan 8 • 54
Towards System 2 Reasoning in LLMs: Learning How to Think With Meta Chain-of-Though

Paper • 2501.04682 • Published Jan 8 • 99
Agent Laboratory: Using LLM Agents as Research Assistants

Paper • 2501.04227 • Published Jan 8 • 93

Can Retriever-Augmented Language Models Reason? The Blame Game Between the Retriever and the Language Model

Paper • 2212.09146 • Published Dec 18, 2022 • 3
RaLLe: A Framework for Developing and Evaluating Retrieval-Augmented Large Language Models

Paper • 2308.10633 • Published Aug 21, 2023 • 1
MemeCap: A Dataset for Captioning and Interpreting Memes

Paper • 2305.13703 • Published May 23, 2023
Contrastive Learning for Inference in Dialogue

Paper • 2310.12467 • Published Oct 19, 2023

Graph Reasoning

Graph-enhanced Large Language Models in Asynchronous Plan Reasoning

Paper • 2402.02805 • Published Feb 5, 2024 • 1
Barack's Wife Hillary: Using Knowledge-Graphs for Fact-Aware Language Modeling

Paper • 1906.07241 • Published Jun 17, 2019 • 2
A Latent Space Theory for Emergent Abilities in Large Language Models

Paper • 2304.09960 • Published Apr 19, 2023 • 3
Reasoning on Graphs: Faithful and Interpretable Large Language Model Reasoning

Paper • 2310.01061 • Published Oct 2, 2023 • 2

JudgeLM: Fine-tuned Large Language Models are Scalable Judges

Paper • 2310.17631 • Published Oct 26, 2023 • 35
AgentTuning: Enabling Generalized Agent Abilities for LLMs

Paper • 2310.12823 • Published Oct 19, 2023 • 36
G-Eval: NLG Evaluation using GPT-4 with Better Human Alignment

Paper • 2303.16634 • Published Mar 29, 2023 • 3
GPT-4 Doesn't Know It's Wrong: An Analysis of Iterative Prompting for Reasoning Problems

Paper • 2310.12397 • Published Oct 19, 2023 • 1

Company

TOS Privacy About Jobs

Website

Models Datasets Spaces Pricing Docs