arxiv:2409.12172

You Only Read Once (YORO): Learning to Internalize Database Knowledge for Text-to-SQL

Published on Sep 18, 2024

Abstract

You Only Read Once (YORO) internalizes database knowledge during training, reducing schema encoding in text-to-SQL and improving performance on benchmarks and large databases.

AI-generated summary

While significant progress has been made on the text-to-SQL task, recent solutions repeatedly encode the same database schema for every question, resulting in unnecessarily high inference cost and often overlooking crucial database knowledge. To address these issues, we propose You Only Read Once (YORO), a novel paradigm that directly internalizes database knowledge into the parametric knowledge of a text-to-SQL model during training and eliminates the need for schema encoding during inference. YORO significantly reduces the input token length by 66%-98%. Despite its shorter inputs, our empirical results demonstrate that YORO performs competitively with traditional systems on three benchmarks and significantly outperforms them on large databases. Furthermore, YORO excels at handling questions with challenging value retrievals, such as abbreviations.
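To make the input-length argument concrete, here is a minimal sketch that contrasts a prompt carrying a serialized schema with a schema-free prompt containing only the question. It is an illustration under stated assumptions, not the authors' implementation: the schema, the prompt templates, and the helper names (`serialize_schema`, `schema_prompt`, `schema_free_prompt`, `count_tokens`, `reduction`) are all hypothetical, and whitespace splitting stands in for a real tokenizer.

```python
# Hypothetical schema, invented for illustration only (not taken from the paper).
SCHEMA = {
    "singer": ["singer_id", "name", "country", "age"],
    "concert": ["concert_id", "concert_name", "stadium_id", "year"],
    "singer_in_concert": ["concert_id", "singer_id"],
}


def serialize_schema(schema):
    """Flatten a {table: [columns]} mapping into a single line of text."""
    return " | ".join(
        f"{table}: {', '.join(columns)}" for table, columns in schema.items()
    )


def schema_prompt(question, schema):
    """Assumed conventional prompt: the schema is re-encoded for every question."""
    return f"Schema: {serialize_schema(schema)}\nQuestion: {question}\nSQL:"


def schema_free_prompt(question):
    """Assumed schema-free prompt: the model is expected to already know the database."""
    return f"Question: {question}\nSQL:"


def count_tokens(text):
    """Crude proxy: whitespace-separated pieces, not a real subword tokenizer."""
    return len(text.split())


def reduction(long_prompt, short_prompt):
    """Fraction of input tokens saved by using the shorter prompt."""
    return 1 - count_tokens(short_prompt) / count_tokens(long_prompt)


if __name__ == "__main__":
    question = "How many singers are from France?"
    long_prompt = schema_prompt(question, SCHEMA)
    short_prompt = schema_free_prompt(question)
    print(f"with schema:    {count_tokens(long_prompt)} tokens")
    print(f"without schema: {count_tokens(short_prompt)} tokens")
    print(f"reduction:      {reduction(long_prompt, short_prompt):.0%}")
```

The saving grows with schema size, since the schema portion of the conventional prompt scales with the number of tables and columns while the schema-free prompt stays the length of the question. The numbers this toy prints depend entirely on the made-up schema and the whitespace proxy, so they should not be read as reproducing the reported 66%-98% range.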

Community

Paper author

Hi everyone, the paper has been published at NAACL 2025 (https://aclanthology.org/2025.naacl-long.94/).
