Nov. 14, 2022, 2:15 a.m. | Elena Soare, Iain Mackie, Jeffrey Dalton

cs.CL updates on arXiv.org arxiv.org

Current SQL generators based on pre-trained language models struggle to
answer complex questions requiring domain context or understanding fine-grained
table structure. Humans would deal with these unknowns by reasoning over the
documentation of the tables. Based on this hypothesis, we propose DocuT5, which
uses off-the-shelf language model architecture and injects knowledge from
external `documentation' to improve domain generalization. We perform
experiments on the Spider family of datasets that contain complex questions
that are cross-domain and multi-table. Specifically, we develop a …

arxiv documentation seq2seq sql table

Founding AI Engineer, Agents

@ Occam AI | New York

AI Engineer Intern, Agents

@ Occam AI | US

AI Research Scientist

@ Vara | Berlin, Germany and Remote

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Data Scientist (Database Development)

@ Nasdaq | Bengaluru-Affluence