pith. machine review for the scientific record.

arxiv: 1804.09769 · v1 · submitted 2018-04-25 · 💻 cs.CL

Recognition: unknown

TypeSQL: Knowledge-based Type-Aware Neural Text-to-SQL Generation

keywords typesql · users · databases · language · natural · queries · questions · absolute

Interacting with relational databases through natural language helps users of any background easily query and analyze a vast amount of data. This requires a system that understands users' questions and converts them to SQL queries automatically. In this paper we present a novel approach, TypeSQL, which views this problem as a slot filling task. Additionally, TypeSQL utilizes type information to better understand rare entities and numbers in natural language questions. We test this idea on the WikiSQL dataset and outperform the prior state-of-the-art by 5.5% in much less time. We also show that accessing the content of databases can significantly improve the performance when users' queries are not well-formed. TypeSQL gets 82.6% accuracy, a 17.5% absolute improvement compared to the previous content-sensitive model.
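To make the slot-filling framing concrete, the following is a minimal illustrative sketch in Python, not the authors' implementation. It assumes a fixed query template of the kind suited to simple single-table questions (one selected column, an optional aggregate, and a list of conditions), and it stands in for the type-awareness idea with a toy tagger that labels question tokens as column mentions or integers. The names `SqlSlots`, `Condition`, `render`, and `tag_types`, the type labels, and the example table are all hypothetical; in the paper the slot values are predicted by a neural model rather than filled in by hand.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Condition:
    """One WHERE-clause slot triple: column, operator, value (hypothetical)."""
    column: str
    op: str
    value: str


@dataclass
class SqlSlots:
    """Slots of a fixed query template (an assumed, simplified sketch)."""
    select_col: str
    agg: str = ""  # empty string means no aggregate
    conditions: List[Condition] = field(default_factory=list)


def render(slots: SqlSlots, table: str) -> str:
    """Turn filled slots into a SQL string (values quoted naively, no escaping)."""
    target = f"{slots.agg}({slots.select_col})" if slots.agg else slots.select_col
    sql = f"SELECT {target} FROM {table}"
    if slots.conditions:
        where = " AND ".join(
            f"{c.column} {c.op} '{c.value}'" for c in slots.conditions
        )
        sql += f" WHERE {where}"
    return sql


def tag_types(tokens: List[str], column_names: List[str]) -> List[str]:
    """Toy type tagger: marks column mentions and integers, NONE otherwise."""
    columns = {c.lower() for c in column_names}
    tags = []
    for tok in tokens:
        if tok.lower() in columns:
            tags.append("COLUMN")
        elif tok.isdigit():
            tags.append("INTEGER")
        else:
            tags.append("NONE")
    return tags


if __name__ == "__main__":
    question = ["which", "player", "in", "2010"]
    print(tag_types(question, ["Player", "Year"]))
    filled = SqlSlots(select_col="player",
                      conditions=[Condition("year", "=", "2010")])
    print(render(filled, "draft"))
```

In this framing, generating a query reduces to a handful of classification decisions (which column, which aggregate, which operator, which value span), and the type tags give the model an extra signal for tokens such as rare entity names or numbers that word embeddings alone may represent poorly.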

This paper has not been read by Pith yet.


Forward citations

Cited by 2 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Natural Language Interfaces for Spatial and Temporal Databases: A Comprehensive Overview of Methods, Taxonomy, and Future Directions

    cs.DB 2026-03 unverdicted novelty 6.0

    A literature survey that taxonomizes methods, datasets, and evaluation practices for natural language interfaces to geospatial and temporal databases while identifying recurring trends and future directions.

  2. Extending Confidence-Based Text2Cypher with Grammar and Schema Aware Filtering

    cs.CL 2026-05 unverdicted novelty 5.0

    Post-generation grammar and schema filtering on top of confidence scoring raises syntactic validity and execution success for Text2Cypher but increases empty outputs and lowers coverage.