DB · LG · Mar 20, 2019

Column2Vec: Structural Understanding via Distributed Representations of Database Schemas

arXiv:1903.08621v1 · 2 citations · Has Code
Originality: Synthesis-oriented
AI Analysis

This paper addresses a domain-specific problem in database management and schema understanding, offering incremental improvements in automated naming tasks.

The paper tackles the problem of generating appropriate names for columns in unnamed database tables using distributed representations of column metadata, and demonstrates the viability of the approach on schema information collected from open source applications on GitHub.

We present Column2Vec, a distributed representation of database columns based on column metadata. Our distributed representation has several applications. Using known names for groups of columns (i.e., a table name), we train a model to generate an appropriate name for columns in an unnamed table. We demonstrate the viability of our approach using schema information collected from open source applications on GitHub.
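To make the naming task in the abstract concrete, here is a minimal sketch of the general idea, not the paper's actual model. It assumes column metadata consists only of a column name and a SQL type, and it swaps in a sparse bag-of-metadata-features vector with nearest-neighbour lookup in place of learned distributed representations. All function names and the sample schemas are hypothetical.

```python
import math
from collections import Counter

def column_features(name, sql_type):
    """Turn one column's metadata into a sparse feature vector.

    Assumption: metadata is just (name, type); real schemas carry more
    (nullability, keys, defaults) that a learned model could use.
    """
    tokens = [t for t in name.lower().split("_") if t]
    features = Counter(f"tok={t}" for t in tokens)
    features[f"type={sql_type.lower()}"] += 1
    return features

def table_vector(columns):
    """Represent a group of columns as the sum of its column vectors."""
    total = Counter()
    for name, sql_type in columns:
        total.update(column_features(name, sql_type))
    return total

def cosine(a, b):
    """Cosine similarity between two sparse vectors."""
    dot = sum(count * b[key] for key, count in a.items() if key in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0

def suggest_table_name(unnamed_columns, named_tables):
    """Pick the known table name whose columns look most similar."""
    query = table_vector(unnamed_columns)
    return max(
        named_tables,
        key=lambda name: cosine(query, table_vector(named_tables[name])),
    )

# Hypothetical schemas standing in for tables collected from GitHub projects.
NAMED_TABLES = {
    "users": [("user_id", "int"), ("email", "varchar"), ("password_hash", "varchar")],
    "orders": [("order_id", "int"), ("total_price", "decimal"), ("shipped_at", "timestamp")],
}
UNNAMED = [("email", "varchar"), ("password_hash", "varchar"), ("created_at", "timestamp")]

suggestion = suggest_table_name(UNNAMED, NAMED_TABLES)
```

In this toy setup the unnamed group shares most of its name tokens and types with `users`, so that name is suggested. A learned embedding would presumably generalise beyond exact token overlap, which this stand-in cannot do.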

