Column2Vec: Structural Understanding via Distributed Representations of Database Schemas
This addresses a domain-specific problem for database management and schema understanding, with incremental improvements in automated naming tasks.
The paper tackled the problem of generating appropriate names for columns in unnamed database tables by using distributed representations of column metadata, achieving viability on schema information from open source GitHub applications.
We present Column2Vec, a distributed representation of database columns based on column metadata. Our distributed representation has several applications. Using known names for groups of columns (i.e., a table name), we train a model to generate an appropriate name for columns in an unnamed table. We demonstrate the viability of our approach using schema information collected from open source applications on GitHub.