DBLGAug 10, 2025

Synthesize, Retrieve, and Propagate: A Unified Predictive Modeling Framework for Relational Databases

arXiv:2508.08327v1h-index: 10Has Code
Originality Incremental advance
AI Analysis

This addresses the problem of limited deep learning integration in relational databases for data scientists and industries, representing an incremental advancement over prior methods.

The paper tackles the challenge of applying deep learning to relational databases by proposing SRP, a framework that synthesizes features, retrieves related information, and propagates messages to capture unary and composite dependencies, achieving improved predictive performance on five real-world datasets.

Relational databases (RDBs) have become the industry standard for storing massive and heterogeneous data. However, despite the widespread use of RDBs across various fields, the inherent structure of relational databases hinders their ability to benefit from flourishing deep learning methods. Previous research has primarily focused on exploiting the unary dependency among multiple tables in a relational database using the primary key - foreign key relationships, either joining multiple tables into a single table or constructing a graph among them, which leaves the implicit composite relations among different tables and a substantial potential of improvement for predictive modeling unexplored. In this paper, we propose SRP, a unified predictive modeling framework that synthesizes features using the unary dependency, retrieves related information to capture the composite dependency, and propagates messages across a constructed graph to learn adjacent patterns for prediction on relation databases. By introducing a new retrieval mechanism into RDB, SRP is designed to fully capture both the unary and the composite dependencies within a relational database, thereby enhancing the receptive field of tabular data prediction. In addition, we conduct a comprehensive analysis on the components of SRP, offering a nuanced understanding of model behaviors and practical guidelines for future applications. Extensive experiments on five real-world datasets demonstrate the effectiveness of SRP and its potential applicability in industrial scenarios. The code is released at https://github.com/NingLi670/SRP.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes