Declarative Data Analytics: a Survey
This is an incremental survey for researchers and practitioners in data science and machine learning.
The survey examines declarative data analytics frameworks, exploring their programming models and optimization techniques to assess the current state of the art and identify open challenges.
The area of declarative data analytics explores the application of the declarative paradigm to data science and machine learning. It proposes declarative languages for expressing data analysis tasks and develops systems that optimize programs written in those languages. The execution engine can be either centralized or distributed, since the declarative paradigm advocates independence from any particular physical implementation. The survey explores a wide range of declarative data analysis frameworks, examining both their programming models and the optimization techniques they use, to draw conclusions about the current state of the art in the area and to identify open challenges.