A Survey on the Evolution of Stream Processing Systems
It synthesizes existing knowledge for researchers and practitioners in stream processing, but is incremental as it does not introduce new methods or data.
This survey provides a comprehensive overview of the evolution of stream processing systems, covering fundamental aspects and reviewing past research findings to outline similarities and differences between early and modern systems.
Stream processing has been an active research field for more than 20 years, but it is now witnessing its prime time due to recent successful efforts by the research community and numerous worldwide open-source communities. This survey provides a comprehensive overview of fundamental aspects of stream processing systems and their evolution in the functional areas of out-of-order data management, state management, fault tolerance, high availability, load management, elasticity, and reconfiguration. We review noteworthy past research findings, outline the similarities and differences between early ('00-'10) and modern ('11-'22) streaming systems, and discuss recent trends and open problems.