SE · DB · May 5, 2020

Role of Apache Software Foundation in Big Data Projects

arXiv:2005.02829v1 · 6 citations · Has Code
AI Analysis

This report offers insights for developers and organizations that use open-source tools in Big Data, but its contribution is incremental: it primarily reviews existing projects without introducing new methods or data.

The report analyzes the Apache Software Foundation's Big Data projects by category and finds that, while many are autonomous, some are built on other Apache projects or work alongside them to ease development.

As the amount of Big Data generated each year has grown, the tools and technologies developed and used for storing, processing, and analyzing Big Data have also improved. Open-source software has been an important factor in the success and innovation in the field of Big Data, and the Apache Software Foundation (ASF) has played a crucial role in this by providing a number of state-of-the-art projects, free and open to the public. The ASF classifies its projects into different categories. In this report, the projects listed under the Big Data category are analyzed and discussed in depth, each with reference to one of the seven sub-categories defined. Our investigation shows that many of the Apache Big Data projects are autonomous, but some are built on other Apache projects and some work in conjunction with other projects to improve and ease development in the Big Data space.

