DCDBLGNIMar 22, 2022

BigBird: Big Data Storage and Analytics at Scale in Hybrid Cloud

arXiv:2203.11472v18 citationsh-index: 5
Originality Synthesis-oriented
AI Analysis

This provides a practical solution for organizations like Twitter migrating big data systems to hybrid cloud environments, though it is incremental as it adapts existing cloud services.

The paper tackles the challenge of moving Twitter's cold storage and analytics systems to the public cloud by designing a scalable framework using BigQuery on Google Cloud Platform, ensuring security and privacy while overcoming resource limitations.

Implementing big data storage at scale is a complex and arduous task that requires an advanced infrastructure. With the rise of public cloud computing, various big data management services can be readily leveraged. As a critical part of Twitter's "Project Partly Cloudy", the cold storage data and analytics systems are being moved to the public cloud. This paper showcases our approach in designing a scalable big data storage and analytics management framework using BigQuery in Google Cloud Platform while ensuring security, privacy, and data protection. The paper also discusses the limitations on the public cloud resources and how they can be effectively overcome when designing a big data storage and analytics solution at scale. Although the paper discusses the framework implementation in Google Cloud Platform, it can easily be applied to all major cloud providers.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes