PFDCLGDec 15, 2023

How Does It Function? Characterizing Long-term Trends in Production Serverless Workloads

arXiv:2312.10127v184 citationsh-index: 7Has CodeSoCC
Originality Synthesis-oriented
AI Analysis

This provides valuable data and insights for cloud providers and researchers working on serverless computing, though it is incremental as it focuses on characterization rather than new methods.

The paper tackles the problem of understanding long-term trends in production serverless workloads by analyzing two new Huawei cloud traces spanning over 7 months and 1.4 trillion invocations, finding considerable diversity in behavior such as request variations up to 9 orders of magnitude and strong periodicity in invocations.

This paper releases and analyzes two new Huawei cloud serverless traces. The traces span a period of over 7 months with over 1.4 trillion function invocations combined. The first trace is derived from Huawei's internal workloads and contains detailed per-second statistics for 200 functions running across multiple Huawei cloud data centers. The second trace is a representative workload from Huawei's public FaaS platform. This trace contains per-minute arrival rates for over 5000 functions running in a single Huawei data center. We present the internals of a production FaaS platform by characterizing resource consumption, cold-start times, programming languages used, periodicity, per-second versus per-minute burstiness, correlations, and popularity. Our findings show that there is considerable diversity in how serverless functions behave: requests vary by up to 9 orders of magnitude across functions, with some functions executed over 1 billion times per day; scheduling time, execution time and cold-start distributions vary across 2 to 4 orders of magnitude and have very long tails; and function invocation counts demonstrate strong periodicity for many individual functions and on an aggregate level. Our analysis also highlights the need for further research in estimating resource reservations and time-series prediction to account for the huge diversity in how serverless functions behave. Datasets and code available at https://github.com/sir-lab/data-release

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes