Measuring Spreadsheet Formula Understandability
This addresses the challenge for spreadsheet users and professionals in evaluating and maintaining business-critical spreadsheets over long lifespans, representing an incremental improvement in domain-specific tooling.
The paper tackled the problem of assessing spreadsheet understandability by establishing metrics based on professional interviews and experiments, finding that the number of ranges, nesting depth, and conditional operations in formulas significantly increase difficulty, with concrete correlations observed in user performance.
Spreadsheets are widely used in industry, because they are flexible and easy to use. Sometimes they are even used for business-critical applications. It is however difficult for spreadsheet users to correctly assess the quality of spreadsheets, especially with respect to their understandability. Understandability of spreadsheets is important, since spreadsheets often have a long lifespan, during which they are used by several users. In this paper, we establish a set of spreadsheet understandability metrics. We start by studying related work and interviewing 40 spreadsheet professionals to obtain a set of characteristics that might contribute to understandability problems in spreadsheets. Based on those characteristics we subsequently determine a number of understandability metrics. To evaluate the usefulness of our metrics, we conducted a series of experiments in which professional spreadsheet users performed a number of small maintenance tasks on a set of spreadsheets from the EUSES spreadsheet corpus. We subsequently calculate the correlation between the metrics and the performance of subjects on these tasks. The results clearly indicate that the number of ranges, the nesting depth and the presence of conditional operations in formulas significantly increase the difficulty of understanding a spreadsheet.