SEJun 7, 2021

The CESAW dataset: a conversation

arXiv:2106.03679v1
Originality Synthesis-oriented
AI Analysis

This work addresses software development efficiency by providing insights into task estimation, but it is incremental as it builds on existing datasets and methods.

The study analyzed 61,817 tasks from 45 software projects to investigate task effort estimation accuracy, finding that factors like the estimator, project type, and use of round numbers influenced accuracy.

An analysis of the 61,817 tasks performed by developers working on 45 projects, implemented using Team Software Process, is documented via a conversation between a data analyst and the person who collected, compiled, and originally analyzed the data. Five projects were safety critical, containing a total of 28,899 tasks. Projects were broken down using a Work Breakdown Structure to create a hierarchical organization, with tasks at the leaf nodes. The WBS information enables task organization within a project to be investigated, e.g., how related tasks are sequenced together. Task data includes: kind of task, anonymous developer id, start/end time/date, as well as interruption and break times; a total of 203,621 time facts. Task effort estimation accuracy was found to be influenced by factors such as the person making the estimate, the project involved, and the propensity to use round numbers.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes