On the reproducibility of software defect datasets
HN Zhu, C Rubio-González - 2023 IEEE/ACM 45th …, 2023 - ieeexplore.ieee.org
2023 IEEE/ACM 45th International Conference on Software …, 2023•ieeexplore.ieee.org
Software defect datasets are crucial to facilitating the evaluation and comparison of
techniques in fields such as fault localization, test generation, and automated program
repair. However, the reproducibility of software defect artifacts is not immune to breakage. In
this paper, we conduct a study on the reproducibility of software defect artifacts. First, we
study five state-of-the-art Java defect datasets. Despite the multiple strategies applied by
dataset maintainers to ensure reproducibility, all datasets are prone to breakages. Second …
techniques in fields such as fault localization, test generation, and automated program
repair. However, the reproducibility of software defect artifacts is not immune to breakage. In
this paper, we conduct a study on the reproducibility of software defect artifacts. First, we
study five state-of-the-art Java defect datasets. Despite the multiple strategies applied by
dataset maintainers to ensure reproducibility, all datasets are prone to breakages. Second …
Software defect datasets are crucial to facilitating the evaluation and comparison of techniques in fields such as fault localization, test generation, and automated program repair. However, the reproducibility of software defect artifacts is not immune to breakage. In this paper, we conduct a study on the reproducibility of software defect artifacts. First, we study five state-of-the-art Java defect datasets. Despite the multiple strategies applied by dataset maintainers to ensure reproducibility, all datasets are prone to breakages. Second, we conduct a case study in which we systematically test the reproducibility of 1,795 software artifacts during a 13-month period. We find that 62.6% of the artifacts break at least once, and 15.3% artifacts break multiple times. We manually investigate the root causes of breakages and handcraft 10 patches, which are automatically applied to 1,055 distinct artifacts in 2,948 fixes. Based on the nature of the root causes, we propose automated dependency caching and artifact isolation to prevent further breakage. In particular, we show that isolating artifacts to eliminate external dependencies increases reproducibility to 95% or higher, which is on par with the level of reproducibility exhibited by the most reliable manually curated dataset.
ieeexplore.ieee.org
Showing the best result for this search. See all results