The impact of using biased performance metrics on software defect prediction research

Abstract

Software engineering researchers have undertaken many experiments investigating the potential of software defect prediction algorithms. Unfortunately, some widely used performance metrics are known to be problematic, most notably F1, but F1 is nevertheless widely used. This study investigates the impact of using F1 on the validity of this large body of research. We undertook a systematic review to locate relevant experiments and then extracted all pairwise comparisons of defect prediction performance using F1 and the unbiased Matthews correlation coefficient (MCC). We found a total of 38 primary studies. These contain 12,471 pairs of results. Of these comparisons, 21.95% changed direction when the MCC metric was used instead of the biased F1 metric. Unfortunately, we also found evidence suggesting that F1 remains widely used in software defect prediction research. We reiterate the concerns of statisticians that F1 is a problematic metric outside an information retrieval context, since we are concerned about both classes (defect-prone and not defect-prone units). This inappropriate usage has led to a substantial number (more than one fifth) of erroneous (in terms of direction) results. Therefore we urge researchers to (i) use an unbiased metric and (ii) publish detailed results, including confusion matrices, such that alternative analyses become possible.
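To make the idea of a comparison "changing direction" concrete, the following is a minimal sketch, not the authors' actual analysis pipeline. It assumes each result is reported as a confusion matrix with the defect-prone class as positive. The names `ConfusionMatrix`, `f1`, `mcc`, `direction` and `direction_changes` are hypothetical, and the counts are invented purely for illustration.

```python
from dataclasses import dataclass
from math import sqrt


@dataclass(frozen=True)
class ConfusionMatrix:
    """Counts for a binary classifier; 'positive' means defect-prone."""
    tp: int
    fp: int
    fn: int
    tn: int


def f1(cm: ConfusionMatrix) -> float:
    # F1 = 2TP / (2TP + FP + FN); note that TN never appears.
    denom = 2 * cm.tp + cm.fp + cm.fn
    return 2 * cm.tp / denom if denom else 0.0


def mcc(cm: ConfusionMatrix) -> float:
    # Matthews correlation coefficient uses all four cells.
    denom = sqrt((cm.tp + cm.fp) * (cm.tp + cm.fn)
                 * (cm.tn + cm.fp) * (cm.tn + cm.fn))
    # Assumed convention: return 0.0 when a marginal total is zero.
    return (cm.tp * cm.tn - cm.fp * cm.fn) / denom if denom else 0.0


def direction(a: float, b: float) -> int:
    """+1 if a beats b, -1 if b beats a, 0 for a tie."""
    return (a > b) - (a < b)


def direction_changes(a: ConfusionMatrix, b: ConfusionMatrix) -> bool:
    """True when F1 and MCC disagree about which classifier is better."""
    return direction(f1(a), f1(b)) != direction(mcc(a), mcc(b))


# Invented example: 100 units, 90 of them defect-prone.
clf_a = ConfusionMatrix(tp=89, fp=9, fn=1, tn=1)   # labels almost everything defect-prone
clf_b = ConfusionMatrix(tp=80, fp=1, fn=10, tn=9)  # also identifies the clean units

print(f"F1 : A={f1(clf_a):.3f}  B={f1(clf_b):.3f}")
print(f"MCC: A={mcc(clf_a):.3f}  B={mcc(clf_b):.3f}")
print("direction changes:", direction_changes(clf_a, clf_b))
```

In this invented pair, F1 ranks classifier A higher while MCC ranks classifier B higher, because F1 ignores the true negatives. The sketch also suggests why the abstract asks for published confusion matrices: with the four counts available, any alternative metric can be recomputed after the fact.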

Journal

Journal title: Information & Software Technology

Year: 2021

ISSN: 0950-5849, 1873-6025

DOI: https://doi.org/10.1016/j.infsof.2021.106664