Performance Analysis of Selected Clustering Techniques for Software Defects Prediction

Balogun, Abdullateef; Oladele, Rufus; Mojeed, Hammed; Amin-Balogun, Barakat; Adeyemo, Victor Elijah; Aro, Taye Olalere

dc.contributor.author	Balogun, Abdullateef
dc.contributor.author	Oladele, Rufus
dc.contributor.author	Mojeed, Hammed
dc.contributor.author	Amin-Balogun, Barakat
dc.contributor.author	Adeyemo, Victor Elijah
dc.contributor.author	Aro, Taye Olalere
dc.date.accessioned	2019-06-18T13:02:22Z
dc.date.available	2019-06-18T13:02:22Z
dc.date.issued	2019-06-01
dc.description.abstract	Classification algorithms that help predict software defects play a major role in the software engineering process. This study investigated the application and performance of clustering techniques in software defect prediction (SDP). Seven clustering techniques (Farthest First Clusterer, K-Means, X-Means, Sequential Information Bottleneck, Hierarchical Clusterer, Make-Density Clusterer, and Expectation Maximization) were used to classify 8 software defect datasets from the NASA repository. Experimental results showed that using clustering techniques for classification is well founded, as it gave good predictive performance. Based on average accuracy across the 8 datasets, Farthest First performed best at 86.16%, followed by Hierarchical clustering at 85.50% and K-Means at 72.33%. Expectation Maximization (EM) (33.52%) and X-Means (48.84%) gave rather poor results, while Sequential Information Bottleneck (SIB) (63%) and density-based clustering (71.08%) had average performance. In addition, a comparison of classification via clustering with selected standard classification techniques, namely k-Nearest Neighbor (kNN), Naïve Bayes (NB), and Decision Tree (DT), showed that some classification via clustering techniques (Farthest First and Hierarchical clustering) performed considerably well and outperformed some standard classification algorithms. Classification via clustering can therefore be considered an alternative to standard classification methods in SDP. It produced good and competitive predictive performance, with the advantage of not necessarily requiring a trained predictive model or annotated datasets. Consequently, SDP models developed using classification via clustering can be transferred from one project to another, as no model training is involved. This will help reduce and manage the resources used during the software development process.	en_US
dc.identifier.citation	Abdullateef O. Balogun, Rufus O. Oladele, Hammed A. Mojeed, Barakat Amin-Balogun, Victor E. Adeyemo and Taye O. Aro (2019), Performance Analysis of Selected Clustering Techniques for Software Defects Prediction, Afr. J. Comp. & ICT, Vol.12, No. 2, pp. 30 - 42.	en_US
dc.identifier.issn	2006-1781
dc.identifier.uri	http://hdl.handle.net/123456789/2208
dc.language.iso	en	en_US
dc.publisher	IEEE Nigeria Chapter	en_US
dc.relation.ispartofseries	12;2
dc.subject	Clustering Technique	en_US
dc.subject	Classification Technique	en_US
dc.subject	Software Defects Prediction	en_US
dc.subject	Software Engineering	en_US
dc.title	Performance Analysis of Selected Clustering Techniques for Software Defects Prediction	en_US
dc.type	Article	en_US
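
The abstract's idea of classification via clustering can be illustrated with a minimal sketch. It is not the authors' experimental setup: the data are made-up toy values, the first center is fixed to the first point for reproducibility, and mapping each cluster to its majority label is an assumption about how clusters become class predictions (this step does use labels). All function names are hypothetical.

```python
from collections import Counter
from math import dist


def farthest_first_centers(points, k):
    """Pick k centers: start at points[0] (an assumption made here for
    determinism), then repeatedly add the point farthest from its
    nearest already-chosen center."""
    centers = [points[0]]
    while len(centers) < k:
        nxt = max(points, key=lambda p: min(dist(p, c) for c in centers))
        centers.append(nxt)
    return centers


def assign(points, centers):
    """Return the index of the nearest center for every point."""
    return [min(range(len(centers)), key=lambda i: dist(p, centers[i]))
            for p in points]


def cluster_to_class(assignments, labels):
    """Map each cluster index to the majority label among its members."""
    votes = {}
    for cluster, label in zip(assignments, labels):
        votes.setdefault(cluster, Counter())[label] += 1
    return {cluster: counts.most_common(1)[0][0]
            for cluster, counts in votes.items()}


def accuracy(predicted, actual):
    """Fraction of predictions that match the actual labels."""
    return sum(p == a for p, a in zip(predicted, actual)) / len(actual)


# Toy, made-up module metrics (two numeric features); not NASA data.
points = [(1.0, 1.0), (1.2, 0.9), (0.8, 1.1),
          (5.0, 5.2), (5.1, 4.9), (4.8, 5.0)]
labels = ["clean", "clean", "clean", "defective", "defective", "clean"]

centers = farthest_first_centers(points, k=2)
assignments = assign(points, centers)
mapping = cluster_to_class(assignments, labels)
predicted = [mapping[a] for a in assignments]
score = accuracy(predicted, labels)
```

On this toy data the two clusters separate cleanly and one module is misclassified, so `score` is 5/6. Averaging such per-dataset accuracies is how a figure like the per-technique averages quoted in the abstract would be formed.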

Files

Original bundle

Name: AfrjICT Paper.pdf
Size: 1.04 MB
Format: Adobe Portable Document Format
Description: Main article

License bundle

Name: license.txt
Size: 1.69 KB
Format: Item-specific license agreed upon to submission
