Multiclass Feature Selection and Classification with Support Vector Machine in Genomic Study

dc.contributor.authorBanjoko, A. W.
dc.contributor.authorYahya, W. B.
dc.contributor.authorGarba, M. K.
dc.contributor.authorOlaniran, O. R.
dc.contributor.authorAmusa, L. B.
dc.contributor.authorGatta, N. F.
dc.contributor.authorDauda, K. A.
dc.contributor.authorOlorede, K. O.
dc.date.accessioned2019-11-27T12:02:23Z
dc.date.available2019-11-27T12:02:23Z
dc.date.issued2017
dc.description.abstractThis study proposes an efficient Support Vector Machine (SVM) algorithm for feature selection and classification of multiclass response group in high dimensional (microarray) data. The Feature selection stage of the algorithm employed the F-statistic of the ANOVA–like testing scheme at some chosen family-wise-error-rate (FWER) to control for the detection of some false positive features. In a 10-fold cross validation, the hyper-parameters of the SVM were tuned to determine the appropriate kernel using one-versus-all approach. The entire simulated dataset was randomly partitioned into 95% training and 5% test sets with the SVM classifier built on the training sets while its prediction accuracy on the response class was assessed on the test sets over 1000 Monte-Carlo cross-validation (MCCV) runs. The classification results of the proposed classifier were assessed using the Misclassification Error Rates (MERs) and other performance indices. Results from the Monte-Carlo study showed that the proposed SVM classifier was quite efficient by yielding high prediction accuracy of the response groups with fewer differentially expressed features than when all the features were employed for classification. The performance of this new method on some published cancer data sets shall be examined vis-à-vis other state-of-the-earth machine learning methods in future works.en_US
dc.identifier.urihttp://hdl.handle.net/123456789/3482
dc.language.isoenen_US
dc.publisherEdited Conference Proceedings of the 1st International Conference of the Nigeria Statistical Society (NSS).en_US
dc.subjectSupport Vector Machinesen_US
dc.subjectMonte-Carlo CrossValidationen_US
dc.subjectF-Statisticen_US
dc.subjectFamily wise error rateen_US
dc.subjectMisclassification Error Rateen_US
dc.titleMulticlass Feature Selection and Classification with Support Vector Machine in Genomic Studyen_US
dc.typeArticleen_US

Files

Original bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
Banjoko et al 2.pdf
Size:
802 KB
Format:
Adobe Portable Document Format
Description:
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.69 KB
Format:
Item-specific license agreed upon to submission
Description:

Collections