EVALUATING THE EFFECT OF DATASET SIZE ON PREDICTIVE MODEL USING SUPERVISED LEARNING TECHNIQUE

dc.contributor.authorAjiboye, A.R.
dc.contributor.authorAbdullah-Arshah, R.
dc.contributor.authorHongwu, Q.
dc.date.accessioned2018-11-30T14:37:21Z
dc.date.available2018-11-30T14:37:21Z
dc.date.issued2015-02
dc.descriptionArticleen_US
dc.description.abstractLearning models used for prediction purposes are mostly developed without paying much cognizance to the size of datasets that can produce models of high accuracy and better generalization. Although, the general believe is that, large dataset is needed to construct a predictive learning model. To describe a data set as large in size, perhaps, is circumstance dependent, thus, what constitutes a dataset to be considered as being big or small is vague. In this paper, the ability of the predictive model to generalize with respect to a particular size of data when simulated with new untrained input is examined. The study experiments on three different sizes of data using Matlab program to create predictive models with a view to establishing if the size of data has any effect on the accuracy of a model. The simulated output of each model is measured using the Mean Absolute Error (MAE) and comparisons are made. Findings from this study reveals that, the quantity of data partitioned for the purpose of training must be of good representation of the entire sets and sufficient enough to span through the input space. The results of simulating the three network models also shows that, the learning model with the largest size of training sets appears to be the most accurate and consistently delivers a much better and stable results.en_US
dc.identifier.citationInternational Journal of Software Engineering & Computer Sciencesen_US
dc.identifier.issn2289-8522
dc.identifier.urihttp://hdl.handle.net/123456789/1306
dc.language.isoenen_US
dc.publisherUniversiti Malaysia Pahangen_US
dc.relation.ispartofseries;Volume 1
dc.subjectNeural Networken_US
dc.subjectPredictionen_US
dc.subjectSupervised Learningen_US
dc.subjectData miningen_US
dc.subjectData sizeen_US
dc.titleEVALUATING THE EFFECT OF DATASET SIZE ON PREDICTIVE MODEL USING SUPERVISED LEARNING TECHNIQUEen_US
dc.typeArticleen_US

Files

Original bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
IJSECS vol 1_file6.pdf
Size:
369.84 KB
Format:
Adobe Portable Document Format
Description:
Article
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.69 KB
Format:
Item-specific license agreed upon to submission
Description:

Collections