Stop Words Removal on Textual Data Classification

dc.contributor.authorAro, Taye Oladele
dc.contributor.authorDada, Funmi
dc.contributor.authorBalogun, Abdullateef Oluwagbemiga
dc.contributor.authorOluwasogo, Samuel Ayodeji
dc.date.accessioned2019-05-22T13:53:41Z
dc.date.available2019-05-22T13:53:41Z
dc.date.issued2019-05
dc.description.abstractText data is highly voluminous and performing mining tasks on it can be daunting due to large memory usage, thus researchers have considered different techniques to reduce the data while still maintaining or increasing the level of accuracy. Stop word removal is one of the pre-processing techniques used in text data mining. This paper investigates the effect of stop words removal on the text data mining performance. The machine learning algorithms used are C4.5 Decision Tree and Multinomial Naïve Bayes (MNB) on two text datasets; Sentiment Analysis and SMS Spam dataset. Results revealed that the removal of stop words had no influence on the classification accuracy of text mining model, but actually reduced the level of confidence of the predictionen_US
dc.identifier.citationAro, T.O., Dada, F., Balogun, A.O. and Oluwasogo, S.A. (2019). Stop Words Removal on Textual Data Classification, International Journal of Information Processing and Communication (IJIPC), 7(1), 1-9.en_US
dc.identifier.issn2645-2960
dc.identifier.urihttp://hdl.handle.net/123456789/1991
dc.language.isoenen_US
dc.publisherFaculty of Communication and Information Sciences, University of Ilorin.en_US
dc.relation.ispartofseries7;1
dc.subjectSentiment Analysisen_US
dc.subjectStop Words Removalen_US
dc.subjectMachine Learningen_US
dc.subjectText miningen_US
dc.titleStop Words Removal on Textual Data Classificationen_US
dc.typeArticleen_US

Files

Original bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
document.pdf
Size:
1.06 MB
Format:
Adobe Portable Document Format
Description:
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.69 KB
Format:
Item-specific license agreed upon to submission
Description:

Collections