Cluster Analysis of Data Points using Partitioning and Probabilistic Model-based Algorithms

No Thumbnail Available

Date

2014-08

Journal Title

Journal ISSN

Volume Title

Publisher

Foundation of Computer Science

Abstract

Exploring the dataset features through the application of clustering algorithms is a viable means by which the conceptual description of such data can be revealed for better understanding, grouping and decision making. Some clustering algorithms, especially those that are partitioned-based, clusters any data presented to them even if similar features do not present. This study explores the performance accuracies of partitioning-based algorithms and probabilistic model-based algorithm. Experiments were conducted using k-means, k-medoids and EM-algorithm. The study implements each algorithm using RapidMiner Software and the results generated was validated for correctness in accordance to the concept of external criteria method. The clusters formed revealed the capability and drawbacks of each algorithm on the data points.

Description

Article

Keywords

Clustering, Algorithm, K-means, EM-clustering,, K-medoids

Citation

International Journal of Applied Information Systems

Collections