Multi-objective scheduling of MapReduce jobs in big data processing

Hashem, I. A. T.; Anuar, N. B.; Marjani, M.; Gani, A.; Sangaiah, A. K.; Adewole, K. S.

Files

Paper on Multiobjective MapReduce_Page_03.jpg (206.92 KB)

Date

2017

Publisher

Multimedia Tools and Applications

Abstract

Data generation has increased drastically over the past few years due to the rapid development of Internet-based technologies. This period has been called the big data era. Big data offer an emerging paradigm shift in data exploration and utilization. The MapReduce computational paradigm is a well-known framework and is considered the main enabler for the distributed and scalable processing of a large amount of data. However, despite recent efforts toward improving the performance of MapReduce, scheduling MapReduce jobs across multiple nodes has been considered a multi-objective optimization problem. This problem can become increasingly complex when virtualized clusters in cloud computing are used to execute a large number of tasks. This study aims to optimize MapReduce job scheduling based on the completion time and cost of cloud service models. First, the problem is formulated as a multi-objective model. The model consists of two objective functions, namely, (i) completion time and (ii) cost minimization. Second, a scheduling algorithm using earliest finish time scheduling that considers resource allocation and job scheduling in the cloud is proposed. Lastly, experimental results show that the proposed scheduler exhibits better performance than other well-known schedulers, such as FIFO and Fair.

Keywords

Hadoop; MapReduce; Cloud computing; Big data; Scheduling algorithms

URI

http://hdl.handle.net/123456789/44
