An Energy-Aware Resource Management Strategy Based on Spark and YARN in Heterogeneous Environments

dc.contributor.author Shabestari, Fatemeh
dc.contributor.author Navimipour, Nima Jafari
dc.date.accessioned 2024-06-23T21:38:08Z
dc.date.available 2024-06-23T21:38:08Z
dc.date.issued 2024
dc.description Shabestari, Fatemeh/0000-0003-1926-4674 en_US
dc.description.abstract Apache Spark is a popular framework for processing big data. Running Spark on Hadoop YARN allows it to schedule Spark workloads alongside other data-processing frameworks on Hadoop. When an application is deployed in a YARN cluster, its resources are given without considering energy efficiency. Furthermore, there is no way to enforce any user-specified deadline constraints. To address these issues, we propose a new deadline-aware resource management system and a scheduling algorithm to minimize the total energy consumption in Spark on YARN for heterogeneous clusters. First, a deadline-aware energy-efficient model for the considered problem is proposed. Then, using a locality-aware method, executors are assigned to applications. This algorithm sorts the nodes based on the performance per watt (PPW) metric, the number of application data blocks on nodes, and the rack locality. It also offers three ways to choose executors from different machines: greedy, random, and Pareto-based. Finally, the proposed heuristic task scheduler schedules tasks on executors to minimize total energy and tardiness. We evaluated the performance of the suggested algorithm regarding energy efficiency and satisfying the Service Level Agreement (SLA). The results showed that the method outperforms the popular algorithms regarding energy consumption and meeting deadlines. en_US
dc.identifier.doi 10.1109/TGCN.2023.3347276
dc.identifier.issn 2473-2400
dc.identifier.scopus 2-s2.0-85181573774
dc.identifier.uri https://doi.org/10.1109/TGCN.2023.3347276
dc.identifier.uri https://hdl.handle.net/20.500.12469/5749
dc.language.iso en en_US
dc.publisher IEEE-Inst Electrical Electronics Engineers Inc en_US
dc.relation.ispartof IEEE Transactions on Green Communications and Networking
dc.rights info:eu-repo/semantics/closedAccess en_US
dc.subject Sparks en_US
dc.subject Yarn en_US
dc.subject Task analysis en_US
dc.subject Resource management en_US
dc.subject Energy efficiency en_US
dc.subject Energy consumption en_US
dc.subject Clustering algorithms en_US
dc.subject Distributed computing en_US
dc.subject energy management en_US
dc.subject resource management en_US
dc.subject scheduling en_US
dc.title An Energy-Aware Resource Management Strategy Based on Spark and YARN in Heterogeneous Environments en_US
dc.type Article en_US
dspace.entity.type Publication
gdc.author.id Shabestari, Fatemeh/0000-0003-1926-4674
gdc.author.scopusid 57204862467
gdc.author.scopusid 55897274300
gdc.bip.impulseclass C5
gdc.bip.influenceclass C5
gdc.bip.popularityclass C5
gdc.coar.access metadata only access
gdc.coar.type text::journal::journal article
gdc.description.department Kadir Has University en_US
gdc.description.departmenttemp [Shabestari, Fatemeh] Islamic Azad Univ, Dept Comp Engn, Sofian Branch, Sofian, Iran; [Navimipour, Nima Jafari] Kadir Has Univ, Dept Comp Engn, TR-34083 Istanbul, Turkiye; [Navimipour, Nima Jafari] Natl Yunlin Univ Sci & Technol, Future Technol Res Ctr, Touliu 64002, Taiwan en_US
gdc.description.endpage 644 en_US
gdc.description.issue 2 en_US
gdc.description.publicationcategory Article - International Refereed Journal - Institutional Faculty Member en_US
gdc.description.scopusquality Q1
gdc.description.startpage 635 en_US
gdc.description.volume 8 en_US
gdc.description.wosquality Q1
gdc.identifier.openalex W4390224544
gdc.identifier.wos WOS:001230177900019
gdc.oaire.diamondjournal false
gdc.oaire.impulse 0.0
gdc.oaire.influence 2.5942106E-9
gdc.oaire.isgreen false
gdc.oaire.popularity 2.9478422E-9
gdc.oaire.publicfunded false
gdc.openalex.fwci 2.324
gdc.openalex.normalizedpercentile 0.65
gdc.opencitations.count 0
gdc.plumx.scopuscites 4
gdc.scopus.citedcount 4
gdc.virtual.author Jafari Navimipour, Nima
gdc.wos.citedcount 3
relation.isAuthorOfPublication 0fb3c7a0-c005-4e5f-a9ae-bb163df2df8e
relation.isAuthorOfPublication.latestForDiscovery 0fb3c7a0-c005-4e5f-a9ae-bb163df2df8e
relation.isOrgUnitOfPublication fd8e65fe-c3b3-4435-9682-6cccb638779c
relation.isOrgUnitOfPublication 2457b9b3-3a3f-4c17-8674-7f874f030d96
relation.isOrgUnitOfPublication b20623fc-1264-4244-9847-a4729ca7508c
relation.isOrgUnitOfPublication.latestForDiscovery fd8e65fe-c3b3-4435-9682-6cccb638779c
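
The executor-placement step summarized in the abstract (ranking nodes by performance per watt, local data blocks, and rack locality, then choosing machines greedily, randomly, or from a Pareto set) might look roughly like the sketch below. This is an illustration only, not the authors' implementation: the `Node` fields, the function names, the priority order of the three sort keys, and the two objectives used for the Pareto set are all assumptions, since the record does not specify them.

```python
import random
from dataclasses import dataclass


@dataclass(frozen=True)
class Node:
    """Hypothetical description of one cluster machine."""
    name: str
    rack: str
    ppw: float         # performance per watt; higher is better
    local_blocks: int  # application data blocks stored on this node


def rank_nodes(nodes, preferred_rack):
    """Sort by PPW, then local block count, then rack locality.

    The key order is an assumption; the abstract only names the criteria.
    """
    return sorted(
        nodes,
        key=lambda n: (-n.ppw, -n.local_blocks, n.rack != preferred_rack),
    )


def pick_greedy(nodes, preferred_rack, k):
    """Take the k best-ranked nodes."""
    return rank_nodes(nodes, preferred_rack)[:k]


def pick_random(nodes, k, seed=None):
    """Take k nodes uniformly at random (seed is for reproducibility)."""
    return random.Random(seed).sample(list(nodes), k)


def pareto_front(nodes):
    """Keep nodes not dominated on (ppw, local_blocks).

    Node a dominates node b if a is at least as good on both
    objectives and strictly better on at least one.
    """
    def dominates(a, b):
        no_worse = a.ppw >= b.ppw and a.local_blocks >= b.local_blocks
        better = a.ppw > b.ppw or a.local_blocks > b.local_blocks
        return no_worse and better

    return [n for n in nodes if not any(dominates(m, n) for m in nodes)]
```

A caller would pass the candidate machines and the rack that holds most of the application's data, for example `pick_greedy(nodes, "r1", 2)`, and could fall back to `pareto_front(nodes)` when no single ranking of the criteria is preferred.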
