Survey on Resource Allocation Strategies in Cloud  Computing

Ronak Patel; Sanjay Patel

doi:10.17577/IJERTV2IS2109

Volume 02, Issue 02 (February 2013)

Survey on Resource Allocation Strategies in Cloud Computing

DOI : 10.17577/IJERTV2IS2109

Download Full-Text PDF Cite this Publication

Open Access
Article Download / Views: 104
Total Downloads : 6910
Authors : Ronak Patel, Sanjay Patel
Paper ID : IJERTV2IS2109
Volume & Issue : Volume 02, Issue 02 (February 2013)
Published (First Online): 28-02-2013
ISSN (Online) : 2278-0181
Publisher Name : IJERT
License: This work is licensed under a Creative Commons Attribution 4.0 International License

PDF Version

View

Text Only Version

Survey on Resource Allocation Strategies in Cloud Computing

Ronak Patel#1, Sanjay Patel*2

#1PG Student of Computer Engineering, LDRP-ITR, Gujarat Technological University, Gujarat, India

*2Assistant Professor, Department of Computer Engineering, LDRP-ITR, Gujarat, India

Abstract Cloud computing is the next generation of technology which unifies everything into one. It is an on demand service because it offers dynamic flexible resource allocation for reliable and guaranteed services in pay as-you-use manner to public. In Cloud computing multiple cloud users can request number of cloud services simultaneously. So there must be a provision that all resources are made available to requesting user in efficient manner to satisfy their need. In this paper a review of various policies for dynamic resource allocation in cloud computing is shown based on Topology Aware Resource Allocation (TARA), Linear Scheduling Strategy for Resource Allocation and Dynamic Resource Allocation for Parallel Data Processing. Moreover, significance, advantages and limitations of using Resource Allocation in Cloud computing systems is also discussed.

Keywords Dynamic Resource Allocation, Cloud Computing, Resource Management, Resource Scheduling.

INTRODUCTION

Cloud computing is the next generation in computation. Possibly people can have everything they need on the cloud. Cloud computing is the next natural step in the evolution of on-demand information technology services and products. Cloud Computing is an emerging computing technology that is rapidly consolidating itself as the next big step in the development and deployment of an increasing number of distributed applications.

Cloud computing nowadays becomes quite popular among a community of cloud users by offering a variety of resources. Cloud computing platforms, such as those provided by Microsoft, Amazon, Google, IBM, and Hewlett-Packard, let developers deploy applications across computers hosted by a central organization. These applications can access a large network of computing resources that are deployed and managed by a cloud computing provider. Developers obtain the advantages of a managed computing platform, without having to commit resources to design, build and maintain the network. Yet, an important problem that must be addressed effectively in the cloud is how to manage QoS and maintain SLA for cloud users that share cloud resources.

The cloud computing technology makes the resource as a single point of access to the client and is implemented as pay per usage. Though there are various advantages in cloud computing such as prescribed and abstracted infrastructure, completely virtualized environment, equipped with dynamic

infrastructure, pay per consumption, free of software and hardware installations, the major concern is the order in which the requests are satisfied. This evolves the scheduling of the resources. This allocation of resources must be made efficiently that maximizes the system utilization and overall performance. Cloud computing is sold on demand on the basis of time constrains basically specified in minutes or hours. Thus scheduling should be made in such a way that the resource should be utilized efficiently.

In cloud platforms, resource allocation (or load balancing) takes place at two levels. First, when an application is uploaded to the cloud, the load balancer assigns the requested instances to physical computers, attempting to balance the computational load of multiple applications across physical computers. Second, when an application receives multiple incoming requests, these requests should be each assigned to a specific application instance to balance the computational load across a set of instances of the same application. For example, Amazon EC2 uses elastic load balancing (ELB) to control how incoming requests are handled. Application designers can direct requests to instances in specific availability zones, to specific instances, or to instances demonstrating the shortest response times. In the following sections a review of existing resource allocation techniques like Topology Aware Resource Allocation, Linear Scheduling and Resource Allocation for parallel data processing is described briefly.
1. Resource Allocation and its Significance
  
  In cloud computing, Resource Allocation (RA) is the process of assigning available resources to the needed cloud applications over the internet. Resource allocation starves services if the allocation is not managed precisely. Resource provisioning solves that problem by allowing the service providers to manage the resources for each individual module. Resource Allocation Strategy (RAS) is all about integrating cloud provider activities for utilizing and allocating scarce resources within the limit of cloud environment so as to meet the needs of the cloud application. It requires the type and amount of resources needed by each application in order to complete a user job. The order and time of allocation of resources are also an input for an optimal RAS [1]. An
  
  optimal RAS should avoid the following criteria as follows:
  - Resource Contention – Resource contention arises when two applications try to access the same resource at the same time.
  - Scarcity of Resource – Scarcity of resource arises when there are limited resources and the demand for resources is high.
  - Resource Fragmentation – Resource fragmentation arises when the resources are isolated. There would be enough resources but cannot allocate it to the needed application due to fragmentation into small entities.
  - Over Provisioning – Over provisioning arises when the application gets surplus resources than the demanded one.
  - Under Provisioning – Under provisioning of resources occurs when the application is assigned with fewer numbers of resources than it demanded.
From the perspective of a cloud provider, predicting the dynamic nature of users, user demands, and application demands are impractical. For the cloud users, the job should be completed on time with minimal cost. Hence due to limited resources, resource heterogeneity, locality restrictions, environmental necessities and dynamic nature of resource demand, we need an efficient resource allocation system that suits cloud environments.

Cloud resources consist of physical and virtual resources. The physical resources are shared across multiple compute requests through virtualization and provisioning. The request for virtualized resources is described through a set of parameters detailing the processing, memory and disk needs. Provisioning satisfies the request by mapping virtualized resources to physical ones. The hardware and software resources are allocated to the cloud applications on-demand basis. For scalable computing, Virtual Machines are rented. [1]
RELATED WORK

Dynamic resource allocation problem is one of the most challenging problems in the resource management problems. The dynamic resource allocation in cloud computing has attracted attention of the research community in the last few years. Many researchers around the world have come up with new ways of facing this challenge.

In [8] authors have explained the algorithm for negotiation protocol for resource provisioning in detail. In [1], authors have made a comparison of many resource allocation strategies. In [9] authors propose a model and a utility function for location-aware dynamic resource allocation. A comprehensive comparison of rsource allocation policies is covered in [10]. In [11] author has used a Genetic Algorithm for scheduling of tasks in cloud computing systems.

This paper is not intended to address any specific resource allocation strategy, but to provide a review of some of the existing resource allocation techniques.

Not many papers which analyses various resource allocation strategies are available as cloud computing being a recent technology. The literature survey focuses on resource allocation strategies and its impacts on cloud users and cloud providers. It is believed that this survey would greatly benefit the cloud users and researchers.
RESOURCE ALLOCATION STRATEGIES & ALGORITHMS

Recently many resource allocation schemes have come up in the literature of cloud computing as this technology has started maturing. Researchers around the world have proposed and / or implemented several types of resource allocation. Few of the strategies for resource allocation in cloud computing are covered here briefly.
1. Topology Aware Resource Allocation (TARA)
  
  Different kinds of resource allocation mechanisms are proposed in cloud. The one mentioned in [2] proposes architecture for optimized resource allocation in Infrastructure-as-a-Service (IaaS) based cloud systems. Current IaaS systems are usually unaware of the hosted applications requirements and therefore allocate resources independently of its needs, which can significantly impact performance for distributed data-intensive applications.
  
  To address this resource allocation problem, an architecture that adopts a what if methodology to guide allocation decisions taken by the IaaS is proposed. The architecture uses a prediction engine with a lightweight simulator to estimate the performance of a given resource allocation and a genetic algorithm to find an optimized solution in the large search space. Results showed that TARA reduced the job completion time of these applications by up to 59% when compared to application-independent allocation policies.
  1. Architecture of TARA: TARA [2] is composed of two major components: a prediction engine and a fast genetic algorithm-based search technique. The prediction engine is the entity responsible for optimizing resource allocation. When it receives a resource request, the prediction engine iterates through the possible subsets of available resources (each distinct subset is known as a candidate) and identifies an allocation that optimizes estimated job completion time. However, even with a lightweight prediction engine, exhaustively iterating through all possible candidates is infeasible due to the scale of IaaS systems. Therefore a genetic algorithm-based search technique that allows TARA to guide the prediction engine through the search space intelligently is used.
    
    Fig. 1 Basic Architecture of TARA [2]
  2. Prediction Engine: The prediction engine maps resource allocation candidates to scores that measures their fitness with respect to a given objective function, so that TARA can compare and rank different candidates. The inputs used in the scoring process can be seen in Figure1, Architecture of TARA.
  3. Objective Function: The objective function defines the metric that TARA should optimize. For example, given the increasing cost and scarcity of power in the data center, an objective function might measure the increase in power usage due to a particular allocation.
  4. Application Description: The application description consists of three parts: i) the framework type that identifies the framework model to use, ii) workload specific parameters that describe the particular applications resource usage and iii) a request for resources including the number of VMs, storage, etc.
  5. Available Resources: The final input required by the prediction engine is a resource snapshot of the IaaS data centre. This includes information derived from both the virtualization layer and the IaaS monitoring service. The information gathered ranges from a list of available servers, current load and available capacity on individual servers to data centre topology and a recent measurement of available bandwidth on each network link.
2. Linear Scheduling Strategy for Resource Allocation
  
  Considering the processing time, resource utilization based on CPU usage, memory usage and throughput, the cloud environment with the service node to control all clients request, could provide maximum service to all clients [3]. Scheduling the resource and tasks separately involves more waiting time and response time. A scheduling algorithm named as Linear Scheduling for Tasks and Resources (LSTR) is designed, which performs tasks and resources scheduling respectively. Here, a server node is used to establish the IaaS cloud environment and KVM/Xen virtualization along with LSTR scheduling to allocate resources which maximize the system throughput and resource utilization.
  
  Resource consumption and resource allocation have to be integrated so as to improve the resource utilization. The scheduling algorithms mainly focus on the distribution of the resources among the requestors that will maximize the selected QoS parameters. The QoS parameter selected in our evaluation is the cost function. The scheduling algorithm is designed considering the tasks and the available virtual machines together and named LSTR scheduling strategy. This is designed to maximize the resource utilization.
  1. The requests are collected between every pre- determined interval of time
  2. Resources Ri => {R1,R2, R3,…,Rn}
  3. Requests RQi => {RQ1, RQ2, RQ3,…, RQn}
  4. Calculate Threshold (static at initial)
  5. Th = Ri
  6. for every unsorted array A and B
  7. sort A and B
  8. for every RQi
  9. if RQi < Th then
  10. add RQi in low array, A[RQi]
  11. else if RQi > Th then
  12. add RQi in high array B[RQi]
  13. for every B[RQi]
  14. allocate resource for RQi of B
  15. Ri = Ri – RQi; Th = Ri
  16. satisfy the resource of A[RQi]
  17. for every A[RQi]
  18. allocate resource for RQi of A
  19. Ri = Ri – RQi; Th = Ri
  20. satisfy the resource of B[RQi]
    The dynamic allocation could be carried out by the scheduler dynamically on request for additional resources. This is made by the continuous evaluation of the threshold value. The resource requests are collected and are sorted in different queues based on the threshold value. The requests are satisfied by the VMs. Evaluation is made by creating VM in which the virtual memory is allocated to the longer and shorter queues based on the best fit strategy. This scheduling approach and the calculation of dynamic threshold value in the scheduler are carried out by considering both task and the resource. This improves the system throughput and the resource utilization regardless of the starvation and the dead lock conditions.
    
    Fig. 2 Example of LSTR algorithm [3]
3. Dynamic Resource Allocation for Parallel Data Processing
  
  Dynamic Resource Allocation for Efficient Parallel data processing [4] introduces a new processing framework explicitly designed for cloud environments called Nephele. Most notably, Nephele is the first data processing framework to include the possibility of dynamically allocating/de-
  
  allocating different compute resources from a cloud in its scheduling and during job execution. Particular tasks of a processing job can be assigned to different types of virtual machines which are automatically instantiated and terminated during the job execution.
  1. Architecture: Nepheles architecture [4] follows a classic master-worker pattern as illustrated in Figure. Before submitting a Nephele compute job, a user must start a VM in the cloud which runs the so called Job Manager (JM). The Job Manager receives the clients jobs is responsible for scheduling them, and coordinates their execution. It is capable of communicating with the interface the cloud operator provides to control the instantiation of VMs. We call this interface the Cloud Controller. By means of the Cloud Controller the Job Manager can allocate or de-allocate VMs according to the current job execution phase.
    
    Fig. 3 Design Architecture of Nephele Framework [4]
    The actual execution of tasks which a Nephele job consists of is carried out by a set of instances. Each instance runs a so- called Task Manager (TM). A Task Manager receives one or more tasks from the Job Manager at a time, executes them, and after that informs the Job Manager about their completion or possible errors
  2. Job Description: Jobs in Nephele are expressed as a directed acyclic graph (DAG). Each vertex in the graph represents a task of the overall processing job, the graphs edges define the communication flow between these tasks Job description parameters are based on the following criterias:
    - Number of subtasks
    - Data sharing between instances of task
    - Instance type
    - Number of subtasks per instance
  3. Job Graph: Once the Job Graph is specified, the user submits it to the Job Manager, together with the credentials he has obtained from his cloud operator. The credentials are required since the Job Manager must allocate/deallocate instances during the job execution on behalf of the user.
4. Advantages and Limitations of Resource Allocation
There are many benefits in resource allocation while using cloud computing irrespective of size of the organization and business markets. But there are some limitations as well, since it is an evolving technology. Lets have a comparative look at the advantages and limitations of resource allocation in cloud. [1]
- The biggest benefit of resource allocation is that user neither has to install software nor hardware to access the applications, to develop the application and to host the application over the internet.
- The next major benefit is that there is no limitation of place and medium. We can reach our applications and data anywhere in the world, on any system.
- The user does not need to expend on hardware and software systems.
- Cloud providers can share their resources over the internet during resource scarcity.
- Since users rent resources from remote servers for their purpose, they dont have control over their resources.
- Migration problem occurs, when the users wants to switch to some other provider for the better storage of their data. Its not easy to transfer huge data from one provider to the other.
- In public cloud, the clients data can be susceptible to hacking or phishing attacks. Since the servers on cloud are interconnected, it is easy for malware to spread.
- Peripheral devices like printers or scanners might not work with cloud. Many of them require software to be installed locally. Networked peripherals have lesser problems.
- More and deeper knowledge is required for allocating and managing resources in cloud, since all knowledge about the working of the cloud mainly depends upon the cloud service provider.
CONCLUSION

Cloud computing technology is increasingly being used in enterprises and business markets. A review shows that dynamic resource allocation is growing need of cloud providers for more number of users and with the less response time. In cloud paradigm, an effective resource allocation strategy is required for achieving user satisfaction and maximizing the profit for cloud service providers. This paper summarizes the main types of RAS and its impacts in cloud system. Some of the strategies discussed above mainly focus on memory resources but are lacking in other factors. Hence this survey paper will hopefully motivate future researchers to come up with smarter and secured optimal resource allocation algorithms and framework to strengthen the cloud computing paradigm.

REFERENCES

V. Vinothina, Dr. R. Shridaran, and Dr. Padmavathi Ganpathi, A survey on resource allocation strategies in cloud computing, International Journal of Advanced Computer Science and Applications, 3(6):97–104, 2012.
Gunho Lee, Niraj Tolia, Parthasarathy Ranganathan, and Randy H. Katz, Topology aware resorce allocation for data-intensive workloads, ACM SIGCOMM Computer Communication Review, 41(1):120–124, 2011.
Abirami S.P. and Shalini Ramanathan, Linear scheduling strategy for resource allocation in cloud environment, International Journal on Cloud Computing: Services and Architecture(IJCCSA), 2(1):9–17, 2012.
Daniel Warneke and Odej Kao, Exploiting dynamic resource allocation for efficient parallel data processing in the cloud, IEEE Transactions On Parallel And Distributed Systems, 2011.
Atsuo Inomata, Taiki Morikawa, Minoru Ikebe and Md. Mizanur Rahman, Proposal and Evaluation of Dynamic Resource Allocation Method Based on the Load Of VMs on IaaS, IEEE, 2010.
Dorian Minarolli and Bernd Freisleben, Uitlitybased Resource Allocations for virtual machines in cloud computing, IEEE, 2011.
Jiyani, Adaptive resource allocation for preemptable jobs in cloud systems, IEEE, 2010.
Bo An, Victor Lesser, David Irwin and Michael Zink, Automated Negotiation with Decommitment for Dynamic Resource Allocation in Cloud Computing, Conference at University of Massachusetts, Amherst, USA.
Gihun Jung and Kwang Mong Sim, Location-Aware Dynamic Resource Allocation Model for Cloud Computing Environment, International Conference on Information and Computer Applications (ICICA), IACSIT Press, Singapore, 2012.
Chandrashekhar S. Pawar and R.B. Wagh, A review of resource allocation policies in cloud computing, World Journal of Science and Technology, 2(3):165-167, 2012.
Sandeep Tayal, Tasks Scheduling Optimization for the Cloud Computing systems, International Journal of Advanced Engineering Sciences and Technologies (IJAEST), 5(2): 111 – 115, 2011.

Survey on Resource Allocation Strategies in Cloud Computing

Leave a Reply