Web server load balancing

A queueing analysis

Zhongju Zhang, Weiguo Fan

Research output: Contribution to journalArticle

26 Citations (Scopus)

Abstract

Over the last few years, the Web-based services, more specifically different types of E-Commerce applications, have become quite popular, resulting in exponential growth in the Web traffic. In many situations, this has led to unacceptable response times and unavailability of services, thereby driving away customers. Many companies are trying to address this problem using multiple Web servers with a front-end load balancer. Load balancing has been found to provide an effective and scalable way of managing the ever-increasing Web traffic. However, there has been little attempt to analyze the performance characteristics of a system that uses a load balancer. This paper presents a queuing model for analyzing load balancing with two Web servers. We first analyze the centralized load balancing model, derive the average response time and the rejection rate, and compare three different routing policies at the load balancer. We then extend our analysis to the distributed load balancing and find the optimal routing policy that minimizes the average response time.

Original languageEnglish (US)
Pages (from-to)681-693
Number of pages13
JournalEuropean Journal of Operational Research
Volume186
Issue number2
DOIs
StatePublished - Apr 16 2008
Externally publishedYes

Fingerprint

Queueing Analysis
Web Server
Load Balancing
Resource allocation
Servers
Response Time
Routing
Traffic
traffic
Queuing Model
Electronic commerce
Exponential Growth
Electronic Commerce
Rejection
commerce
Web-based
customer
Customers
Minimise
Queueing

Keywords

  • Load balancing
  • Parallel queues
  • Queueing theory
  • Routing

ASJC Scopus subject areas

  • Information Systems and Management
  • Management Science and Operations Research
  • Statistics, Probability and Uncertainty
  • Applied Mathematics
  • Modeling and Simulation
  • Transportation

Cite this

Web server load balancing : A queueing analysis. / Zhang, Zhongju; Fan, Weiguo.

In: European Journal of Operational Research, Vol. 186, No. 2, 16.04.2008, p. 681-693.

Research output: Contribution to journalArticle

@article{ebcd6574ddcb4bdebf9420b337909c0d,
title = "Web server load balancing: A queueing analysis",
abstract = "Over the last few years, the Web-based services, more specifically different types of E-Commerce applications, have become quite popular, resulting in exponential growth in the Web traffic. In many situations, this has led to unacceptable response times and unavailability of services, thereby driving away customers. Many companies are trying to address this problem using multiple Web servers with a front-end load balancer. Load balancing has been found to provide an effective and scalable way of managing the ever-increasing Web traffic. However, there has been little attempt to analyze the performance characteristics of a system that uses a load balancer. This paper presents a queuing model for analyzing load balancing with two Web servers. We first analyze the centralized load balancing model, derive the average response time and the rejection rate, and compare three different routing policies at the load balancer. We then extend our analysis to the distributed load balancing and find the optimal routing policy that minimizes the average response time.",
keywords = "Load balancing, Parallel queues, Queueing theory, Routing",
author = "Zhongju Zhang and Weiguo Fan",
year = "2008",
month = "4",
day = "16",
doi = "10.1016/j.ejor.2007.02.011",
language = "English (US)",
volume = "186",
pages = "681--693",
journal = "European Journal of Operational Research",
issn = "0377-2217",
publisher = "Elsevier",
number = "2",

}

TY - JOUR

T1 - Web server load balancing

T2 - A queueing analysis

AU - Zhang, Zhongju

AU - Fan, Weiguo

PY - 2008/4/16

Y1 - 2008/4/16

N2 - Over the last few years, the Web-based services, more specifically different types of E-Commerce applications, have become quite popular, resulting in exponential growth in the Web traffic. In many situations, this has led to unacceptable response times and unavailability of services, thereby driving away customers. Many companies are trying to address this problem using multiple Web servers with a front-end load balancer. Load balancing has been found to provide an effective and scalable way of managing the ever-increasing Web traffic. However, there has been little attempt to analyze the performance characteristics of a system that uses a load balancer. This paper presents a queuing model for analyzing load balancing with two Web servers. We first analyze the centralized load balancing model, derive the average response time and the rejection rate, and compare three different routing policies at the load balancer. We then extend our analysis to the distributed load balancing and find the optimal routing policy that minimizes the average response time.

AB - Over the last few years, the Web-based services, more specifically different types of E-Commerce applications, have become quite popular, resulting in exponential growth in the Web traffic. In many situations, this has led to unacceptable response times and unavailability of services, thereby driving away customers. Many companies are trying to address this problem using multiple Web servers with a front-end load balancer. Load balancing has been found to provide an effective and scalable way of managing the ever-increasing Web traffic. However, there has been little attempt to analyze the performance characteristics of a system that uses a load balancer. This paper presents a queuing model for analyzing load balancing with two Web servers. We first analyze the centralized load balancing model, derive the average response time and the rejection rate, and compare three different routing policies at the load balancer. We then extend our analysis to the distributed load balancing and find the optimal routing policy that minimizes the average response time.

KW - Load balancing

KW - Parallel queues

KW - Queueing theory

KW - Routing

UR - http://www.scopus.com/inward/record.url?scp=35349020409&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=35349020409&partnerID=8YFLogxK

U2 - 10.1016/j.ejor.2007.02.011

DO - 10.1016/j.ejor.2007.02.011

M3 - Article

VL - 186

SP - 681

EP - 693

JO - European Journal of Operational Research

JF - European Journal of Operational Research

SN - 0377-2217

IS - 2

ER -