System Models For Distributed and Cloud Computing

Download as pptx, pdf, or txt
Download as pptx, pdf, or txt
You are on page 1of 9

SYSTEM MODELS FOR DISTRIBUTED

AND CLOUD COMPUTING


Clusters of Cooperative Computers

A computing cluster consists of interconnected stand-alone computers which work


cooperatively as a single integrated computing resource. In the past, clustered
computer systems have demonstrated impressive results in handling heavy workloads
with large data sets.

Cluster Architecture

shows the architecture of a typical server cluster built around a low-latency, high
bandwidth interconnection network. This network can be as simple as a SAN (e.g.,
Myrinet) or a LAN (e.g., Ethernet). To build a larger cluster with more nodes, the
interconnection network can be built with multiple levels of Gigabit Ethernet, Myrinet,
or InfiniBand switches.

Through hierarchical construction using a SAN, LAN, or WAN, one can build scalable
clusters with an increasing number of nodes. The cluster is connected to the Internet
via a virtual private network (VPN) gateway. The gateway IP address locates the cluster.
The system image of a computer is decided by the way the OS manages the shared
cluster resources. Most clusters have loosely coupled node computers. All resources
of a server node are managed by their own OS. Thus, most clusters have multiple
system images as a result of having many autonomous nodes under different OS
control.

A cluster of servers interconnected by a high-bandwidth SAN or LAN with shared I/O


devices and disk arrays;

the cluster acts as a single computer attached to the Internet.


Single-System Image
An ideal cluster should merge multiple system images into a single-system image (SSI).
Cluster designers desire a cluster operating system or some middleware to support SSI
at various levels, including the sharing of CPUs, memory, and I/O across all cluster
nodes.

An SSI is an illusion created by software or hardware that presents a collection of


resources as one integrated, powerful resource. SSI makes the cluster appear like a
single machine to the user. A cluster with multiple system images is nothing but a
collection of independent computers.

Hardware, Software, and Middleware Support

Clusters exploring massive parallelism are commonly known as MPPs. Almost all HPC
clusters in the Top 500 list are also MPPs. The building blocks are computer nodes (PCs,
workstations, servers, or SMP), special communication software such as PVM or MPI,
and a network interface card in each computer node.

Most clusters run under the Linux OS. The computer nodes are interconnected by a
high-bandwidth network (such as Gigabit Ethernet, Myrinet, InfiniBand, etc.).
Major Cluster Design Issues

Unfortunately, a cluster-wide OS for complete resource sharing is not available yet.


Middleware or OS extensions were developed at the user space to achieve SSI at
selected functional levels.

Without this middleware, cluster nodes cannot work together effectively to achieve
cooperative computing.

The software environments and applications must rely on the middleware to achieve
high performance. The cluster benefits come from scalable performance, efficient
message passing, high system availability, seamless fault tolerance, and cluster-wide
job management.
Energy Efficiency in Distributed Computing

Primary performance goals in conventional parallel and distributed computing systems are
high performance and high throughput, considering some form of performance reliability
(e.g., fault tolerance and security). However, these systems recently encountered new
challenging issues including energy efficiency, and workload and resource outsourcing.

This section reviews energy consumption issues in servers and HPC systems, an area
known as distributed power management (DPM).

Energy Consumption of Unused Servers

To run a server farm (data center) a company has to spend a huge amount of money for
hardware, software, operational support, and energy every year. Therefore, companies
should thoroughly identify whether their installed server farm (more specifically, the
volume of provisioned resources) is at an appropriate level, particularly in terms of
utilization. It was estimated in the past that, on average, one-sixth (15 percent) of the full-
time servers in a company are left powered on without being actively used (i.e., they are
idling) on a daily basis.
This indicates that with 44 million servers in the world, around 4.7 million servers are not
doing any useful work.
Reducing Energy in Active Servers

In addition to identifying unused/underutilized servers for energy savings, it is also


necessary to apply appropriate techniques to decrease energy consumption in active
distributed systems with negligible influence on their performance.

Power management issues in distributed computing platforms


can be categorized into four layers (see Figure above): the application layer,
middleware layer, resource layer, and network layer.

You might also like