System Models For Distributed and Cloud Computing
Cluster Architecture
A typical server cluster is built around a low-latency, high-bandwidth
interconnection network. This network can be as simple as a SAN (system area network, e.g.,
Myrinet) or a LAN (e.g., Ethernet). To build a larger cluster with more nodes, the
interconnection network can be built with multiple levels of Gigabit Ethernet, Myrinet,
or InfiniBand switches.
Through hierarchical construction using a SAN, LAN, or WAN, one can build scalable
clusters with an increasing number of nodes. The cluster is connected to the Internet
via a virtual private network (VPN) gateway. The gateway IP address locates the cluster.
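To get a feel for how hierarchical switch construction scales the node count, the sketch below models a simple two-level hierarchy; the one-core-switch topology and the port accounting are illustrative assumptions, not details from the text.

```python
def two_level_nodes(ports_per_switch):
    """Estimate node capacity of an assumed two-level switch hierarchy.

    Illustrative topology: one core switch whose p ports each connect to a
    leaf switch; every leaf switch spends 1 port on its uplink and the
    remaining p - 1 ports on compute nodes.
    """
    p = ports_per_switch
    leaf_switches = p          # one leaf switch per core-switch port
    nodes_per_leaf = p - 1     # one port reserved for the uplink
    return leaf_switches * nodes_per_leaf

# A single 24-port switch connects 24 nodes; two levels of the same
# switches reach 24 * 23 = 552 nodes.
print(two_level_nodes(24))  # -> 552
```

Adding a third switching level multiplies the capacity again, which is why multi-level Gigabit Ethernet, Myrinet, or InfiniBand fabrics can scale to thousands of nodes.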
The system image of a computer is determined by the way the OS manages the shared
cluster resources. Most clusters have loosely coupled node computers, in which all
resources of a server node are managed by its own OS. Thus, most clusters have
multiple system images as a result of having many autonomous nodes under different
OS control.
Clusters exploiting massive parallelism are commonly known as MPPs. Almost all HPC
clusters in the Top 500 list are also MPPs. The building blocks are computer nodes (PCs,
workstations, servers, or SMPs), special communication software such as PVM or MPI,
and a network interface card in each computer node.
Most clusters run under the Linux OS. The computer nodes are interconnected by a
high-bandwidth network (such as Gigabit Ethernet, Myrinet, InfiniBand, etc.).
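MPI programs coordinate nodes through explicit sends and receives. The style can be sketched with standard-library threads and queues; `parallel_sum`, the chunking scheme, and the worker count below are my own illustrative choices, not MPI's API.

```python
import threading
import queue

def worker(rank, inbox, outbox):
    """One 'node': receive a chunk of work, send back a partial result."""
    chunk = inbox.get()              # receive, in the spirit of MPI_Recv
    outbox.put((rank, sum(chunk)))   # send, in the spirit of MPI_Send

def parallel_sum(data, n_workers=4):
    """Scatter data across workers, then reduce their partial sums."""
    chunks = [data[i::n_workers] for i in range(n_workers)]
    outbox = queue.Queue()
    threads = []
    for rank, chunk in enumerate(chunks):
        inbox = queue.Queue()
        inbox.put(chunk)             # scatter one chunk to each worker
        t = threading.Thread(target=worker, args=(rank, inbox, outbox))
        t.start()
        threads.append(t)
    for t in threads:
        t.join()
    partials = [outbox.get() for _ in range(n_workers)]
    return sum(s for _, s in partials)  # reduce the partial results

print(parallel_sum(list(range(100))))  # -> 4950
```

In a real cluster the "queues" are messages crossing the interconnection network, which is why its latency and bandwidth dominate cluster performance.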
Major Cluster Design Issues
Cluster middleware glues the autonomous nodes together; without it, cluster nodes
cannot work together effectively to achieve cooperative computing.
The software environments and applications must rely on the middleware to achieve
high performance. The cluster benefits come from scalable performance, efficient
message passing, high system availability, seamless fault tolerance, and cluster-wide
job management.
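Cluster-wide job management is one of the middleware services listed above. A minimal greedy sketch of it might look like the following, where the job costs and the least-loaded placement rule are assumptions of mine, not a description of any particular middleware.

```python
def place_jobs(job_costs, n_nodes):
    """Greedy load balancing: send each job to the least-loaded node.

    job_costs: estimated cost (e.g., run time) of each job in arrival order.
    Returns (assignment, loads): the node chosen for each job, and the
    final accumulated load on each node.
    """
    loads = [0] * n_nodes
    assignment = []
    for cost in job_costs:
        node = loads.index(min(loads))  # least-loaded node wins the job
        loads[node] += cost
        assignment.append(node)
    return assignment, loads

# Four jobs on a two-node cluster:
print(place_jobs([5, 3, 2, 7], 2))  # -> ([0, 1, 1, 0], [12, 5])
```

Real cluster schedulers add priorities, fault tolerance, and migration on top of this basic placement decision.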
Energy Efficiency in Distributed Computing
The primary performance goals in conventional parallel and distributed computing
systems are high performance and high throughput, along with some form of performance
reliability (e.g., fault tolerance and security). Recently, however, these systems have
encountered new challenges, including energy efficiency and workload and resource
outsourcing.
This section reviews energy consumption issues in servers and HPC systems, an area
known as distributed power management (DPM).
To run a server farm (data center), a company has to spend a huge amount of money
every year on hardware, software, operational support, and energy. Companies should
therefore carefully assess whether their installed server farm (more specifically, the
volume of provisioned resources) is at an appropriate level, particularly in terms of
utilization. Past estimates suggest that, on average, roughly one-sixth (15 percent) of a
company's full-time servers are left powered on without being actively used (i.e., they
are idling) on a daily basis.
With 44 million servers in the world, this indicates that around 4.7 million servers
are doing no useful work.
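The scale of that waste is easy to put in energy terms; the 200 W idle draw per server in the sketch below is an assumed figure for illustration, not a number from the text.

```python
def idle_energy_twh(idle_servers, idle_watts=200, hours_per_year=8760):
    """Annual energy consumed by idling servers, in terawatt-hours.

    idle_watts is an assumed average idle power draw per server.
    """
    return idle_servers * idle_watts * hours_per_year / 1e12

# 4.7 million idle servers at an assumed 200 W each:
print(idle_energy_twh(4.7e6))  # -> 8.2344 (TWh per year)
```

Powering down or consolidating even a fraction of these idle machines is therefore a first-order target for distributed power management.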
Reducing Energy in Active Servers