2020KS Sharma-VMware Cloud Foundations
2020KS Sharma-VMware Cloud Foundations
2020KS Sharma-VMware Cloud Foundations
REDEFINING SOFTWARE-DEFINED
DATACENTERS
Anuj Sharma
Sr. Principal Engineer
Dell
Anuj.sharma@dell.com
Benedikt Mayer
Sr. Principal Engineer
Dell
Benedikt.mayer@dell.com
Courses are offered to meet different learning styles and schedules, including self-paced On Demand,
remote-based Virtual Instructor-Led and in-person Classrooms.
Whether you are an experienced IT professional or just getting started, Dell Technologies Proven
Professional certifications are designed to clearly signal proficiency to colleagues and employers.
Disclaimer: The views, processes or methodologies published in this article are those of the authors.
They do not necessarily reflect Dell Technologies’ views, processes or methodologies.
Dell.com/certification 3
Overview
Industries are undergoing digital transformation on a path to innovate and adapt to new customer
behavior. This rapid pace of change requires a cloud platform that can be quickly deployed and
consumed. VMware Cloud Foundation combines VMware’s leading technologies to deliver a
standardized software defined datacenter (SDDC) platform to environments of all size.
VMware Cloud Foundation provides automated end to end lifecycle management combines compute,
network and storage virtualization as well as cloud management into as single solution.
VMware Cloud Foundation (VCF) on VxRail provides the simplest path to a multi-cloud solution through
automated deployment of a fully integrated platform that leverages native VxRail hardware and
software capabilities paired with VMware’s stack. These components work together to deliver a new
turnkey solution.
VCF on VxRail is engineered to standardize the architecture of all SDDC components like vSphere, VSAN,
and NSX to provide time to value from the large-scale data center to edge locations. Another important
aspect is lifecycle automation provided by Cloud Foundation, that streamlines the upgrade experience
for the full SDDC software and hardware stack.
This chapter will discuss VMware Cloud Foundations Architecture models along with other relevant topics
related to architecture of VMware Cloud Foundations.
Dell.com/certification 5
TOR-1
TOR-2
NSX-V NSX-T
NSX-V
VRLI
PSC2 CNTL 2
WN1
MWD
NSX-V
TENANT VRLI
CNTL 3
Network
Security
VCR WN2
MWD
TENANT
TENANT
NSX
VM
TENANT
TENANT
NSX
VM
CNTL 2
VSAN
TENANT
TENANT
NSX
VM
CNTL 3
TENANT
VM
Figure 1
Figure 1 shows a single rack deployment with one Management Workload Domain with four Nodes and
one Tenant Workload Domain with four Nodes. Minimum recommended nodes for Management Cluster
are four.
SPINE-2
TOR-1 TOR-1
TOR-2 TOR-2
CNTL1 T2
VSAN
2 VCR MN
VSAN
MWD
NSX-V
NSV-V NSX-V NSX-V TENANT
VRLI
PSC2 CNTL 2 MGR CNTL3 T2 2 VM
T2 WN1
MWD
NSV-V TENANT TENANT
TENANT 1 HORIZON VRLI
Network
CNTL 3
Security
VIEW VMs
2 VM 2 VM
Network
Security
VM
TENANT 3 NSX-V
TENANT 1
VDI CNTL2 T3
HORIZON
VM
VSAN
NSX-T
VSAN
TENANT 1 TENANT 3
VDI
VM
Figure 2
Figure 2 depicts a multi-rack deployment in Leaf Spine Network Topology where each rack has two Top
of the Rack Switches, with each rack connecting to Spine Switches. Also in the above deployment we can
see that we have one Management Workload Domain, one NSX-V Tenant Workload Domain, one NSX-T
Tenant Workload Domain and one Horizon Workload Domain. This shows that multiple tenants with
different configurations can be accommodated in a Single VMware Cloud Foundation Instance.
Dell.com/certification 7
Management Workload Domain Components
TOR-1
TOR-2
NSX-V SDDC
vCenter
MGR MGR
NSX-V VRLI
Management and Tenant ESXi Cluster
TENANT
PSC1 VM
CNTL-1 MN
6 or Less Nodes
Figure 3
Figure 9 depicts a single ESXi Cluster with six Nodes with all the Management VM’s as well as Tenant
VM’s. Single vCenter manages the environment and resource pools are used to provide isolation
between management and tenant workloads. The environment is managed from a single vCenter Server
and vSphere resource pools provide isolation between management and user workloads. Consolidate
Architecture can be converted to Standard Deployment Model by adding nodes and creating a new
Tenant Workload Domain followed by migration Tenant Workload VM’s from Management Domain to
New Tenant Workload Domain. This deployment model is not supported with VxRail.
Dell.com/certification 9
Stretched vSAN Cluster with VMware Cloud Foundations
vSAN Stretched Cluster with VMware Cloud Foundation works similarly as a normal vSAN Stretched
Cluster. With VMware Cloud Foundation we get the benefit of automated deployment of the Stretched
Workload Domain and using NSX-V or NSX-T. With VCF on VxRail we recommend physical and logical
configuration based on the VVD standardized architecture design which includes a Management Cluster
(4 node cluster minimum) and Compute/Edge Workload Domain Cluster (4 node cluster minimum).
• We deploy the NSX-T workload domain using SDDC and configure NSX-T.
• Prepare the IP addresses and forward/reserve DNS records for the PKS API, Pivotal Operations
Manager and the Harbor Registry (optional)
• Generate the certificates and private keys from a trusted certificate authority that include the
fully qualified domain names for each PKS management component.
• Prepare the NSX-T Tier-0 router, node and pod IP blocks and a floating IP pool for Kubernetes
cluster resources.
• Prepare the network settings and resources for the availability zones. This includes the network
CIDR, gateway, reserved IP ranges, target logical switch and vSphere cluster for management
and Kubernetes availability zone.
Figure 4
Begin the Automated Bringup of Management • Using SDDC manager start Automated Bring Up of Management work load domain.
Workload Domain • This will install SDDC Manager , vCenter , PSC, NSX Manager , NSX Controllers and Log Insight
Initiate Automated Compute Workload Domain • Once the Management Workload is deployed we can deploy Compute/Horizon/PKS workload Domain as and when required .
Deployment as per requirements
• If the deployed Tenant Workload Domain is NSX-V Compute Workload domain then SDDC Manager will deploy vCenter in
NSX-V Workload Domain Management Workload Domain , Deploy NSX Manager in Management workload Domain , Registerr NSX Manager with Tenant
vCenter and Deploy 3 NSX Controller in Tennat Workload Domain.
• If we want to deploy Horizon Workload Domain then first we deploy a normal VI compute worklad domain and then use that
compute workload domain to deploy SDDC VDI's. Horizon View Management VM's like Composers , Connection Servers, App
NSX-V Horizon Workload Domain Volume Managers, User Environment Manager , Unified Access Gateway , Load Balancers are deployed on Management Workload
Domain. Also . There are some pre-requistes before deploying that should be met. We should refer VCF Administratior Guide for
more details on this.
• The first NSX-T Workload Domain that we will deploy will lead to deployment of 3 NSX Managers on the Management Workload
NSX-T VI Workload Domain Domain. Any future NSX-T workload domains will also use the existing NSX-Managers.
VMware PKS Workload Domain • We can deploy VMware PKS Workload Domain with NSX-T as defined in previous section .
Dell.com/certification 11
Why VMware Cloud Foundation on VxRAIL
• Co-engineered with VMware and only VMware Co-engineered Solution available.
• Single point of contact for support-related issues.
• Proactive Dial Home Support.
• Automated end-to-end lifecycle management including automated VMware Components
Upgrades along with VxRail Node Hardware firmware upgrades.
• Standardized on VMware SDDC Architecture.
• Tighter VxRail Integration within vSphere Client enables management of VxRail nodes from a
single pane of glass.
• VxRail Software Bundles alignment with VMware Cloud Foundations Release.
• Dell Smart Fabric Integration for automation of Switch Configuration.
• Industry-Leading Compute Infrastructure.
Operating at the forefront of technology, the airline has also become one of Ethiopia's major industries
and a veritable institution in Africa. It commands a lion's share of the pan-African network including
multiple daily east-west flights across the continent and currently serves 81 international destinations
operating the newest and youngest fleets.
African Airlines’ current vSphere infrastructure is based on IBM hardware acquired in 2014. The vSphere
environment based on vSphere 6.5 has reached maximum capacity. The perception from African Airlines
staff is that the current support is unreliable and costly. Currently, there is no disaster recovery plan in
place to deal with a complete site failure. As a main driver for African Airlines’ IT transformation strategy
an agile approach that includes automation and self-service shall be adopted.
African Airlines engaged Dell Technologies to design and implement a new infrastructure to replace the
current solution. The solution needs to address the current resource constraints and lacking site failover
capabilities.
Dell.com/certification 13
Conceptual Design
Initial design workshops with the project sponsors and all involved stakeholders of African Airlines
defined their business goals along with their business requirements. In these meetings requirements,
constraints, assumptions and risks (shown in the sections below) were also identified that influenced the
design.
Business Requirements
Item Design Quality Description
R02 Scalability Account for 40% growth in the number of workloads based on
the existing metrics.
R04 Availability Design must provide failover capabilities to sustain two server
failures at the same time.
R06 Availability Solution to span two data center in the main location and a
second site within the same metro area.
R13 Performance, The design must provide configuration parameters for best
Availability performance and resiliency.
R14 Manageability Design must maintain simplicity where possible to allow existing
operations teams to manage the new environments.
A02 Infrastructure NTP server is available for time synchronization of all vSphere
components.
A03 Infrastructure Sufficient power, cooling and rack space is available to support
the solution.
A05 Operations African Airlines has sufficient knowledge of how to manage the
vSphere environment.
A07 Network African Airlines is responsible for creating all VLANs that will be
used in this solution at the core level as well as providing routing
capabilities if required.
A09 Network IP address space is available for all components in this solution.
Dell.com/certification 15
Design Constraints
Item Design Quality Description
C02 General During normal operations no production workload should run in the DR
site. Secondary site should only be utilized in a DR case.
C05 Manageability Integration into existing monitoring solutions - IBM Tivoli Monitoring for
application monitoring, VMware vRealize Operations Manager for
vSphere monitoring.
Resources are pooled together while adhering to the requirements and constraints of this design to run
the workload. The environment is separated into management and production. Replication between
sites is facilitating a complete site failover. All production VM on the current vSphere environment will
be migrated onto the target solution as part of a migration project. Figure 5 illustrates the conceptual
design of the environment.
Region A Region B
VM VM VM VM VM VM VM VM VM VM VM VM
Compute Compute
Replication
Storage Storage
Network Network
Security Security
Figure 5
Dell.com/certification 17
Figure 6 outlines the logical design.
Region A Region B
External and
Inter-site Connectivity
VM VM VM VM VM VM
Hypervisor Hypervisor
IP Based Replication
Hyperconverged Nodes Hyperconverged Nodes
Physical and Software Defined Networking Physical and Software Defined Networking
VM VM VM VM VM VM
Hypervisor Hypervisor
IP Based Replication
Hyperconverged Nodes Hyperconverged Nodes
Physical and Software Defined Networking Physical and Software Defined Networking
Figure 6
The following table summarizes the results of the analysis. It details the overall requirements for the
vSphere environment to support the workloads of the existing workload. All values have been rounded
up to ensure sufficient resources are available during peak times.
Metric Value
A total of 40% growth needs to be accounted for in the environment based on the current resource
usage. The table below lists the resource requirements including growth.
Metric Value
Dell.com/certification 19
Host Design
Design Decision ID DD07
Design Decision Use Dell AF-4 Ready Node as building block for the SDDC.
Design Justification To account for the resource requirements, the following VSAN-
ready nodes have been selected, while aligning to VMware
recommendation in terms of sizing.
Region A:
Management Workload Domain
AZ1: 4x AF-4 Nodes
AZ2: 4x AF-4 Nodes
Production Workload Domain
AZ1: 12x AF-4 Nodes
AZ2: 12x AF-4 Nodes
Region B:
Management Workload Domain
4x AF-4 Nodes
Production Workload Domain
12x AF-4 Nodes
The table below outlines the individual node configuration per Workload Domain.
Stretched Stretched
Management
management cluster management cluster
Cluster
Availability Zone 1 Availability Zone 2
(4 ESXi hosts)
(4 ESXi hosts) (4 ESXi hosts)
Figure 7
Network Design
Design Decision ID DD08
Design Decision Reuse existing Nexus 9236C switches.
Design Justification Customer has an existing investment in Nexus 9236C switches
that shall be reused. Each rack will include a pair of TOR
switches.
Each Hyperconverged node will be connected to each switch for
redundancy.
Dell.com/certification 21
Management Cluster - Region A Management Cluster - Region B
Management Management
vCenter Server vCenter Server
Production Production
vCenter Server vCenter Server
NSX Manager Pairing NSX Manager
NSX Manager
Primary Secondary
NSX Controller 1
NSX Controller 2
Universal
NSX Controller 3 Controller
Cluster
NSX Controller 1
NSX Controller 2
Universal
NSX Controller 3 Controller
Cluster
Figure 8
Figure 9
The table below provides an overview of business continuity and disaster recovery.
Data Center Redundant power, cooling and Site failover via RP4VM
backup generators
Dell.com/certification 23
Troubleshooting
As with VMware Cloud Foundations on VxRail we have virtualized Compute, Storage and Network along
with Automation Layer. With so many abstraction layers troubleshooting becomes difficult. As per our
experience we follow the approach below for troubleshooting issues and found it to be very helpful.
Identify the •Navigate through the various component logs and correlate with time stamps.
•Validate if there is any relation with regard to recent environment changes.
•Navigate existing Knowledgebase .
VMware Cloud Foundations includes very helpful Supportablity and Serviceability (SoS) CLI utility. This
utility can be used for log bundle collection, detailed health checks and other maintenance related tasks.
Location of the utility is /opt/vmware/sddc-support/sos on SDDC Manager VM.
Above command will collect SDDC Manager logs along with VxRail Manager Logs.
./sos - - health-check
sos –help
We also have an important command to see the password of the environment. Below command needs
to be run from SDDC Manager.
/use/bin/lookup_password
References
https://docs.vmware.com/en/VMware-Cloud-Foundation/
Dell.com/certification 25
Dell Technologies believes the information in this publication is accurate as of its publication date. The
information is subject to change without notice.
THE INFORMATION IN THIS PUBLICATION IS PROVIDED “AS IS.” DELL TECHNOLOGIES MAKES NO
RESPRESENTATIONS OR WARRANTIES OF ANY KIND WITH RESPECT TO THE INFORMATION IN THIS
PUBLICATION, AND SPECIFICALLY DISCLAIMS IMPLIED WARRANTIES OF MERCHANTABILITY OR FITNESS
FOR A PARTICULAR PURPOSE.
Use, copying and distribution of any Dell Technologies software described in this publication requires an
applicable software license.
Copyright © 2020 Dell Inc. or its subsidiaries. All Rights Reserved. Dell Technologies, Dell, EMC, Dell EMC
and other trademarks are trademarks of Dell Inc. or its subsidiaries. Other trademarks may be
trademarks of their respective owners.