IBM Cloud Pak For Data Express Parts On SNO
IBM Cloud Pak for Data (CPD) is a fully integrated data and AI platform built on Red Hat OpenShift
Container Platform. It can run on a private, on-premises cluster or on an OpenShift deployment in the
cloud. CPD simplifies data access, automates data discovery and curation, and safeguards sensitive
information by automating policy enforcement for all users in your organization. It helps you connect
to your data, govern it, find it, and use it for analysis.
Note: The major tradeoff with an installation on a single node is the lack of high availability. In
environments that require high availability, configure the architecture so that, if the hardware fails,
workloads are moved to other sites or nodes while the impacted node is recovered.
Create a Red Hat OpenShift account. Open the hybrid console (https://console.redhat.com/openshift/),
select the Cloud option on the cluster tab, and create a cluster by selecting the AWS option. Select Linux
Installation
There are several ways to install a single node OpenShift cluster. Here we explore one simple
approach that we have used to deploy an SNO from scratch on AWS. The OpenShift installer
provisions the following resources:
• Virtual private cloud (VPC) that spans three Availability Zones (one private and one public subnet
in each Availability Zone).
• Single EC2 compute/worker node instance.
• Internet gateway to provide internet access to each subnet.
• Load Balancer for access to the OpenShift API.
Again, it’s important to note that running OpenShift with only a single node is not recommended for
production environments due to the lack of high availability and scalability.
Note: Red Hat does not claim support for single node OpenShift clusters on cloud providers.
Deploy The OpenShift Cluster
Our environment is now ready for OpenShift cluster installation on AWS Cloud Infrastructure.
1. For <installation_directory>, specify the name of the directory in which to store the files that the
installation program creates. After you run the command, it prompts you for the following inputs:
• SSH public key
• Cloud Provider: Select the cloud provider from the listed items
• Region: Select the Region where you want to deploy the cluster
• Base Domain: Select from the dropdown
• Cluster Name: Provide a unique name
• Red Hat Pull Secret
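The command that triggers these prompts is not reproduced in this text; we assume it is the installer's create install-config step, which writes an install-config.yaml file into <installation_directory>. Red Hat's single node documentation narrows that file to one node by setting the control plane replicas to 1 and the compute replicas to 0. The following is a minimal Python sketch of that adjustment on an already-parsed configuration. The function name to_single_node and all sample values (domain, cluster name, region) are hypothetical, and reading or writing the YAML file itself is left out.

```python
import copy


def to_single_node(install_config):
    """Return a copy of a parsed install-config narrowed to a single node."""
    config = copy.deepcopy(install_config)
    # One control plane machine, which also runs the workloads.
    config.setdefault("controlPlane", {"name": "master"})["replicas"] = 1
    # No dedicated worker machines.
    for pool in config.setdefault("compute", [{"name": "worker"}]):
        pool["replicas"] = 0
    return config


# Hypothetical values standing in for what the installer prompts collect.
generated = {
    "apiVersion": "v1",
    "baseDomain": "example.com",
    "metadata": {"name": "sno-demo"},
    "controlPlane": {"name": "master", "replicas": 3},
    "compute": [{"name": "worker", "replicas": 3}],
    "platform": {"aws": {"region": "us-east-1"}},
}

single_node = to_single_node(generated)
print(single_node["controlPlane"]["replicas"], single_node["compute"][0]["replicas"])
```

The function returns a modified copy, so the generated configuration can be kept for comparison before the installer consumes the edited file.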
• Run the installation program. Execute the following command to create a cluster.
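The command line itself did not survive in this copy. Assuming the standard openshift-install binary in the current directory, the invocation can be assembled as in the Python sketch below. The helper name create_cluster_command and the directory name sno-install are hypothetical; the log levels are the ones named in this document (info, warn, debug, error).

```python
import shlex

# Log levels named in this document.
ALLOWED_LOG_LEVELS = ("info", "warn", "debug", "error")


def create_cluster_command(installation_directory, log_level="info"):
    """Build the argument list for the installer's create cluster step."""
    if log_level not in ALLOWED_LOG_LEVELS:
        raise ValueError(f"unsupported log level: {log_level!r}")
    return [
        "./openshift-install", "create", "cluster",
        "--dir", installation_directory,
        f"--log-level={log_level}",
    ]


# "sno-install" is a placeholder for <installation_directory>.
command = create_cluster_command("sno-install")
print(shlex.join(command))
```

The sketch only prints the command. To execute it from Python, the same list can be passed to subprocess.run(command, check=True).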
During installation, the installation program creates a temporary bootstrap node, which the installer
tears down automatically when installation is done, leaving you with a single-node OpenShift
installation. The process takes around 20 minutes to get the cluster up and running.
Do not delete the installation program or the files that it creates. Both are required to delete the
cluster.
When the cluster deployment completes, directions for accessing your cluster, including a link to its web
console and credentials for the kubeadmin user, are displayed in your terminal.
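The terminal output is not reproduced here. With the standard installer, the same access details are also written to an auth subdirectory of <installation_directory>, as the files kubeconfig and kubeadmin-password. The Python sketch below locates them; the helper name cluster_credentials and the directory name sno-install are hypothetical.

```python
from pathlib import Path


def cluster_credentials(installation_directory):
    """Return the paths of the access files the installer leaves behind."""
    auth = Path(installation_directory) / "auth"
    return {
        "kubeconfig": auth / "kubeconfig",
        "kubeadmin_password": auth / "kubeadmin-password",
    }


# "sno-install" is a placeholder for <installation_directory>.
paths = cluster_credentials("sno-install")
# Shell line that points the oc and kubectl clients at the new cluster.
print(f"export KUBECONFIG={paths['kubeconfig']}")
```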
Optional Services - Analytics Engine, SPSS Modeler, Decision Optimization, Watson Pipelines, RStudio
Note: Depending on the number of services to be installed, you may need to increase the pod limit
(the default maximum is 250 pods per node). Refer to:
https://docs.openshift.com/container-platform/4.12/nodes/nodes/nodes-nodes-managing-max-pods.html
This change can cause the node to reboot, so make it before starting the CPD installation.
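As a sketch of what the linked procedure involves, the Python snippet below builds a KubeletConfig manifest that raises maxPods and prints it as JSON. The value 500, the object name set-max-pods, and the pool label custom-kubelet=set-max-pods are illustrative assumptions, not values taken from this document; the label must be added to the machine config pool that owns the node before the manifest takes effect.

```python
import json


def max_pods_kubelet_config(max_pods, label_key="custom-kubelet",
                            label_value="set-max-pods"):
    """Build a KubeletConfig manifest that raises the per-node pod limit."""
    if max_pods <= 0:
        raise ValueError("max_pods must be a positive integer")
    return {
        "apiVersion": "machineconfiguration.openshift.io/v1",
        "kind": "KubeletConfig",
        "metadata": {"name": "set-max-pods"},
        "spec": {
            # Selects the machine config pool that carries this label.
            "machineConfigPoolSelector": {
                "matchLabels": {label_key: label_value}
            },
            "kubeletConfig": {"maxPods": max_pods},
        },
    }


# 500 is an illustrative limit; size it to the services you plan to install.
manifest = max_pods_kubelet_config(500)
print(json.dumps(manifest, indent=2))
```

On a single node cluster the node belongs to the master pool, so the label would go on that pool (for example with oc label machineconfigpool master custom-kubelet=set-max-pods) before the printed manifest is saved to a file and applied with oc apply -f.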
1. For <installation_directory>, specify the path to the directory in which you stored the
installation files.
2. To view different levels of detail, specify warn, debug, or error instead of info.
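These two notes match the callouts on the installer's destroy cluster command in Red Hat documentation, so we assume that is the command they annotate; it is not reproduced in this text. The Python sketch below assembles it and first checks for metadata.json, the file the installer writes into <installation_directory> and reads back when deleting the cluster. The helper name destroy_cluster_command and the directory name sno-install are hypothetical.

```python
import shlex
from pathlib import Path


def destroy_cluster_command(installation_directory, log_level="info"):
    """Build the argument list for the installer's destroy cluster step."""
    directory = Path(installation_directory)
    # The installer needs the metadata it saved when the cluster was created.
    if not (directory / "metadata.json").is_file():
        raise FileNotFoundError(
            f"{directory} holds no metadata.json; was the cluster created from it?"
        )
    return [
        "./openshift-install", "destroy", "cluster",
        "--dir", str(directory),
        f"--log-level={log_level}",
    ]


try:
    # "sno-install" is a placeholder for <installation_directory>.
    print(shlex.join(destroy_cluster_command("sno-install")))
except FileNotFoundError as error:
    print(f"Refusing to build the command: {error}")
```

The up-front check mirrors the earlier warning not to delete the files the installation program creates: without them the installer has no record of which cloud resources belong to the cluster.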
Conclusion
SNO is a cost-effective option for developers and system administrators who want to deploy Cloud
Pak for Data Express in a small-scale environment. It provides a way to use various CPD services
with minimal resource overhead for smaller, short-term projects that can tolerate downtime. Overall,
SNO is a viable option for anyone who wants to test applications in a smaller environment or manage
a small number of applications without the need for a full-fledged cluster.
References:
• Single Node OpenShift Red Hat documentation:
https://docs.openshift.com/container-platform/4.12/installing/installing_sno/install-sno-installing-sno.html
• Installing a cluster on AWS with customizations:
https://docs.openshift.com/container-platform/4.10/installing/installing_aws/installing-aws-customizations.html
• Configuring persistent storage:
https://access.redhat.com/documentation/en-us/openshift_container_platform/4.3/html/storage/configuring-persistent-storage
Authors
This paper was produced by the IBM Cloud Pak for Data Multi-Cloud team, led by Anuj Sharma.
The following people contributed:
Aparna Sreekumar, Software Developer for the Cloud Pak for Data Multi-Cloud team, India Software
Labs (ISL). Working with IBM since August 2022.