Fairwinds Kubernetes Benchmark Report 2023
Fairwinds Kubernetes Benchmark Report 2023
Fairwinds Kubernetes Benchmark Report 2023
INTRODUCTION
Kubernetes security, cost efficiency and reliability are top employ Kubernetes governance and guardrails to ensure that
concerns to cloud native users. Unfortunately, many DevOps whether you are developing with cloud native technologies
team leaders lack visibility into what’s happening within or on the DevOps and platform engineering team building
clusters. The result is unnecessary risk, cloud cost overruns the infrastructure, the platform is secure, highly performant
and lost customers due to poor application performance. and cost effective. Otherwise, the benefits of the investment
in container and Kubernetes technology will never be
The Kubernetes Benchmark Report 2023 evaluates results fully realized by the business, nor will the leaders of your
from over 150,000 workloads and hundreds of organizations organization understand the full value it can bring.
using the Fairwinds Insights platform. The report serves
as a tool for Kubernetes users to benchmark their clusters On the positive side, there are areas where Kubernetes
against reliability, security and cost efficiency. guardrails are helping organizations accelerate remediation.
Fairwinds evaluated organizations who are using extensive
Things are getting worse, not better. Over the last year, guardrails vs. those who are not. Organizations who
the benchmark data shows that people are not configuring implement guardrails in shift-left scenarios (e.g., catching
Kubernetes according to best practices. In the previous misconfigurations before they are released), or at time of
report we saw many results with less than 10% of workloads deployment (e.g., via Admission Controllers) are able to fix
impacted, but this year the spread has become more varied 36% more issues where CPU and Memory configurations
across all the core issues: reliability, security and cost are missing than those who do not use guardrails. Features
efficiency. like Fairwinds’ application right-sizing recommendations
help provide these recommendations in a self-service way,
It’s clear DevOps teams are outnumbered and we need to
removing the DevOps team as a barrier. Similarly, companies
do better as a community to support them. As Kubernetes
leveraging guardrails are fixing 15% more image vulnerabilities
usage expands, it’s harder for DevOps to manage
than those without guardrails.
configuration risk introduced by new teams. We need to
Use this report to understand your cluster deficiencies, where Fairwinds is the trusted partner for Kubernetes governance
to make investments and how to configure Kubernetes to have and guardrails. With Fairwinds, customers ship cloud native
a positive business impact. Data from the benchmark results is applications faster, more cost-effectively and with less risk.
anonymous and sourced from users of Fairwinds Insights. Fairwinds provides a unified view between dev, sec and ops,
removing friction between those teams with software that
simplifies complexity.
DATASHEET // DATADOG & FAIRWINDS INSIGHTS: A POWERFUL COMBINATION
KUBERNETES BENCHMARK REPORT // 3
RELIABILITY
Missing Memory Limits & Missing Memory Requests
While Kubernetes best practices dictate that you should always set resource limits and requests on your workloads, it is not always
easy to know what values to use for each application. As a result, some teams never set requests or limits at all, while others set
them too high during initial testing and then never course correct.
Last year’s benchmark report showed that nearly half of organizations had set memory requests and limits for over 90% of their
workloads. In our latest findings, that number is down to around 20%. The spread varies after that, but we are seeing that more
workloads are impacted compared to previous years.
This could be because developers and DevOps teams do not know what limits to set or because Kubernetes consumption is
growing and there is a loss of control and visibility into how it is being configured, or both.
The key to ensuring scaling actions work properly is dialing in your memory limits and requests on each pod so workloads run
efficiently. Setting memory limits and requests is essential to operating applications on Kubernetes clusters as efficiently and
reliably as possible.
Eighty-three percent of organizations are not setting liveness or readiness probes for more than 10% of workloads. That is an
increase of 18 percentage points year over year against the previous benchmark report. Based on the reviewed data from the past
year, we see indications that this issue is getting worse, not better.
Again, we see an increase this year in all workloads impacted. While the distribution of percentage varies, we see that 25% of
organizations are relying on cached images for nearly all their workloads - up 10 percentage points from the previous report, which
is impacting the reliability of applications.
Deployments help maintain the stability and high availability of containers. Without these in place, if a node crashes, a
Deployment will still replace pods if the replica count is 1, but during that time there will be 0 replicas, which may cause
unavailability of that application.
Percentage of organizations
DATASHEET // DATADOG & FAIRWINDS INSIGHTS: A POWERFUL COMBINATION
KUBERNETES BENCHMARK REPORT // 6
If a single pod is allowed to consume all of the node CPU and memory, then other pods will be starved for resources. Setting
resource requests increases reliability by guaranteeing the pod will have access to those resources—and preventing other
pods from consuming all of the available resources on a node (this is referred to as the “noisy neighbor problem”).
SECURITY
Insecure Capabilities
Certain Linux capabilities are enabled by default for Kubernetes workloads, though most workloads don’t really need these
capabilities. The effort organizations made to pare back these capabilities has dropped. Whereas in 2021 42% of organizations
were turning off these capabilities for the vast majority of workloads, that number is only 10% now.
Instead, now we see 33% of organizations with more than 90% of workloads running with insecure capabilities. This is an
unexpected change as it shows that more organizations are now running workloads insecurely.
Whereas we previously saw a binary distribution of organizations locking down filesystems inside their containers, this year we see
a huge increase of workloads impacted. Today we see 56% of organizations seemingly unaware of the need to override the insecure
defaults for the majority (71%-100%) of their workloads compared to only 23% in 2021. This is not a good trend as Kubernetes usage
and adoption grows.
What is unsettling in this year’s findings is the increase in workloads open to privilege escalation. In 2021, we saw that 42% of
organizations managed to lock down the vast majority of workloads. That has dipped to just 10%. Whereas only 18% of organizations
had 91% or greater impacted workloads in 2021, that number has risen to 29% of organizations. It is a dangerous trend.
Runs as Privileged
The command, privileged, determines if any container in a pod can enable privileged mode. By default, a container is not allowed to
access any devices on the host, but a privileged container is given access to all devices on the host. This feature allows the container
nearly all the same access as processes running on the host, which is useful for containers that need to to use Linux capabilities,
such as manipulating the network stack and accessing devices.
Fortunately, the privileged flag is off by default, and 87% of organizations have stuck with that default, helping to increase the
security of their workloads. We’ve seen very little change in this over the last year, though like other checks, it’s trending in the
wrong direction.
Image Vulnerability
Workloads impacted by image vulnerabilities (such as last year’s log4j vulnerability) grew significantly this year. The 2021-2022
data showed that 40% of organizations had less than 10% of workloads impacted; in the last year that number has fallen to only
12% of organizations. There was a spike from 9% to 25% of organizations seeing greater than 90% of workloads impacted. Known
vulnerabilities can be exploited by malicious actors and need to be patched or remediated.
62% of organizations have more than 50% of workloads impacted by vulnerabilities. It is likely your organization is running images
with vulnerabilities.
When benchmarking yourself against these findings, the important takeaway is to update outdated helm charts or container
images. While you might be on the lower spectrum of impacted workloads, it still could introduce avoidable risk.
COST EFFICIENCY
CPU Requests and Limits
Organizations appear to be setting CPU limits and requests adequately. Data shows that only 72% of organizations are setting
0-10% of workload limits too high. At the other end of the spectrum, only 94% of organizations are setting 0-10% of workload
limits too low (CPU Limits set too high chart below, too low chart not shown).
Percentage of organizations
To mitigate the event of an Out-of-Memory error (OOMKill), Fairwinds Insights provides users with a tool that detects and reports
OOMKills. This has helped SREs reduce pager fatigue.
Avoid issues with too high or low memory requests by using tools that analyze usage and make suggestions to help you right-size
memory requests. Goldilocks, an open source tool, can help, or use the Fairwinds Insights platform if you are running multiple
clusters across multiple teams.
CONCLUSION
Fairwinds will continue to update the results of this benchmark data to help the cloud native community understand how they
stack up against peers. The important takeaway is this: it’s hard to configure Kubernetes to keep security, cost efficiency and
reliability consistent.
Misconfigurations present too many risks to organizations increasingly moving workloads to cloud native infrastructure.
Empower your team to deploy confidently by applying Kubernetes governance and guardrails to ensure Kubernetes
configurations are done right. Kubernetes governance solutions, such as Fairwinds Insights, can provide Dev, Sec and Ops
the ability to migrate to Kubernetes more quickly, ship applications faster, optimize cloud spend and achieve the application
performance customers expect.
Fairwinds Insights
Fairwinds Insights is Kubernetes governance software
“Insights saves the Variant platform team
that provides DevOps teams with the ability to set cost one to two weeks of work —and developers
and security guardrails. Users reduce risk, optimize cloud even more time. We’ve gone from a manual
spending and enhance application reliability all while
process over weeks to a fully automated
enabling developers to innovate and ship applications faster.
process saving significant time and
Insights provides a centralized view of multiple clusters and resources. My team can now focus on our
infrastructure as code, consistently aligns dev, sec and ops
and helps teams prioritize what matters most. Kubernetes
sprints, the R&D work we prefer.”
guardrails are implemented in shift-left scenarios (e.g.,
Vibin Daniel, Manager, Platform Engineering, Variant
catching misconfigurations before they are released), or
at time of deployment (e.g., via Admission Controllers) to
eliminate security vulnerabilities and blind spots, monitor
and optimize cost, equip developers to fix issues and
achieve compliance.
DevOps teams using Insights can stop serving as a “The cloud will hammer you to death with
Kubernetes help desk. bills. A major benefit of Fairwinds Insights is
its resource utilization and cost optimization.
Using Fairwinds Insights, we were able
to right-size the number of nodes and
databases per cluster. That understanding
Organizations of all sizes can use
helped us reduce our cost per cluster by
Fairwinds Insights. Learn more at 25%. With 25+ clusters in production and
www.fairwinds.com/insights. growing, that cost saving is significant.”
G
len Zangirolami, Principal DevOps Architect at
Decisio Health
WHY FAIRWINDS
Fairwinds is your trusted partner for Kubernetes governance and guardrails. With Fairwinds, customers ship
cloud-native applications faster, more cost effectively, and with less risk. We provide a unified view between
dev, sec, and ops, removing friction between those teams with software that simplifies complexity. Fairwinds
Insights is built on Kubernetes expertise and integrates our leading open source tools to help you save time,
reduce risk, and deploy with confidence.
WWW.FAIRWINDS.COM