What is Sedai's AI-powered rightsizing for AWS EC2 VMs?
Sedai's AI-powered rightsizing for AWS EC2 VMs is an autonomous optimization solution that continuously analyzes your EC2 virtual machines, detects inefficiencies, and implements optimizations in real-time. It ensures your VMs are optimally sized to meet application performance and reliability requirements while minimizing costs, without requiring manual intervention. [Source]
How does Sedai's rightsizing process for AWS EC2 VMs work?
The process includes five steps: Discover (identifying your EC2 VM infrastructure and application patterns), Recommend (suggesting optimal settings based on deep insights), Validate (performing safety and timing checks), Execute (autonomously applying changes), and Learn (tracking results and updating the optimization model). This closed-loop approach ensures ongoing efficiency and reliability. [Source]
What are the main benefits of using Sedai for AWS EC2 VM optimization?
Sedai delivers up to 30% or more in cloud cost savings, up to 25% better latency for customer-facing services, and reduces the time needed to rightsize VMs by up to 90%. These improvements are achieved without impacting application performance or reliability. [Source]
How does Sedai's AI differ from traditional rightsizing tools for AWS EC2?
Unlike traditional tools that require manual effort or only provide recommendations, Sedai autonomously executes optimizations in real-time. It uses a full set of golden metrics (latency, errors, saturation, throughput) and performs safety checks before making changes, ensuring both cost efficiency and application reliability. [Source]
What types of AWS EC2 workloads benefit most from Sedai's rightsizing?
Sedai is particularly effective for applications not suited to microservices architectures, workloads with bursty traffic patterns, and environments where vertical scaling is preferred over horizontal scaling. It is ideal for dev/test workloads, small databases, and legacy applications running directly on VMs. [Source]
How does Sedai identify optimization opportunities for AWS EC2 VMs?
Sedai discovers your VM infrastructure, standardizes metrics across heterogeneous fleets, and identifies golden signals (latency, error, saturation, throughput) to drive optimization. Its AI analyzes these metrics to generate actionable recommendations and autonomously applies them after validation. [Source]
What is the impact of overprovisioning AWS EC2 VMs?
Overprovisioning leads to low utilization rates (often below 10%), unnecessary cloud costs, and inefficient resource allocation. Sedai addresses this by rightsizing VMs to actual workload requirements, reducing costs while maintaining performance. [Source]
How does Sedai handle bursty workloads on AWS EC2?
Sedai's AI analyzes traffic patterns and workload bursts, recommending suitable instance types (such as burstable VMs) and adjusting resources to handle surges efficiently without overprovisioning for average load. [Source]
What metrics does Sedai use for AWS EC2 VM optimization?
Sedai uses a comprehensive set of golden metrics: latency, error rates, saturation, and throughput. These metrics are standardized across different monitoring exporters and inform the AI's optimization decisions. [Source]
How does Sedai ensure the safety of optimization actions on AWS EC2 VMs?
Before executing any optimization, Sedai performs safety and timing checks to ensure actions can be safely applied without risk to application performance or reliability. Only after passing these checks does Sedai proceed with autonomous execution. [Source]
Pricing & Availability
How is Sedai's AWS EC2 VM optimization service priced?
Sedai offers flexible pricing based on the scale of your AWS EC2 VM deployment. For specific pricing details, you can request a demo or contact Sedai directly. [Source]
Is Sedai's AWS EC2 VM optimization service available now?
Yes, Sedai's AI-powered rightsizing for AWS EC2 VMs is available now. You can request a demo to see how it works for your environment. [Source]
Implementation & Technical Requirements
How long does it take to implement Sedai for AWS EC2 VM optimization?
Sedai is designed for quick, plug-and-play implementation. Setup typically takes just 5 minutes for general use and 15 minutes for specific use cases like AWS Lambda. This ensures minimal disruption and rapid time to value. [Source]
What technical resources are needed to get started with Sedai?
To get started, you need to provide cloud access (using IAM), a monitoring source, and, for Kubernetes clusters, integration via Sedai's Smart Agent. Assistance from your security team may be required to provide sufficient access. [Source]
What support resources does Sedai provide for onboarding?
Sedai offers live onboarding support, comprehensive documentation, a Slack community for real-time help, and the option to schedule personalized onboarding calls. [Source]
Does Sedai provide technical documentation for AWS EC2 VM optimization?
Yes, Sedai provides extensive technical documentation, including a Getting Started Guide and detailed resources for platform configuration and optimization. Access the documentation at docs.sedai.io/get-started/.
Performance, Security & Compliance
What performance improvements can be expected from Sedai's AWS EC2 VM optimization?
Customers can expect up to 30% or more in cloud cost savings, up to 25% better latency, and a 90% reduction in operations effort for rightsizing VMs. These improvements are based on early adopter results and case studies. [Source]
Is Sedai SOC 2 certified?
Yes, Sedai is SOC 2 certified, demonstrating adherence to stringent security and compliance standards for data protection. Learn more on the Sedai Security page.
Integrations & Ecosystem
What integrations does Sedai support for AWS EC2 VM optimization?
Sedai integrates with major cloud platforms (AWS, Google Cloud, Azure, IBM Cloud, Oracle Cloud), notification providers (Slack, Teams, Webhook, Email), ITSMs (BMC, Jira, ServiceNow), monitoring tools (AppDynamics, CloudWatch, DataDog, Dynatrace, New Relic, Prometheus), IaC/CI/CD tools (GitHub, GitLab, Bitbucket, Terraform), and more. See the full list at sedai.io/integrations.
Use Cases & Customer Success
Can you share an example of cost savings achieved with Sedai's AWS EC2 VM optimization?
Yes. A technology company identified over $75,000 in annual savings (a 34% reduction) in its dev/test environments through rightsizing with Sedai. [Source]
What other customer success stories are available for Sedai?
Sedai has helped KnowBe4 achieve up to 50% cost savings, Palo Alto Networks save $3.5 million, and Belcorp reduce AWS Lambda latency by 77%. See more stories at sedai.io/resources#Customer-Stories.
Which industries are represented in Sedai's case studies?
Industries include cybersecurity (Palo Alto Networks, KnowBe4), IT (HP), information services (Experian), financial services (Capital One), SaaS (Freshworks, Inflection), supply chain (Flex), insurance software (Guidewire), scientific research (Oak Ridge National Laboratory), e-commerce, and online travel. [Source]
Who are some of Sedai's notable customers?
Notable customers include Palo Alto Networks, HP, Experian, KnowBe4, Capital One, Flex, Guidewire, Oak Ridge National Laboratory, and Freshworks. [Source]
Competition & Differentiation
How does Sedai compare to AWS Compute Optimizer for EC2 rightsizing?
Sedai goes beyond AWS Compute Optimizer by using a broader set of metrics (not just utilization), performing safety checks, and autonomously executing optimizations. Compute Optimizer often requires manual effort and does not validate changes for safety or timing. [Source]
What makes Sedai's AWS EC2 VM optimization unique compared to other solutions?
Sedai is 100% autonomous, optimizes across multiple platforms, uses AI-driven insights, proactively resolves issues, and tracks release impacts. It reduces manual toil and delivers measurable cost and performance improvements, as validated by customer case studies. [Source]
What are the advantages of Sedai for different user segments?
Enterprises benefit from significant cost savings and compliance, DevOps teams gain productivity by reducing manual toil, cloud engineers enjoy less monitoring and manual adjustment, and startups/SMBs appreciate quick setup and flexible pricing. [Source]
Pain Points & Problems Solved
What core problems does Sedai solve for AWS EC2 VM users?
Sedai addresses high cloud costs from overprovisioning, poor application performance due to misconfigured VMs, operational inefficiency from manual rightsizing, and the complexity of choosing optimal VM types among hundreds of options. [Source]
What pain points do Sedai customers commonly express?
Customers often face high cloud costs, application latency, availability issues, manual operational toil, and challenges with release quality. Sedai's autonomous optimization directly addresses these pain points, as shown in customer case studies. [Source]
Target Audience & Use Cases
Who is the target audience for Sedai's AWS EC2 VM optimization?
Target users include Site Reliability Engineers, Platform Engineers, DevOps teams, Engineering Leaders, CTOs, and Architects at organizations managing cloud operations across industries such as cybersecurity, SaaS, financial services, e-commerce, and more. [Source]
What use cases are best suited for Sedai's AWS EC2 VM optimization?
Best suited use cases include cost optimization for underutilized VMs, performance improvement for customer-facing services, reducing manual rightsizing effort, and optimizing legacy or monolithic applications that cannot easily move to microservices or serverless architectures. [Source]
Introducing AI-Powered Rightsizing for AWS EC2 VMs
JJ
John Jamie
Content Writer
May 7, 2024
Featured
Summary
AWS EC2 VMs are poorly utilized. It is possible to see CPU utilization under 10%. Many VMs are oversized, leading to unnecessary costs. Doubling utilization, which is often possible, halves costs.
Current rightsizing solutions like AWS Compute Optimizer often (1) require manual effort (2) do not use the full set of golden metrics to optimize applications (e.g., using utilization metrics only and not considering latency and errors, and (3) do not perform safety checks to validate the change can be made.
Sedai’s AWS EC2 VM optimization finds the lowest cost VM type subject to performance and reliability requirements.
Early users of Sedai's optimization technology have found reductions in cloud costs without impacting application performance
Introduction
In environments where applications are not suitable for microservices architectures, rightsizing and in particular vertical scaling becomes a critical strategy to achieve cost-effective operations while meeting performance requirements. This approach involves choosing the right virtual machine type based on the CPU and memory resources required. Rightsizing is a known best practice for AWS EC2 VM cost optimization but is hard to implement in practice.
The Rightsizing Problem
Public Cloud VM Utilization is Low
While AWS utilization data has not been disclosed publicly, analysis of the most recently released Azure VM usage dataset (available on Microsoft’s Github here) shows that VM users had an an average utilization of just 8.2%, with 72% of users having an average utilization of below 20% (see distribution below). A common pattern uncovered in the dataset was the selection of a small number of powerful, but oversized VMs.
Source: “Using Virtual Machine Size Recommendation Algorithms to Reduce Cloud Cost”, March 2023
Below is an example from a Sedai AWS EC2 customer environment of an underutilized instance with averaging just a few percent of CPU being used across a one month period:
Causes of Overprovisioning for AWS EC2 VMs
Developer Bias to Overprovision
Effective vertical scaling is also important as developers often default to overprovisioning AWS EC2 VM resources, opting for a simpler and quicker setup rather than conducting extensive testing across multiple instance types. This approach, while expedient, typically results in selecting VM configurations that exceed the application's actual requirements, leading to increased costs. The reluctance to engage in detailed testing stems from the time and complexity involved in evaluating each instance type's performance under different workloads. Consequently, developers lean towards a 'better safe than sorry' strategy. Although reducing the risk of underperformance this inefficiently raises cloud costs.
Bursty Workloads
Many Virtual Machine workloads have bursty traffic patterns, especially for small databases, development environments, and low-traffic websites. For example, in the case below of a dev/test workload CPU utilization stays around 5% but surges a few times to the 30-40% range.
Given warm up periods may range from a few minutes to a few hours, horizontal scaling may not be viable. Finding suitable burstable instance types may be the preferred approach. In the absence of that, low average utilization will be achieved.
App Architecture Limits Horizontal Scaling
A high proportion of applications running on AWS EC2 run directly on virtual machines. These applications have not been replatformed to a microservice architecture such as Kubernetes (including AWS EC2 Kubernetes Service (AKS)), or serverless frameworks (AWS EC2 Functions). One key reason is that many of these applications do not benefit significantly from the horizontal scaling capabilities offered by microservices architectures. They may have architectural or design constraints that make such a transition complex or suboptimal, limiting their ability to efficiently use newer computing paradigms.
Complex Set of VM Choices
Vertical scaling is also complicated by the many types of VMs available. There are currently709 types of AWS EC2 Virtual Machine options offering varying Compute, Memory and other characteristics.
Asking an engineer to make the optimal choice across potentially hundreds or thousands of services can be challenging, especially if new code updates change the service's characteristics and then require a new determination of the right instance type.
Importance of Vertical Scaling for AWS EC2 VMs
Vertical scaling is particularly advantageous for applications that require high-performance levels from single instances or have dependencies that complicate distribution across multiple servers. By optimizing the configuration of AWS EC2 VMs to align closely with actual workload requirements, organizations can ensure that their applications perform optimally without incurring unnecessary costs from overprovisioning. This method allows for more precise control over resource allocation, leading to enhanced performance and reduced expenditures.
Rightsizing AWS EC2 Virtual Machines with Sedai
Key Capabilities
Sedai’s Automated Optimization utilizes advanced AI technology to deeply comprehend AWS EC2 VM configurations and their impact on application cost and performance. This results in AWS EC2 VMs that are optimally sized and configured to meet the specific needs of applications without incurring unnecessary costs or performance issues. Key benefits include:
Cloud Cost Efficiency: AWS EC2 VM costs can be reduced by up to 30% or more through optimized resource allocation.
Performance Improvement: Enhance customer-facing services with up to 25% better latency, ensuring a smoother user experience.
Reduced operations effort. The time needed to rightsize VMs is reduced by up to 90%.
Sedai’s Automated Optimization uses advanced AI that not only deeply understands AWS EC2 VM configurations and how they are impacting application cost and performance. This results in VMs that are optimally sized and configured to meet the specific needs of their applications without any excess cost or underperformance.
How It Works
Our AI-driven platform continuously analyzes your AWS EC2 VMs to detect inefficiencies. It then autonomously implements optimizations, adjusting resources in real-time without requiring manual intervention.
The Sedai platform operates on a simple yet effective process: Discover, Recommend, Validate, Execute, and Track:
Discover: Sedai first discovers your AWS EC2 VM infrastructure and application pattern, going through three steps:some textIdentifying the app boundary by looking at traffic patterns (e.g., because they use a common load balancer, or by virtual machine tagging). A set of virtual machines doing the same task and expected to behave similarly can be termed an application. This definition means that a collective action will be taken on all the instances of the app.Standardizing metrics for optimization. In a heterogeneous fleet, a service may use Node Exporter for Linux, or WMI Exporter or Windows exporter for Windows. It is important that the metrics are labeled correctly such that the system can precisely identify the metrics of a specific application.Identifying golden signals to drive optimization. Finding the right signal to listen to can seem like finding a needle in a haystack. Sedai will look for the best golden metrics (latency, error, saturation, and throughput of an application) so that this information can be used in algorithms and machine learning systems such that a recommendation can be generated.
Identifying the app boundary by looking at traffic patterns (e.g., because they use a common load balancer, or by virtual machine tagging). A set of virtual machines doing the same task and expected to behave similarly can be termed an application. This definition means that a collective action will be taken on all the instances of the app.
Standardizing metrics for optimization. In a heterogeneous fleet, a service may use Node Exporter for Linux, or WMI Exporter or Windows exporter for Windows. It is important that the metrics are labeled correctly such that the system can precisely identify the metrics of a specific application.
Identifying golden signals to drive optimization. Finding the right signal to listen to can seem like finding a needle in a haystack. Sedai will look for the best golden metrics (latency, error, saturation, and throughput of an application) so that this information can be used in algorithms and machine learning systems such that a recommendation can be generated.
Recommend: then recommends optimal settings based on deep insights into service behavior and dependencies. Recommendations may be provided on a manual basis or occur automatically based on user settings.
Validate: After validating potential changes through multiple safety checks, a sequence of steps so that it could be performed safely on the customer environment:some textSafety check: If there is an action, we need to ask whether this action can be safely performed on that application without risk. If you have a green signal there, you go to the next one.Timing check: We see if it is the right time to apply the action, or is there a later preferred time to execute this particular action on this application?
Safety check: If there is an action, we need to ask whether this action can be safely performed on that application without risk. If you have a green signal there, you go to the next one.
Timing check: We see if it is the right time to apply the action, or is there a later preferred time to execute this particular action on this application?
Execute: Once we have a go-ahead for these validation steps, Sedai goes ahead and performs the action.
Learn: After performing the action, we need to figure out if the app is healthy. Updates are also tracked with a full audit trail of changes made to the infrastructure. This step is also important because this allows us to close our learning loop and use this information for further actions.
These capabilities form part of Sedai’s overall AWS EC2 VM optimization approach which can be seen below:
Some of the key elements above are:
Access to Cloud APIs which allow Sedai to identify and discover the components of your infrastructure. Sedai’s inference engine actually utilizes this information to build a topology. With this topology information, we deduce the application.
Metrics exporter & Sedai Core. With the information about the application, Sedai’s metric exporter takes the data from all the monitoring providers. With the information about the application and the metrics, Sedai machine learning algorithms can generate optimization and remediation opportunities.
Execution engine. The recommendations are given to the execution engine. The execution engine utilizes AWS APIs to perform the actions on the EC2 VMs.
Example 34% Cost Saving from AWS EC2 Rightsizing
Early adopters have seen significant improvements in both performance and cost efficiency. For instance, a technology company has identified over $75k of annual savings, a 34% saving, in its dev / test environments through rightsizing using Sedai’s optimization:
To gain insights on the state of your VM fleet you can scan it at a glance to see where applications are over or under provisioned as well as optimized based on Sedai’s findings. The example below only 12% of the apps have been optimized to date (shown as green).
Pricing and Availability
The service is available now, with flexible pricing based on the scale of your AWS EC2 VM deployment. Request a demo to see how Sedai can help you rightsize your AWS EC2 VMs.