Published on 00/00/0000
Last updated on 00/00/0000
Published on 00/00/0000
Last updated on 00/00/0000
Share
Share
PRODUCT
7 min read
Share
Cloud cost management series: Overspending in the cloud Managing spot instance clusters on Kubernetes with Hollowtrees Monitor AWS spot instance terminations Diversifying AWS auto-scaling groups Draining Kubernetes nodes Cluster recommender Cloud instance type and price information as a service
You may remember the Hollowtrees project we open sourced a few weeks ago: a framework for the management of AWS spot instance clusters, batteries included:
Hollowtrees Last week we introduced a spot instance termination exporter for Prometheus. In this post we're going to take a deep dive into another core component, the AWS Autoscaling action plugin. The action plugin (and other components) can be used independently of Pipeline or Hollowtrees.
We wanted to use spot instances in our long-running AWS Kubernetes clusters. Accordingly, we needed to find a way to use auto-scaling groups with multiple instance types and different bid prices. In order to accomplish that, we created a project that listens on a gRPC
interface, and can be instructed to swap a running instance in an auto-scaling group with another instance with different cost or stability characteristics. This diversifies instance types in a group, and reduces the chances of critical failure occuring in the cluster.
Using spot instances can be useful for fault-tolerant workloads, when it doesn't matter if (some) instances are taken away. When using spot instances in auto scaling groups, the naive approach is to set instance type and spot price during the launch configuration. But that means that even if your auto-scaling group is spread across availability zones, a very large portion of instances will be running in the same spot market and, therefore, there exists a high probability that such instances would be taken offline when spot pricing surges above the bid price, which would have a critical impact on the cluster. AWS came up with Spot Fleets for that very reason, but a spot fleet is a completely different service with a different API that doesn't integrate as easily with other AWS services (like ELB). If you're not familiar with spot instances, make sure to read our previous blog post first, which recaps the lifecycle of spot instances and describes the different ways of requesting spot instances. Or, for a more in-depth guide, read the corresponding part of the AWS documentation
AWS auto-scaling groups only use one launch configuration, which describes the instance type and the spot price of all the instances in the group. So if you want to use multiple types you'll need to use a little trick: the AWS API allows detaching and attaching instances to a group, regardless of its instance type and other configurations. The business logic
of this plugin lies in using this trick to swap existing instances in the cluster for different instances. Let's see the steps laid out in the code:
This project is a Hollowtrees action plugin, so its gRPC
interface follows the standard Hollowtrees plugin interface. This interface uses an event-based schema, and is described in a simple .proto
file in the Hollowtrees repository, here. The interface is very simple, it has only one rpc
function wherein the requested payload is an AlertEvent
message that follows the structure of current CloudEvents specifications. Routing inside the plugin is done via the EventType
of the message. We've implemented a very simple "SDK" for this interface. To use it, the plugin needs only to do two things:
AlertHandler
interface's Handle
method with custom behaviour. In this case, it calls the instance swap mechanism with the payload from the event.Serve()
function from the SDK via the new AlertHandler
. This will start the gRPC server that listens on the configured port.To keep things simple, building the project is as simple as running a go build
command. The result is a statically linked executable binary.
go build .
The following options can be configured while starting the binary. Configuration is done through a plugin-config.toml
file that can be placed in the ./conf/
directory or near the binary. Notes:
instance profile
configuration files in the home directory (~/.aws/credentials
) as well as environment variables can be used.[log]
format = "text"
level = "debug"
[plugin]
port = "8888"
region = "eu-west-1"
Running the project is as easy as running the executable binary from the build.
At Banzai Cloud we run all our deployments inside Kubernetes. We use a standard Helm package manager, but all our deployments use Pipeline - we've previously made Helm deployments available over a RESTful API, as well. The charts are available as part of our GitHub charts repository. Install the exporter's chart with the release name aws-asg-action-plugin
, like so:
$ helm install --name aws-asg-action-plugin banzaicloud-incubator/ht-aws-asg-action-plugin
The helm chart will deploy a configmap for the configuration file, a pod for the action plugin, and a service to reach the gRPC interface.
To test the plugin, you'll need to write a client that calls the gRPC interface, or use the Hollowtrees engine and configure some rules that govern when to swap an instance. Or, if you just want to play around, use a tool like grpcc to connect to the plugin and send a few basic events.
We consider this plugin to be in an early alpha version. There are important things missing, like security on the gRPC interface, things to improve, like the recommendation engine that's called to discern new instance-types and bid prices, and new features that will keep the auto-scaling group in a stable state, like cooldown periods, or that keep a percentage of the cluster on on-demand instances.
Get emerging insights on innovative technology straight to your inbox.
Discover why security teams rely on Panoptica's graph-based technology to navigate and prioritize risks across multi-cloud landscapes, enhancing accuracy and resilience in safeguarding diverse ecosystems.
The Shift is Outshift’s exclusive newsletter.
The latest news and updates on cloud native modern applications, application security, generative AI, quantum computing, and other groundbreaking innovations shaping the future of technology.