Compatibility

Last updated 27 days ago

Steadybit offers a wide variety of fault injections, checks and integrations (called actions) for chaos experiments, which are provided by Steadybit's open-source extensions. These actions can be combined in a timeline-based editor to build your chaos engineering experiment.

This page provides an overview of the supported technologies. A detailed list is available on the Steadybit Reliability Hub.

Containers

The following capabilities are available when targeting containers, running standalone or in a container orchestration platform like Kubernetes:

Network Attacks

|  | Block DNS | Block Traffic | Corrupt Outgoing Packages | Delay Outgoing Traffic | Drop Outgoing Traffic | Limit Outgoing Bandwidth |
| --- | --- | --- | --- | --- | --- | --- |
| Docker | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ |
| CRI-O | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ |
| containerd | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ |
| Kubernetes | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ |
| Red Hat OpenShift | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ |
| AWS Elastic Kubernetes Service (EKS) | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ |
| AWS Elastic Container Service (ECS) on EC2 | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ |
| AWS Elastic Container Service (ECS) on Fargate | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ |
| Google Kubernetes Engine (GKE) | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ |
| Google Kubernetes Engine (GKE, Autopilot) | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ |
| Azure Kubernetes Service (AKS) | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ |
| minikube | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ |

Resource Attacks

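|  | Fill Disk | Fill Memory | Stress CPU | Stress IO | Stress Memory |
| --- | --- | --- | --- | --- | --- |
| Docker | ✅ | ✅ | ✅ | ✅ | ✅ |
| CRI-O | ✅ | ✅ | ✅ | ✅ | ✅ |
| containerd | ✅ | ✅ | ✅ | ✅ | ✅ |
| Kubernetes | ✅ | ✅ | ✅ | ✅ | ✅ |
| Red Hat OpenShift | ✅ | ✅ | ✅ | ✅ | ✅ |
| AWS Elastic Kubernetes Service (EKS) | ✅ | ✅ | ✅ | ✅ | ✅ |
| AWS Elastic Container Service (ECS) on EC2 | ✅ | ✅ | ✅ | ✅ | ✅ |
| AWS Elastic Container Service (ECS) on Fargate | ✅ | ❌ | ✅ | ✅ | ✅ |
| Google Kubernetes Engine (GKE) | ✅ | ✅ | ✅ | ✅ | ✅ |
| Google Kubernetes Engine (GKE, Autopilot) | ✅ | ✅ | ✅ | ✅ | ✅ |
| Azure Kubernetes Service (AKS) | ✅ | ✅ | ✅ | ✅ | ✅ |
| minikube | ✅ | ✅ | ✅ | ✅ | ✅ |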

State Attacks

|  | Pause Container | Stop Container |
| --- | --- | --- |
| Docker | ✅ | ✅ |
| CRI-O | ✅ | ✅ |
| containerd | ✅ | ✅ |
| Kubernetes | ✅ | ✅ |
| Red Hat OpenShift | ✅ | ✅ |
| AWS Elastic Kubernetes Service (EKS) | ✅ | ✅ |
| AWS Elastic Container Service (ECS) on EC2 | ✅ | ✅ |
| AWS Elastic Container Service (ECS) on Fargate | ❌ | ✅ |
| Google Kubernetes Engine (GKE) | ✅ | ✅ |
| Google Kubernetes Engine (GKE, Autopilot) | ✅ | ✅ |
| Azure Kubernetes Service (AKS) | ✅ | ✅ |
| minikube | ✅ | ✅ |
Kubernetes

Steadybit's capabilities for containers and physical and virtual hosts also work within Kubernetes clusters.

On top, Steadybit supports attacks based on the Kubernetes API:

Attacks

|  | Cause Crash Loop | Delete Pod | Rollout Restart Deployment | Scale Deployment | Scale StatefulSet | Taint Node |
| --- | --- | --- | --- | --- | --- | --- |
| Kubernetes | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ |
| Red Hat OpenShift | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ |
| AWS Elastic Kubernetes Service (EKS) | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ |
| Google Kubernetes Engine (GKE) | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ |
| Google Kubernetes Engine (GKE, Autopilot) | ✅ | ✅ | ✅ | ✅ | ✅ | ❌ |
| Azure Kubernetes Service (AKS) | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ |
| minikube | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ |

Checks

|  | DaemonSet Pod Count | Deployment Pod Count | Deployment Rollout Status | Node Count | StatefulSet Pod Count |
| --- | --- | --- | --- | --- | --- |
| Kubernetes | ✅ | ✅ | ✅ | ✅ | ✅ |
| Red Hat OpenShift | ✅ | ✅ | ✅ | ✅ | ✅ |
| AWS Elastic Kubernetes Service (EKS) | ✅ | ✅ | ✅ | ✅ | ✅ |
| Google Kubernetes Engine (GKE) | ✅ | ✅ | ✅ | ✅ | ✅ |
| Google Kubernetes Engine (GKE, Autopilot) | ✅ | ✅ | ✅ | ✅ | ✅ |
| Azure Kubernetes Service (AKS) | ✅ | ✅ | ✅ | ✅ | ✅ |
| minikube | ✅ | ✅ | ✅ | ✅ | ✅ |

Other Actions

|  | Display Pod Count Metrics | Display Kubernetes Event Logs |
| --- | --- | --- |
| Kubernetes | ✅ | ✅ |
| Red Hat OpenShift | ✅ | ✅ |
| AWS Elastic Kubernetes Service (EKS) | ✅ | ✅ |
| Google Kubernetes Engine (GKE) | ✅ | ✅ |
| Google Kubernetes Engine (GKE, Autopilot) | ✅ | ✅ |
| Azure Kubernetes Service (AKS) | ✅ | ✅ |
| minikube | ✅ | ✅ |

Physical and Virtual Hosts

The following attacks are available when working with physical hosts and virtual machines (both 64-bit PC and 64-bit ARM architectures).

Network Attacks

|  | Block DNS | Block Traffic | Corrupt Outgoing Packages | Delay Outgoing Traffic | Drop Outgoing Traffic | Limit Outgoing Bandwidth |
| --- | --- | --- | --- | --- | --- | --- |
| Ubuntu 20.04 | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ |
| Ubuntu 22.04 | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ |
| Ubuntu 24.04 | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ |
| Fedora Latest | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ |
| Debian Bookworm | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ |
| Debian Bullseye | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ |
| Amazon Linux 2 | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ |
| Amazon Linux 2023 | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ |

Other .deb- and .rpm-based distributions will most likely work too, but aren't explicitly tested.

Resource Attacks

|  | Fill Disk | Fill Memory | Stress CPU | Stress IO | Stress Memory |
| --- | --- | --- | --- | --- | --- |
| Ubuntu 20.04 | ✅ | ✅ | ✅ | ✅ | ✅ |
| Ubuntu 22.04 | ✅ | ✅ | ✅ | ✅ | ✅ |
| Ubuntu 24.04 | ✅ | ✅ | ✅ | ✅ | ✅ |
| Fedora Latest | ✅ | ✅ | ✅ | ✅ | ✅ |
| Debian Bookworm | ✅ | ✅ | ✅ | ✅ | ✅ |
| Debian Bullseye | ✅ | ✅ | ✅ | ✅ | ✅ |
| Amazon Linux 2 | ✅ | ✅ | ✅ | ✅ | ✅ |
| Amazon Linux 2023 | ✅ | ✅ | ✅ | ✅ | ✅ |

Other .deb- and .rpm-based distributions will most likely work too, but aren't explicitly tested.

State Attacks

|  | Shutdown Host | Stop Process | Time Travel |
| --- | --- | --- | --- |
| Ubuntu 20.04 | ✅ | ✅ | ✅ |
| Ubuntu 22.04 | ✅ | ✅ | ✅ |
| Ubuntu 24.04 | ✅ | ✅ | ✅ |
| Fedora Latest | ✅ | ✅ | ✅ |
| Debian Bookworm | ✅ | ✅ | ✅ |
| Debian Bullseye | ✅ | ✅ | ✅ |
| Amazon Linux 2 | ✅ | ✅ | ✅ |
| Amazon Linux 2023 | ✅ | ✅ | ✅ |

Other .deb- and .rpm-based distributions will most likely work too, but aren't explicitly tested.

Cloud Providers

Steadybit supports fault injection on all major cloud providers and an ever-growing list of managed services.

AWS

AWS EC2

Steadybit's capabilities for physical and virtual hosts also work for AWS EC2 environments.

On top, Steadybit supports attacks based on the AWS EC2 API:

|  | Hibernate | Reboot | Start | Stop | Terminate |
| --- | --- | --- | --- | --- | --- |
| EC2 Instance | ✅ | ✅ | ✅ | ✅ | ✅ |

AWS ECS on EC2

Steadybit's capabilities for AWS EC2s and containers also work for AWS ECS on EC2 environments.

On top, Steadybit provides capabilities based on the AWS ECS API:

|  | Scale Service | Service Event Log | Service Task Count |
| --- | --- | --- | --- |
| AWS Elastic Container Service (ECS) on EC2 | ✅ | ✅ | ✅ |

AWS ECS on Fargate

Steadybit's capabilities for containers also work for AWS ECS on Fargate environments.

On top, Steadybit provides capabilities based on the AWS ECS API:

|  | Scale Service | Service Event Log | Service Task Count |
| --- | --- | --- | --- |
| AWS Elastic Container Service (ECS) on Fargate | ✅ | ✅ | ✅ |

AWS EKS (Elastic Kubernetes Service)

Steadybit's capabilities for containers, Kubernetes and AWS EC2s also work for AWS EKS environments.

AWS ELB (Elastic Load Balancing)

Steadybit supports attacks based on the AWS ALB API:

|  | Return Static Response |
| --- | --- |
| ALB | ✅ |

AWS ElastiCache

Steadybit supports attacks based on the AWS ElastiCache API:

|  | Node Group Failover |
| --- | --- |
| ElastiCache | ✅ |

AWS Fault Injection Service (FIS)

Steadybit integrates with AWS FIS experiment templates, which makes it easy to inject faults into additional managed services. See the AWS extension for more details.

|  | Start Experiment Template |
| --- | --- |
| FIS | ✅ |

AWS Lambda

Steadybit supports attacks based on the failure injection wrapper failure-lambda:

|  | Block TCP Connections | Fill Diskspace | Inject Exception | Inject Latency | Inject Status Code |
| --- | --- | --- | --- | --- | --- |
| Lambda | ✅ | ✅ | ✅ | ✅ | ✅ |

AWS Managed Streaming for Kafka (MSK)

Steadybit's capabilities for Kafka also work for AWS MSK environments.

On top, Steadybit provides capabilities based on the AWS MSK API:

|  | Broker Reboot |
| --- | --- |
| MSK | ✅ |

AWS Relational Database Service (RDS)

Steadybit provides capabilities based on the AWS RDS API for RDS clusters and instances.

RDS Clusters

|  | Cluster Failover |
| --- | --- |
| RDS Cluster | ✅ |

RDS Instance

|  | Reboot | Stop |
| --- | --- | --- |
| RDS Instance | ✅ | ✅ |

Other AWS Services

Steadybit has cross-service capabilities like simulating a full availability zone outage or a partial zone outage via subnets.

Additional managed services are covered via AWS FIS, or will be added to Steadybit natively in the future.

Azure

Azure Kubernetes Service (AKS)

Steadybit's capabilities for containers, Kubernetes and Azure VMs also work for Azure AKS environments.

Azure Event Hub

Steadybit's capabilities for Kafka also work for Azure Event Hub, leveraging its Kafka compatibility.

Azure Virtual Machines (VMs)

Steadybit's capabilities for physical and virtual hosts also work for Azure Virtual Machines (VMs).

On top, Steadybit supports attacks based on the Azure VMs API:

Virtual Machine

|  | Reboot | Delete | Stop | Deallocate |
| --- | --- | --- | --- | --- |
| Virtual Machine | ✅ | ✅ | ✅ | ✅ |

Virtual Machine Scale Set Instances

|  | Reboot | Delete | Stop | Deallocate |
| --- | --- | --- | --- | --- |
| Scale Set Instance | ✅ | ✅ | ✅ | ✅ |

Other Azure Services

More Azure services will be added to the Azure extension in the future.

GCP

Google Kubernetes Engine (GKE)

Steadybit's capabilities for containers, Kubernetes and GCP VMs also work for Google GKE.

GCP Virtual machines (VMs)

Steadybit's capabilities for physical and virtual hosts also work for GCP virtual machines (VMs).

On top, Steadybit supports attacks based on the GCP VMs API:

|  | Reset | Delete | Stop | Suspend |
| --- | --- | --- | --- | --- |
| Virtual Machine | ✅ | ✅ | ✅ | ✅ |

Other GCP Services

More GCP services will be added to the GCP extension in the future.

Service Mesh & API Gateway

Steadybit provides the following fault injections for service meshes and API gateways.

|  | GRPC Abort | HTTP Abort | HTTP Delay |
| --- | --- | --- | --- |
| Istio | ✅ | ✅ | ✅ |
| Kong | ❌ | ✅ | ❌ |

Kafka

Steadybit offers comprehensive support for chaos experiments on Kafka infrastructure.

Broker

|  | Limit Connection Creation Rate | Limit IO Thread | Limit Network Threads |
| --- | --- | --- | --- |
| Unmanaged Kafka | ✅ | ✅ | ✅ |
| AWS Managed Streaming for Kafka (MSK) | ✅ | ✅ | ✅ |
| Azure Event Hub (Kafka) | ✅ | ✅ | ✅ |

Consumer

|  | Check Consumer State | Check Topic Lag | Deny Access |
| --- | --- | --- | --- |
| Unmanaged Kafka | ✅ | ✅ | ✅ |
| AWS Managed Streaming for Kafka (MSK) | ✅ | ✅ | ✅ |
| Azure Event Hub (Kafka) | ✅ | ✅ | ✅ |

Topics

|  | Check Partitions | Produce Records | Reduce Message Batch Size | Delete Records | Partition Leader Election |
| --- | --- | --- | --- | --- | --- |
| Unmanaged Kafka | ✅ | ✅ | ✅ | ✅ | ✅ |
| AWS Managed Streaming for Kafka (MSK) | ✅ | ✅ | ✅ | ✅ | ✅ |
| Azure Event Hub (Kafka) | ✅ | ✅ | ✅ | ✅ | ✅ |

Java Virtual Machine (JVM) / Spring Applications

Steadybit supports the following application-level faults for JVM- and Spring-based applications. These don't require any dependency and work at runtime via bytecode manipulation.

|  | Spring Controller Delay | Spring Controller Exception | HTTP Client Delay | HTTP Client Status | Method Delay | Method Exception | JDBC Template Delay | JDBC Template Exception |
| --- | --- | --- | --- | --- | --- | --- | --- | --- |
| JVM application | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ |

Observability

Steadybit supports the following observability-related experiment actions:

|  |  |  |  |
| --- | --- | --- | --- |
| Datadog | ✅ | ✅ | ✅ |
| Dynatrace | ✅ | ✅ | ✅ |
| Grafana | ✅ | ❌ | ✅ |
| Instana | ✅ | ✅ | ❌ |
| New Relic | ✅ | ✅ | ✅ |
| Prometheus | ✅ | ❌ | ❌ |
| Splunk | ✅ | ❌ | ✅ |
| StackState | ✅ | ❌ | ❌ |

Load and API Testing

Steadybit integrates with the following load- and API-testing solutions:

|  | Run Test from Experiment | Run Experiment from Test |
| --- | --- | --- |
| Micro Focus LoadRunner Professional | ❌ | ✅ |
| Micro Focus LoadRunner Enterprise | ❌ | ✅ |
| Gatling | ✅ |  |
| JMeter | ✅ |  |
| K6 | ✅ | ✅ |
| K6 Cloud | ✅ | ✅ |
| Postman | ✅ |  |
| Custom HTTP Check | ✅ |  |