- What is Chaos Engineering
- Why do we need it
- How do we do it
- LitmusChaos overview
- LitmusChaos demonstration
What is Chaos Engineering?
> “Chaos Engineering is the discipline of experimenting on a system in order to build confidence in the system’s capability to withstand turbulent conditions in production.”
Your application resilience on Kubernetes >90% of Resilience of your application depends on –
other cloud-native components and the infrastructure
Why do we need Chaos Engineering?
1. Recognize the dangers and mitigate them before they cause havoc
2.Quick reaction time when an unexpected yet known failure occurs
3.Improved Application Security & Observability for enhancing reliability
4.Asses the failover plan and the cost involved in a safe environment
Litmus: The Cloud-Native Chaos Engineering Platform
Litmus is a toolset to do cloud-native Chaos Engineering that helps both Developers and SREs in automating the chaos experiments at different stages within the DevOps pipeline like development, during CI/CD, & in production.
It adopts a “Kubernetes-native” approach to define chaos intent in a declarative manner via custom resources to target Kubernetes, Containers, Public Cloud, Bare Metal, Applications resources.
1. 100% Open-Source
2. 50+ Chaos Experiments
3. 196k+ installations and 471k+ experiment runs
4. LitmusSDK available in GoLang, Python, and Ansible
5. Adopted by VMWare, Red Hat, Intuit, Wipro, Lenskart and many more!
How to Install Litmus?
1. Works with Kubernetes 1.17 or later and a Persistent Volume of 20GB
2. Install using Helm or kubectl
3. Namespace-scoped and Cluster-scoped installation
STEP 1: Add the litmus helm repository
helm repo add litmuschaos https://litmuschaos.github.io/litmus-helm/
STEP 2: Create the namespace
kubectl create ns litmus
STEP 3: Install Litmus ChaosCenter
helm install chaos litmuschaos/litmus –namespace=litmus
A few quick handy references