Site Reliability Engineering (SRE)
- Snippet from Wikipedia: Site reliability engineering
Site Reliability Engineering (SRE) is a discipline within software engineering and IT infrastructure support that monitors and improves the availability and performance of deployed software systems and large software services. These services are expected to deliver reliable response times across events such as new software deployments, hardware failures, and cybersecurity attacks. SRE typically emphasizes automation and an Infrastructure as Code methodology, and it draws on software engineering, IT infrastructure, web development, and operations to improve reliability. It is similar to DevOps in that both aim to improve the reliability and availability of deployed software systems.