Understanding SRE Principles

Reduce MTTR for on-call engineers by 5%

Develop buffers to ensure incidents remain at < 75% of the error budget

Mitigate false positive system alerts to reduce on-call staff costs

Speed up the resolution of critical incidents by 5%

Increase the coverage of 4-point SLIs from 90% of services to 100%

Reduce manual toil from 25% of responder time to 20%

Increase increment velocity in SRE project work with one-sprint reduction

Reduce operational work from 65% of total work time to 55%

Reduce incident recurrence from 8 out of 10 to 6 out of 10 incidents

Assure realistic SLA targets in line with current SLIs for > 97.5% of accounts

7 fundamental principles of SRE