1. Implement monitoring objectives, maintain alert playbooks, create reliability strategies (incl. maintaining SLOs3 & error budgets)
2. Triage operation / customer-reported incidents, provide on-call support for production issues, assist customer and support teams
3. Facilitate reliability tests (i.e., Root Cause Finding, Chaos Engineering, Failure Mode and Effects Analysis) to assess robustness of products