AutoZone's Site Reliability Engineering (SRE) team is seeking a Systems Engineer with a focus on SRE Enablement. This position is responsible for promoting reliability and operational excellence throughout the engineering organization. The successful candidate will play a key role in establishing standards, developing shared tools, providing guidance to development teams, and cultivating a culture of reliability across our hybrid infrastructure, which includes a primary emphasis on the Google Cloud Platform (GCP), as well as on-premises servers and applications.
The SRE Enablement Engineer collaborates closely with application, infrastructure, and architecture teams to integrate SRE best practices early in the software development lifecycle and ensure platforms adhere to rigorous production readiness standards.
Responsibilities
- Define enterprise-wide reliability standards, Service-Level Objective (SLO) frameworks, and error budget policies.
- Establish production readiness criteria that teams must meet prior to launching and conduct production readiness reviews across teams.
- Own, document, and maintain the internal SRE handbook and reliability playbooks.
- Build, maintain, and standardize shared observability platforms, specifically leveraging Dynatrace to be consumed by all engineering teams.
- Provide templates for alerting, dashboards, and runbooks across both cloud and on-premises application workloads.
- Participate in the incident management process, including post-mortem analysis, to continuously strengthen systemic reliability.
- Run SRE training programs and reliability workshops for engineering teams.
- Coach and mentor teams on SLO-based thinking and error budget management.
- Embed proactive SRE practices and a continuous improvement mindset into the broader engineering culture.
- Track and report reliability metrics across the enterprise, rather than just a single service.
- Identify systemic reliability gaps and trends across cross-functional teams.
- Report organizational reliability health to leadership and hold teams accountable to agreed-upon operational standards.
- Act as an internal consultant during architecture and system design reviews.
- Advise development teams on reliability design patterns (e.g., circuit breakers, retries, graceful degradation) suitable for a hybrid GCP and on-premises environment.
- Engage early in new product development to influence system reliability from the outset.
Qualifications
- Bachelor's degree in computer science, MIS, Information Technology, or a related field, or equivalent practical experience.
4 to 7 years of experience in Systems Engineering, DevOps, or SRE-related roles. - Deep understanding of Site Reliability Engineering principles, particularly regarding SLOs, SLIs, error budgets, and TOIL reduction.
- Hands-on experience building, administering, and optimizing observability and APM pipelines, with a strong focus on Dynatrace.
- Strong experience deploying and supporting workloads in Google Cloud Platform (GCP), as well as maintaining legacy on-premises servers and applications.
- Experience with container orchestration platforms (e.g., Kubernetes).
- Strong programming/scripting skills (e.g., Python, Golang, Java) and experience with IaC tools (e.g., Terraform, Ansible).
Exceptional communication and consulting skills, with the ability to influence architecture decisions and translate technical concepts to non-technical leadership.
Company
- Competitive pay
- Unrivaled company culture
- Medical, dental and vision plans
- Exclusive discounts and perks, including an AutoZone in-store discount
- 401(k) with company match and Stock Purchase Plan
- AutoZoners Living Well Program for free mental health support
- Opportunities for career growth
- Paid time off
- Life, and short- and long-term disability insurance options
- Health Savings and Flexible Spending Accounts with wellness rewards
- Tuition reimbursement
Minimum age requirements may apply. Eligibility and waiting period requirements may apply; benefits for AutoZoners in Puerto Rico, Hawaii, or the U.S. Virgin Islands may differ. Learn more about all that AutoZone has to offer at Careers.AutoZone.com.