Posted 1mo ago

Reliability Engineer

@ Systems Engineering Solutions
Hanscom Air Force Base, Massachusetts, United States
HybridFull Time
Responsibilities:design systems, maintain availability, lead incidents
Requirements Summary:8+ years in reliability engineering with cloud, containers, CI/CD; active Secret clearance; US citizenship.
Technical Tools Mentioned:Prometheus, Grafana, ELK, Datadog, Azure Monitor, Terraform, ARM, CloudFormation, Python, Bash, Go, PowerShell, Docker, Kubernetes, CI/CD, Cloud Platforms, Linux, Windows
Save
Mark Applied
Hide Job
Report & Hide
Job Description

This role supports the U.S. Air Force Cloud One Architecture and Common Shared Services contract and currently has an opening for a Reliability Engineer. The Reliability Engineer is responsible for ensuring the availability, performance, scalability, and resiliency of mission‑critical systems. This role applies software engineering principles to infrastructure and operations, with a strong emphasis on automation, monitoring, incident response, and continuous reliability improvement. The reliability engineer serves as the bridge between development, operations, and platform teams to ensure production systems consistently meet defined service level objectives (SLOs) while supporting rapid, safe delivery of new capabilities.

 

 

Location: This position will be hybrid remote. Candidates will be required to work onsite as needed. Candidates preferred to be located near Hanscom AFB (Boston, MA).