Senior Manager – Cloud & Infrastructure (Platform Engineering)
Reporting to
Director – Platform Engineering
Location
Dallas
Role Summary
Catalyst Brands is expanding its Platform Engineering organization to power the next generation of digital experiences — and we’re looking for a Senior Manager, Cloud & Infrastructure to help lead that transformation. This is not a traditional infrastructure role. You’ll own and shape a platform engineering culture within Cloud & Infrastructure, grounded in customer-centricity, automation-first thinking and self-service platform capabilities. Your work will enable hundreds of engineers to build, deploy and operate high-performing digital products with autonomy, speed, reliability, and efficiency.
This is a unique opportunity to build the backbone of a modern platform engineering organization—where platform capabilities directly accelerate innovation, improve developer experience, and drive measurable business outcomes.
Key Responsibilities
Lead and Scale a Modern Cloud Platform
• Define and execute the cloud-native platform strategy enabling hundreds of engineers to build, deploy, and operate services through reusable, standardized, self-service platform capabilities used across teams and brands.
• Drive platform adoption and governance across brands and teams by treating the platform as a product — maintain a roadmap, define golden paths and standards, and ensure consistency without sacrificing team autonomy
• Build and evolve a secure, scalable, and resilient AWS-based platform using infrastructure as code (Terraform) and modern platform patterns (GitOps, Kubernetes)
• Enable a ‘you build it, you run it’ model by providing simple, reliable platform capabilities that empower teams to independently build and operate services
• Establish a paved-road developer experience, including CI/CD, deployment workflows, and environment provisioning that balance speed, safety, and consistency
• Drive a reliability-first mindset by implementing SLO/SLI frameworks, resilience patterns, and automated incident response practices
• Partner with security, compliance, and risk teams to embed security guardrails directly into the platform (policy-as-code, automated compliance checks, secure defaults) rather than treating security as an afterthought
• Own cloud cost strategy end-to-end, including visibility, attribution, forecasting, and optimization (FinOps), ensuring efficient use of resources at scale
• Prioritize automation to eliminate operational toil and improve reliability at scale
• Lead modernization of Catalyst’s cloud foundation, aligning to AWS Well-Architected principles and modern landing zone patterns
• Define and govern the multi-account strategy, including account vending, OU structure, and network topology across brands and environments
Integrate AI into the Cloud & Delivery Ecosystem
• Drive adoption of AI-driven capabilities within cloud infrastructure, CI/CD and observability to improve reliability, deployment safety, and operational efficiency
• Reduce operational toil through AI-assisted incident detection, triage, and root cause analysis
• Apply AI within observability and performance workflows, turning platform telemetry into actionable insights and automation
Build and Lead High-Impact Teams
• Hire, develop, and mentor a team of Cloud and DevOps engineers.
• Foster a culture of ownership, collaboration, and continuous learning.
• Set clear goals, provide ongoing feedback, and enable career growth.
• Promote a platform mindset focused on developer enablement and self-service.
• Serve as a strategic partner in enabling scalable, high-performing digital experiences.
• Translate business and engineering goals into platform capabilities and roadmaps.
Key Qualifications & Experience
• 8–12+ years of experience in Platform Engineering, SRE, or Cloud Infrastructure, with 5+ years leading high-performing engineering teams
• Deep expertise designing and operating scalable, multi-account AWS environments using cloud-native architectures and infrastructure as code (Terraform)
• Proven experience building and evolving internal platform capabilities (e.g., CI/CD, deployment systems, developer self-service platforms) used by multiple engineering teams
• Strong hands-on experience with Kubernetes and modern deployment patterns (GitOps, container orchestration, release automation)
• Experience establishing and scaling DevSecOps practices, integrating security into platform and delivery workflows
• Demonstrated ownership of observability and reliability practices, including metrics, logging, tracing, and SLO/SLI frameworks
• Proven track record driving cloud cost optimization and financial transparency (FinOps) at scale
• Experience applying AI-driven capabilities within cloud, delivery, or observability systems to improve efficiency, reliability, or developer productivity (preferred, not required)
• Proven ability to lead through ambiguity and deliver results in complex, fast-paced, enterprise-scale environments
Pay Range:
USD $97,200.00 - USD $162,000.00 /Yr.