Job Summary:
We are seeking an experienced Sr. Cloud Infrastructure Engineer to lead our multi-cloud networking and infrastructure, ensuring secure, efficient, and cost-effective connectivity between customers and our datacenters. This role is responsible for designing, optimizing, and maintaining cloud networking solutions across AWS, Azure, and Google Cloud Platform (GCP) while implementing cost-control and governance strategies.
A core focus of this role is managing customer connectivity, including VPC Endpoint Services, Load Balancing (NLB, ALB), VPNs, and Direct Connect/MPLS across cloud environments. The ideal candidate will have strong networking expertise, cloud cost optimization experience, and hands-on infrastructure automation skills.
Key Responsibilities:
Cloud Networking & Infrastructure Management:
• Design, deploy, and manage multi-cloud networking solutions (VPC, VNet, Cloud VPN, Private Link, Direct Connect, ExpressRoute, and Interconnect).
• Implement VPC Endpoint Services, NLBs, and target groups to facilitate secure customer connectivity.
• Optimize traffic routing, hybrid cloud networking, and cross-region connectivity to ensure high availability and performance.
• Manage firewalls, security groups, and network ACLs to maintain a secure infrastructure.
• Monitor latency, throughput, and reliability metrics to improve overall network efficiency.
• Troubleshoot complex networking issues, including VPN tunnels, BGP peering, and DNS resolution.
Cloud Cost Optimization & Efficiency:
• Implement cloud cost management strategies, including reserved instances, spot instances, auto-scaling, and rightsizing.
• Monitor and optimize data transfer costs, ensuring efficient routing and minimizing egress charges.
• Establish FinOps best practices, providing cost visibility and accountability across teams.
• Automate infrastructure provisioning and scaling using Infrastructure-as-Code (IaC) tools (Terraform, CloudFormation, ARM Templates).
Security, Compliance, and Reliability:
• Ensure cloud environments adhere to security best practices and regulatory standards (SOC 2, ISO 27001).
• Implement monitoring, logging, and alerting solutions across cloud environments (AWS CloudWatch, Azure Monitor, GCP Cloud Operations Suite).
• Collaborate with security teams to enforce Zero Trust architectures and least privilege access controls.
• Maintain disaster recovery (DR) and business continuity (BC) plans for cloud infrastructure.
Collaboration & Leadership:
• Work closely with Engineering, DevOps, Security, and FinOps teams to align cloud strategies with business needs.
• Provide leadership and mentorship to cloud engineers and infrastructure specialists.
• Standardize multi-cloud management processes to ensure consistent operations across AWS, Azure, and GCP.