Career Opportunities: Technical Lead (48977)
Company Overview
Incedo is a US-based consulting, data science and technology services firm with over 3,000 people helping clients
from our six offices across the US, Mexico and India. We help our clients achieve competitive advantage through
end-to-end digital transformation. Our uniqueness lies in bringing together strong engineering, data science, and
design capabilities coupled with deep domain understanding. We combine services and products to maximize
business impact for our clients in the telecom, banking, wealth management, product engineering, and life science
& healthcare industries.
Working at Incedo will give you the opportunity to work with industry-leading client organizations, deep
technology and domain experts, and global teams. Incedo University, our learning platform, provides ample
learning opportunities starting with a structured onboarding program and carrying throughout various stages of
your career. A variety of fun activities is also an integral part of our friendly work environment. Our flexible
career paths allow you to grow into a program manager, a technical architect or a domain expert based on your
skills and interests.
Our mission is to enable our clients to maximize business impact from technology by:
- Harnessing the transformational impact of emerging technologies
- Bridging the gap between business and technology
Role Description
The Resiliency and Continuity Specialist serves as a technology resilience subject matter expert, supporting the execution, governance, and quality assurance of cloud resilience exercises and system recovery readiness. This role ensures cloud-hosted applications and platforms are demonstrably recoverable through high-quality System Recovery Plans (SRPs), resilience/chaos test plans, and complete, auditable evidence aligned to defined recovery objectives.
The Specialist partners with application teams, engineering/SRE, and resilience governance stakeholders to support resilience testing, review plans and artifacts, and drive remediation of gaps to reduce outage risk and improve recoverability in alignment with internal resilience standards.
Key Responsibilities
Coordinate & Govern Cloud Technology Resilience Exercises
- Coordinate and support the planning, execution, and governance of in-region and cross-region resilience testing for cloud-hosted systems to validate that recovery capability meets expectations.
- Ensure all pre-test artifacts are complete (scope, success criteria, steps, roles, dependencies) and that testing is performed using the program’s standard templates and acceptance guidance.
- Promote disciplined execution during exercises, including accurate capture of start/end times, step outcomes, and deviations; ensure evidence supports stated success criteria.
- Partner with technical teams to validate resilience and chaos scenarios common to cloud architectures and ensure exercise design is meaningful and testable.
System Recovery Plan (SRP) Quality & Compliance Reviews
- Perform quality reviews of System Recovery Plans (SRPs) to ensure they are usable, complete, and aligned to the expected recovery approach and operational execution.
- Confirm teams are using SRPs sourced from the appropriate repository/process and applying the latest templates/standards where required.
- Validate task sequencing, ownership, timing, and execution readiness; identify gaps and drive corrective actions.
Technical Skills
Required Knowledge & Experience
- 3+ years of experience in technology resilience, disaster recovery, operational resilience, SRE/operations, technology risk, or governance within a complex technology environment.
- Demonstrated experience coordinating or executing resilience/DR tests and producing audit-ready documentation (plans, results, and evidence).
- Working understanding of cloud operating models and cloud architecture patterns, including how systems achieve availability and recoverability in cloud environments.
- Practical familiarity with concepts such as regions/AZs, load balancing, autoscaling, infrastructure-as-code, backups/restore, replication patterns, and service dependencies.
- Familiarity with Chaos Engineering / testing practices including observability and monitoring.
- Familiarity with tools such as ServiceNow, GRC, and Harness.
- Proficiency in Microsoft Office (including MS Project, Visio).
- Experience with continuity metrics and reporting, and knowledge of reporting tools (Power BI, Power Excel, Tableau, Crystal Reports, etc.), is a plus.
Nice-to-have skills
Qualifications
Certifications and/or Other Professional Credentials:
Cloud certification - Practitioner (required) / Architecture (a plus)
CBCP / Disaster Recovery certification preferred but not required
Project Management certification a plus
Hours & Work Schedule
Work Schedule: Monday through Friday 8:00AM - 5:00PM
Company Value
We value diversity at Incedo. We do not discriminate based on race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status.