Technology Resiliency and Recovery (Disaster Recovery) with Automation and AWS Expertise
Description
We are seeking a highly skilled and motivated Technology Resiliency and Recovery Specialist with deep expertise in disaster recovery, automation, and AWS cloud infrastructure. This role focuses on keeping the organization's IT infrastructure resilient and able to recover quickly from disasters or disruptions. You will use automation tools and AWS services to design, implement, and maintain robust disaster recovery strategies that minimize downtime and ensure business continuity.
Roles And Responsibilities
Disaster Recovery Planning & Implementation:
Design, implement, and maintain disaster recovery (DR) plans for the organization's IT infrastructure, ensuring business continuity.
Assess and analyze business impact, defining recovery objectives (RTO and RPO) and aligning them with organizational goals.
Regularly test disaster recovery procedures through simulations and mock drills to ensure operational readiness.
Work with different teams to identify critical systems and services that need to be included in the disaster recovery plan.
Evaluate DR tools and solutions, focusing on AWS-based services, to ensure a scalable and cost-effective recovery solution.
Technology Resiliency and Business Continuity:
Ensure that all IT systems are designed with resiliency in mind, providing high availability and fault tolerance.
Implement and maintain cloud-based disaster recovery strategies using AWS services such as Amazon EC2, S3, RDS, Route 53, and more.
Collaborate with architecture teams to ensure resiliency and continuity measures are embedded into infrastructure design.
Oversee and optimize backup strategies, ensuring that systems can be quickly restored with minimal data loss.
Automation & Infrastructure as Code (IaC):
Automate disaster recovery processes and workflows using modern DevOps tools such as AWS CloudFormation, Tidal, Terraform, Ansible, or other automation frameworks.
Implement Infrastructure as Code (IaC) practices to streamline the provisioning and management of recovery environments.
Use SumoLogic, Dynatrace, AWS Lambda, CloudWatch, and other automation tools to proactively monitor and respond to system events or failures.
Documentation & Reporting:
Maintain clear and up-to-date documentation of disaster recovery plans, runbooks, and processes.
Provide detailed post-disaster recovery reports, outlining the effectiveness of the recovery process and any lessons learned.
Report on resiliency metrics, recovery objectives, and automation progress to senior leadership.
Incident Response & Post-Incident Analysis:
Lead the response during actual disaster recovery events, coordinating with IT and business units to ensure a smooth recovery process.
Perform post-incident analysis to identify root causes, implement corrective actions, and improve recovery plans.
Collaboration & Training:
Collaborate closely with cross-functional teams including IT operations, security, engineering, and business continuity.
Provide training and awareness on disaster recovery procedures to staff, helping them understand the importance of disaster recovery and their roles during recovery scenarios.
Required
Skills & Qualifications:
Disaster Recovery & Business Continuity Expertise:
Proven experience in designing, implementing, and managing disaster recovery plans for both on-premises and cloud-based infrastructure.
Experience with automation tools such as Tidal, Terraform, AWS CloudFormation, Ansible, or similar.
Proficiency in scripting languages (Python, Shell, etc.) to automate processes and workflows.
Excellent verbal and written communication skills for technical and non-technical stakeholders.
Ability to lead recovery efforts, coordinate between various teams, and communicate effectively during high-pressure situations.
AWS Certified Cloud Practitioner and AWS Certified Solutions Architect certifications.
Preferred
Proficiency in monitoring, alerting, and performance tuning using AWS and third-party monitoring tools such as SumoLogic and Dynatrace.
Strong understanding of IT resilience, high availability architectures, RTO/RPO targets, and best practices for disaster recovery.
Knowledge of DevOps principles, Continuous Integration (CI), Continuous Deployment (CD), and configuration management.
ITIL Foundation or similar business continuity certifications.
Certified Business Continuity Professional (CBCP) or similar DR/BCP certification