Overview
Leads architecture, deployment, and optimization of hybrid Azure/on-premises environments, drives projects, ensures disaster recovery, manages incidents, automates operations, and fosters continuous improvement and inclusive teamwork.
Inception, a G42 company, is the region’s leading innovator of AI-powered domain-specific as well as industry-agnostic products, built on a rich heritage of research and development. Within the G42 ecosystem, Inception functions as the core intelligence layer – transforming data and compute infrastructure into real-world, applied AI solutions. Beyond its commercial endeavors, Inception is committed to creating positive societal impact. For more information, please visit www.inceptionai.ai.
Responsibilities
- Lead the architecture, deployment, and optimization of on-premises and Azure cloud environments, leveraging Azure services, Infrastructure as Code (IaC), and IT Service Management tools (ServiceNow, Jira, etc.).
- Drive one or more infrastructure engineering projects or workstreams, applying technical expertise, DevSecOps practices, and problem-solving methodologies to ensure successful delivery and modernization.
- Collaborate with diverse teams to architect, implement, and troubleshoot complex infrastructure changes across Azure, networking, and Azure Virtual Desktop, resolving issues and supporting system modernization.
- Develop and maintain disaster recovery and business continuity plans, including risk assessment and mitigation strategies for critical applications and infrastructure.
- Troubleshoot complex, high-priority incidents, considering upstream and downstream system impacts, and recommend mitigation actions while ensuring clear communication with stakeholders and adherence to ITIL change management processes.
- Partner with engineering and product teams to remediate issues, identify trends, and drive product and platform improvements across cloud, security, and networking layers.
- Demonstrate a strong aptitude for learning and coaching as new technologies and features (Azure, DevSecOps, containers, AVD) are introduced, fostering a culture of continuous improvement.
- Contribute to a team culture that values diversity, equity, inclusion, and respect, while promoting collaborative ways of working.
- Lead a high-performing team, managing their deliverables, KPIs, and professional development while ensuring alignment with architectural standards.
- Be proficient in Azure cloud infrastructure, security controls, Infrastructure as Code, and networking, with a focus on designing, securing, and operating enterprise environments.
- Be skilled in DevOps/DevSecOps practices and container orchestration platforms (Kubernetes, OpenShift) to enable scalable and secure application deployment.
- Bring experience in high-performance computing (HPC) environments, including GPU clusters (NVIDIA/AMD) and SLURM workload management, to support advanced workloads
- Demonstrate deep expertise in identity and access management (Entra ID) and data encryption standards to ensure regulatory compliance and data protection.
- Be experienced in managing RFPs and architecting enterprise-level solutions, including cost, risk, and compliance considerations.
- Possess in-depth knowledge of ISO 27001, SOC 2, HIPAA, and GDPR standards and their implementation within Azure and hybrid environments.
Qualifications
- Certified Azure Architect with advanced, hands-on experience designing and operating Azure environments.
- Deep knowledge of core infrastructure domains, including networking, databases, storage, deployment, integration, automation, scaling, resilience, and performance tuning
- Strong expertise in scripting languages such as PowerShell and Python, and in Infrastructure as Code (IaC) for automated provisioning and configuration
- Extensive experience with cloud infrastructure across Azure, AWS, and private cloud, including migrations between public and private environments
- Proven experience in business continuity planning, risk assessment, and disaster recovery design and testing for mission-critical systems
- Proficiency with ITSM and operations tools (e.g., ServiceNow, Jira, Sentinel) for incident, change, and problem management, aligned with ITIL change management practices.
- Hands-on experience with DevSecOps practices, Azure Virtual Desktop, and enterprise networking concepts supporting secure, scalable solutions
- Demonstrated ability to handle AI infrastructure and clusters, including design and operations for GPU-based and high-performance environments.
- Strong communication skills, with the ability to explain complex technical concepts to diverse stakeholders and collaborate effectively across engineering, product, and business teams.
- Continuous learner with a strong drive to expand technical and cross-functional knowledge, staying current with emerging cloud, security, and AI infrastructure trends and best practices.
What We Look For
If you are a performance-driven, inquisitive mind with the agility to adapt to ambiguity, you will fit right in. You should be eager to explore opportunities to build meaningful collaborations with stakeholders and aspire to create unique customer-centric solutions. Bias for action and a passion to conquer new frontiers in the AI space is at the heart of the Inception community.
What Working At Inception Offers
Culture: An open, diverse and inclusive environment with a global vision that encourages personal growth and focuses on ground-breaking, industry-first innovations.
Career: Outstanding learning, development & growth opportunities via structured training programs and innovative, high-tech projects.
Rewards: A competitive remuneration package with a host of perks including healthcare, education support, leave benefits and more.
If you can confidently demonstrate that you meet the criteria above, please contact us as soon as possible.