You are a reliability-owning, hands-on solver, not just a "break-fix engineer." As a DRI (directly responsible individual) for our clients’ most critical systems, you’ll be the go-to expert within the squad that ensures their environments are secure, reliable, and optimized 24/7. You will deliver measurable impact: improved uptime, faster response times, and real cost savings. Not just closed tickets. Not just alerts. Real outcomes you engineer yourself.
You will lead the charge on technical execution, from complex troubleshooting and root cause analysis to engineering proactive, automated solutions. This role is about building the future of reliable cloud operations and shipping it into today’s production environments.
Your Responsibilities
What you will wake up to solve.
Role
This isn’t a “manage tickets” role. You are the architect, the executor, and the DRI for our Cloud Managed Services GTM, deploying solutions that turn operational noise into hardened outcomes. Here’s how you’ll make your mark:
Own Service Reliability: You will be the go-to technical expert for 24/7 cloud operations and incident management. You’ll ensure strict adherence to SLOs by getting your hands dirty, leading high-stakes troubleshooting to deliver a superior client experience.
Engineer the Blueprint: You’ll translate client needs into scalable, automated, and secure cloud architectures. You will write and maintain the operational playbooks and Infrastructure as Code (IaC) that your squad uses every day.
Automate with Intelligence: You’ll lead the charge from the keyboard to futurify our operations. You’ll embed AI-driven automation, predictive monitoring, and AIOps into core processes to eliminate toil and preempt incidents.
Drive FinOps & Impact: You’ll own the technical execution of the FinOps framework. You will continuously analyze, configure, and optimize cloud spend for clients through hands-on engineering.
Be the Expert in the Room: You’ll share your knowledge through internal demos, documentation, and technical deep dives, representing the deep expertise that turns operational complexity into business resilience.
Mentor & Elevate: You will be a technical mentor for your peers. Through code reviews and collaborative problem-solving, you’ll help build a high-performing squad that lives the “Always Hardened” mindset.
Experience & Relevance
We are looking for future technology leaders, not just coders. We value raw intelligence, analytical rigor, and an obsessive passion for technology over any prior experience.
Cloud Operations Pedigree: 3+ years of experience in GCP cloud infrastructure, with a significant portion in a cloud managed services environment.
Commercial Acumen: Proven track record of building and scaling a net-new managed services business.
Client-Facing Tech Acumen: 2+ years of experience in a client-facing technical role, acting as the trusted advisor for cloud operations, security, and reliability.
Functional Skills
Service Reliability Engineering: A deep understanding of MSP business models, SLAs, and the importance of client satisfaction in an operational context.
Client Engagement: Ability to ask appropriate questions to get to the heart of an operational issue and win trust with stakeholders.
Cross-Functional Catalyst: Thrive in multi-disciplinary teams, bringing together operations, security, and development teams.
Repository Builder: Creates reusable frameworks, IaC modules, and operational playbooks for scale.
Ready to join the ‘real solvers’ and futurify?
If you are excited by the possibilities of what an AI-native, engineering-led, modern tech consultancy can do to futurify businesses, apply here and experience the ‘Art of the Possible’. Don’t just send a resume. Send a statement.