At Sana Commerce, we’re committed to an inclusive environment and recognize that our diverse workforce is one of our greatest strengths.
It all started in 2007, with a pizza and a plan. Sana Commerce is an e-commerce platform designed to help manufacturers, distributors and wholesalers succeed by fostering lasting relationships with customers who depend on them. We’re a fast-growing SaaS company that allows you to take ownership of your career.
At Sana Commerce, we’re looking for a Manager SRE to build and manage our global SRE team, which manages and monitors all installed systems, environments and infrastructure and resolves issues that come in through our notification system.
What you’ll get:
The opportunity to make an impact at a fast-growing SaaS scale-up;
Up to 5 weeks “work from anywhere” per year;
A global and customized onboarding program (rated 9.1/10 by previous hires);
A hybrid working model – 3 days from the office, 2 days from home;
Weekly company lunch on us.
Job Description
What you’ll be doing:
Leading the SRE team, setting objectives, and guiding the team towards achieving high reliability while balancing cost and performance SLAs.
Collaborating with platform & product engineering teams to embed reliability and operational best practices into the software development lifecycle.
Developing and implementing SRE policies and practices, including service level objectives (SLOs), service level indicators (SLIs), and error budgets.
Driving automation across operations to reduce toil, improve system performance, and ensure scalability, with a healthy allergy to repetitive manual work.
Overseeing incident management, post-mortem analyses, and root cause investigations to prevent future outages and enhance system reliability.
Facilitating capacity planning and scalability exercises to manage growth and ensure the efficient use of resources.
Facilitating disaster recovery plans & testing to ensure business continuity for our customers’ webstores.
Encouraging a culture of continuous improvement by mentoring team members and fostering innovation within the team.
Staying up to date with the latest trends and technologies in SRE and advocating for their adoption where appropriate.
Qualifications
What you’ll bring:
Bachelor’s or Master’s degree in Computer Science, Engineering, or a related technical field.
At least 5 years of experience in Site Reliability Engineering, with 2+ years in a leadership or management role.
Proven, hands-on expertise in Microsoft Azure, including designing, deploying, and managing cloud-native infrastructure. Experience with container orchestration (e.g., Kubernetes) is required.
A deep understanding of network protocols, load balancing, and high availability configurations.
Experience applying software development practices to SRE and familiarity with programming languages, preferably PowerShell and C#, or otherwise Python, Go, Java, etc.
Experience with automation tools and infrastructure as code (e.g., Terraform, Ansible).
Proficiency in monitoring and logging tools (e.g., Prometheus, Grafana, ELK Stack) and in implementing comprehensive monitoring solutions. Dynatrace knowledge is a plus.
Excellent problem-solving skills, with a proven ability to tackle complex issues under pressure.
Outstanding leadership qualities, with a track record of mentoring and developing high-performing teams.
Exceptional communication and collaboration skills, capable of working effectively with cross-functional teams.
Additional Information
Who we are
At Sana Commerce, our values drive everything we do:
Champions of Our League – We deliver lasting success, balancing quick wins and long-term value
Supercharge Our Customers – We’re revolutionizing B2B commerce together, helping our customers to lead and succeed
Determined to Grow – We embrace challenges, growing and raising the bar for ourselves and our industry
Bold Together – We dare to be bold because we have each other’s back
Job descriptions can be tough to interpret. Even if you don’t tick all the boxes, we have ambitious plans, and we encourage people who share our vision and look forward to growing with us to apply now.