The Site Reliability Engineer (SRE) will be a hands-on contributor within the Site Reliability Engineering Center of Excellence (CoE), responsible for building monitoring and observability solutions, troubleshooting production issues, and participating in 24x7 on-call operations.
This role focuses on the execution of reliability practices, implementing observability tooling, improving MTTR/MTTD through automation, and ensuring production systems are resilient, observable, and performant. The SRE will collaborate closely with Principal and Senior Staff SREs, adopting best practices and frameworks defined by the CoE while directly contributing to enterprise reliability goals. This role reports to the Sr. Manager, SRE.
Key Responsibilities
Execution & CoE Alignment
Implement SRE frameworks, best practices, and playbooks provided by the CoE.
Act as a hands-on engineer, contributing to observability, reliability, and incident response initiatives.
Partner with senior SREs and leadership to maintain consistency in monitoring and incident processes.
Contribute to automation projects that improve reliability and reduce manual work.
Observability & Monitoring
Build and maintain monitoring solutions with New Relic, Datadog, Prometheus, Grafana, CloudWatch, OpenTelemetry, Graylog.
Create and refine dashboards, metrics, and alerts for proactive anomaly detection.
Extend observability coverage across infrastructure, applications, APIs, and databases.
Reliability Engineering & Automation
Implement SLIs, SLOs, SLAs, and error budgets in partnership with product and platform teams.
Contribute to reducing MTTD and MTTR through improved instrumentation and automation.
Participate in capacity planning, resiliency testing, and scaling reviews.
Support chaos engineering and reliability validation activities.
Incident & Problem Management
Participate in incident response, including on-call rotations for 24x7 coverage.
Assist with root cause analysis (RCA) and implement corrective actions.
Ensure alignment with ITSM processes for incident, problem, and change management.
Contribute to playbooks and runbooks to strengthen on-call readiness.
Collaboration & Knowledge Sharing
Collaborate with Engineering, Product, Security, Cloud, and DevSecOps teams to embed reliability practices.
Provide input on instrumentation, monitoring hooks, and operational readiness for services.
Work with DBAs and platform teams on database observability and performance optimization.
Share knowledge within the SRE team and adopt practices from Staff and Principal SREs.
Required
Qualifications & Experience
7+ years in SRE, Operations, or Infrastructure Engineering.
Strong hands-on experience with monitoring and observability platforms.
Experience with tools such as New Relic, Datadog, Prometheus, Grafana, CloudWatch, OpenTelemetry, Graylog.
Proven experience in incident response, troubleshooting production issues, and improving MTTR/MTTD.
Good knowledge of SLIs, SLOs, SLAs, and error budgets.
Global Healthcare Exchange (GHX) enables better patient care and billions in savings for the healthcare community by maximizing automation, efficiency and accuracy of business processes.
GHX is a healthcare business and data automation company, empowering healthcare organizations to enable better patient care and maximize industry savings using our world class cloud-based supply chain technology exchange platform, solutions, analytics and services. We bring together healthcare providers and manufacturers and distributors in North America and Europe - who rely on smart, secure healthcare-focused technology and comprehensive data to automate their business processes and make more informed decisions.
It is our passion and vision for a more operationally efficient healthcare supply chain, helping organizations reduce - not shift - the cost of doing business, paving the way to delivering patient care more effectively. Together we take more than a billion dollars out of the cost of delivering healthcare every year. GHX is privately owned, operates in the United States, Canada and Europe, and employs more than 1000 people worldwide. Our corporate headquarters is in Colorado, with additional offices in Europe.
Disclaimer
Global Healthcare Exchange, LLC and its North American subsidiaries (collectively, “GHX”) provides equal employment opportunities (EEO) to all employees and applicants for employment without regard to race, color, national origin, sex, sexual orientation, gender identity, religion, age, genetic information, disability, veteran status or any other status protected by applicable law. All qualified applicants will receive consideration for employment without regard to any status protected by applicable law. This EEO policy applies to all terms, conditions, and privileges of employment, including hiring, training and development, promotion, transfer, compensation, benefits, educational assistance, termination, layoffs, social and recreational programs, and retirement.GHX believes that employees should be provided with a working environment which enables each employee to be productive and to work to the best of his or her ability. We do not condone or tolerate an atmosphere of intimidation or harassment based on race, color, national origin, sex, sexual orientation, gender identity, religion, age, genetic information, disability, veteran status or any other status protected by applicable law. GHX expects and requires the cooperation of all employees in maintaining a discrimination and harassment-free atmosphere. Improper interference with the ability of GHX’s employees to perform their expected job duties is absolutely not tolerated.
Searching, interviewing and hiring are all part of the professional life. The TALENTMATE Portal idea is to fill and help professionals doing one of them by bringing together the requisites under One Roof. Whether you're hunting for your Next Job Opportunity or Looking for Potential Employers, we're here to lend you a Helping Hand.
Disclaimer: talentmate.com is only a platform to bring jobseekers & employers together.
Applicants
are
advised to research the bonafides of the prospective employer independently. We do NOT
endorse any
requests for money payments and strictly advice against sharing personal or bank related
information. We
also recommend you visit Security Advice for more information. If you suspect any fraud
or
malpractice,
email us at abuse@talentmate.com.
You have successfully saved for this job. Please check
saved
jobs
list
Applied
You have successfully applied for this job. Please check
applied
jobs list
Do you want to share the
link?
Please click any of the below options to share the job
details.
Report this job
Success
Successfully updated
Success
Successfully updated
Thank you
Reported Successfully.
Copied
This job link has been copied to clipboard!
Apply Job
Upload your Profile Picture
Accepted Formats: jpg, png
Upto 2MB in size
Your application for Sr Site Reliability Engineer
has been successfully submitted!
To increase your chances of getting shortlisted, we recommend completing your profile.
Employers prioritize candidates with full profiles, and a completed profile could set you apart in the
selection process.
Why complete your profile?
Higher Visibility: Complete profiles are more likely to be viewed by employers.
Better Match: Showcase your skills and experience to improve your fit.
Stand Out: Highlight your full potential to make a stronger impression.
Complete your profile now to give your application the best chance!