Job Description – SRE

Location: Pan India

Experience: 5–10 Years

Role Type: SRE

About The Role

We are looking for an experienced Site Reliability Engineer (SRE) to design and implement scalable monitoring frameworks, lead incident response processes, and build automation-driven solutions that reduce operational toil. The ideal candidate has hands-on experience with Grafana, Prometheus, Kubernetes (AKS), CI/CD pipelines, Infrastructure as Code (IaC), and Azure cloud services.

Key Responsibilities

Monitoring, Logging & Alerting

Design and implement end-to-end monitoring, observability, and alerting frameworks using tools such as Grafana, Prometheus, Loki, and Elasticsearch.

Build dashboards, metrics pipelines, and log aggregation systems for application and infrastructure visibility.

Ensure proactive detection of performance issues, errors, anomalies, and service health degradation.

Incident Response & Reliability Engineering

Lead P1/P2 incident response, root cause analysis (RCA), and post-incident reviews.

Define and maintain SLOs/SLIs, error budgets, and reliability KPIs.

Reduce operational toil through automation, self-healing solutions, and proactive remediation strategies.

Build runbooks, playbooks, and incident workflows to improve response efficiency.

Kubernetes, Cloud & Automation

Manage and optimize Kubernetes clusters (AKS) including deployments, scaling, networking, and security.

Implement Infrastructure as Code (IaC) using Terraform, ARM templates, or Bicep.

Build and maintain CI/CD pipelines using tools such as Azure DevOps, GitHub Actions, and Jenkins.

Automate infrastructure provisioning, environment setup, backup, failover, and compliance workflows.

Dev Collaboration & Architecture Reliability

Partner with development teams to design reliable and scalable systems, embedding SRE principles from the start.

Participate in architecture discussions, reviewing system design for reliability, performance, and observability.

Influence engineering decisions regarding performance optimization, resiliency patterns, distributed tracing, and fault tolerance.

Azure Cloud Expertise

Strong working knowledge of Azure services, including:

AKS

Azure Monitor / Log Analytics

Application Gateway / Front Door

Azure Functions

Azure Storage

App Services

Azure Networking (VNet, NSG, Load Balancers)

Understanding of cloud cost optimization and capacity planning.

Required Skills

5–10 years of experience in SRE, DevOps, or Infrastructure Engineering.

Hands-on experience with Grafana, Prometheus, Loki, and Alertmanager.

Strong understanding of Kubernetes (AKS) and containerization.

Experience with CI/CD pipelines and deployment automation.

Proficiency in Infrastructure as Code (Terraform/ARM/Bicep).

Solid understanding of Azure Cloud Architecture.

Expertise in incident management, RCA, and SLO/SLI design.

Solid scripting experience in Shell, Python, or Go for automation.

Preferred Skills

Experience with Service Mesh (Istio/Linkerd)

Distributed tracing tools: Jaeger, OpenTelemetry

Knowledge of GitOps models (ArgoCD/Flux)

Background in microservices architecture

Soft Skills

Strong analytical and troubleshooting skills

Ability to lead during critical incidents

Excellent communication and stakeholder management

Passion for automation and reliability engineering


Job Details

Role Level: Mid-Level
Work Type: Full-Time
Country: India
City: Bangalore Urban, Karnataka
Company Website: http://www.natobotics.com
Job Function: Engineering
Company Industry/Sector: Information Technology and Services
