Job Description

About The Company

iLink Digital is a Global Software Solution Provider and Systems Integrator, delivers next-generation technology solutions to help clients solve complex business challenges, improve organizational effectiveness, increase business productivity, realize sustainable enterprise value and transform your business inside-out. iLink integrates software systems and develops custom applications, components, and frameworks on the latest platforms for IT departments, commercial accounts, application services providers (ASP) and independent software vendors (ISV). iLink solutions are used in a broad range of industries and functions, including healthcare, telecom, government, oil and gas, education, and life sciences. iLink’s expertise includes Cloud Computing & Application Modernization, Data Management & Analytics, Enterprise Mobility, Portal, collaboration & Social Employee Engagement, Embedded Systems and User Experience design etc.

What makes iLinks offerings unique is the fact that we use pre-created frameworks, designed to accelerate software development and implementation of business processes for our clients. iLink has over 60 frameworks (solution accelerators), both industry-specific and horizontal, that can be easily customized and enhanced to meet your current business challenges.

Requirements

Job Title: Site Reliability Engineer (SRE) Shift: 24x7 Support with Rotational Night Shifts Job Summary We are looking for a skilled and motivated Site Reliability Engineer (SRE) to support, automate, and enhance the reliability, scalability, and performance of our cloud-based systems. The ideal candidate should have strong experience in Azure DevOps, Azure Cloud services, automation using PowerShell/Python, and monitoring practices. This role requires close collaboration with global teams including stakeholders in the US. Key Responsibilities

  • Ensure availability, scalability, and reliability of production systems and services.
  • Manage and maintain Azure DevOps (Repos, Pipelines, Test Plans, Boards) including YAML pipeline creation and deployment automation.
  • Provide 24x7 on-call production support in a rotational shift schedule.
  • Use Kusto Query Language (KQL) for monitoring, logging, and diagnosing issues using Azure Monitor/Application Insights or Log Analytics.
  • Develop automation scripts using PowerShell and Python for operational tasks, deployments, and incident remediation.
  • Work with Azure Cloud services (VMs, Storage, Networking, Functions, App Services, Key Vault, Monitor, etc.).
  • Use Microsoft ICM tool for incident and problem management (ticket creation, tracking, resolution).
  • Collaborate with cross-functional and US-based teams to suggest improvements and drive automation initiatives.
  • Build and manage dashboards using Power BI or Fabric for operational visibility and reporting.
  • Participate in incident response, root cause analysis (RCA), post-incident reviews, and documentation.
  • Implement CI/CD best practices, monitoring, alerting, resilience, and backup strategies.
  • Contribute to SLA/SLO/SLI definitions and monitoring aligned with SRE best practices. Required Skills & Qualifications Core Technical Skills
  • Hands-on experience with Azure DevOps (Git, Pipelines, Test Plans, Boards).
  • Strong understanding of Azure Cloud fundamentals and related services.
  • Basic understanding of Microsoft Power BI and Fabric (analytics and dashboarding).
  • Experience using KQL to analyze logs and generate monitoring dashboards.
  • Automation skills using PowerShell and Python.
  • Ability to design and create YAML-based CI/CD pipelines.
  • Knowledge of Microsoft ICM or similar incident management tools. SRE-Specific Skills
  • Familiarity with SRE principles: reliability, availability, scalability, resilience, automation, and monitoring.
  • Experience in handling 24x7 production support and on-call rotations.
  • Understanding of SLIs, SLOs, and SLAs and their implementation.
  • Strong knowledge of monitoring tools, logging systems, and alerting mechanisms.
  • Basic understanding of disaster recovery, backup, and high-availability strategies.
  • Experience with incident response, root cause analysis (RCA), and creating runbooks/playbooks. Soft Skills
  • Strong problem-solving and analytical abilities.
  • Excellent communication and ability to work effectively with cross-functional and global teams (including US counterparts).
  • Proactive mindset with the ability to suggest automation ideas and process improvements.
  • Ability to work independently and in a team environment.

Benefits

  • Competitive salaries
  • Medical Insurance
  • Employee Referral Bonuses
  • Performance Based Bonuses
  • Flexible Work Options & Fun Culture
  • Robust Learning & Development Programs
  • In-House Technology Training


Job Details

Role Level: Entry-Level Work Type: Full-Time
Country: India City: Chennai ,Tamil Nadu
Company Website: http://www.ilink-digital.com Job Function: Engineering
Company Industry/
Sector:
IT Services and IT Consulting

What We Offer


About the Company

Searching, interviewing and hiring are all part of the professional life. The TALENTMATE Portal idea is to fill and help professionals doing one of them by bringing together the requisites under One Roof. Whether you're hunting for your Next Job Opportunity or Looking for Potential Employers, we're here to lend you a Helping Hand.

Report

Disclaimer: talentmate.com is only a platform to bring jobseekers & employers together. Applicants are advised to research the bonafides of the prospective employer independently. We do NOT endorse any requests for money payments and strictly advice against sharing personal or bank related information. We also recommend you visit Security Advice for more information. If you suspect any fraud or malpractice, email us at abuse@talentmate.com.


Recent Jobs
View More Jobs
Talentmate Instagram Talentmate Facebook Talentmate YouTube Talentmate LinkedIn