Job Description

Job Description


The Senior ITSMA Observability Engineer is responsible for the design and development of the Elastic and Prometheus Stack, as well as, AWS Observability tools that monitor and manage critical applications and infrastructure at HedgeServ. As an important member of the ITSMA Monitoring and Analytics Team, the Senior Engineer will be responsible for the operation and design of the portfolio of tools, which include alerting mechanisms and escalation, dashboards, and the overall framework to support the management of HedgeServ’s infrastructure, systems, and applications. Additionally, this role entails leading IT infrastructure monitoring projects and vendor management and handling daily operations with SME (Subject Matter Expert) escalation support as needed. The successful applicant should possess the ability to collaborate with various IT teams to gather requirements and develop solutions by means of existing monitoring capabilities or customized monitors (scripts)


Role Responsibilities



The Senior ITSMA Observability Engineer will collaborate with the ITSMA Monitoring and Analytics Team to design, build, secure, maintain, optimize, and document solutions utilizing Elastic Cloud Stack and AWS-managed Promethe

  • Proficiency with Elasticsearch, Logstash, Kibana, Beats, APM with X-Pack, Prometheus, Grafana, AWS CloudWatch, and other observability tools
  • Experience with OTEL Collectors
  • Engage closely with application owners, engineers, and development teams to evaluate requirements, architect, and support an Elasticsearch Stack solution, as well as structure queries to enhance system performance and efficiency
  • Design and configure ETL data pipelines using Elastic Common Schema for onboarding application logs and metrics
  • Configure index templates and manage data lifecycle (ILM) for effective data retention
  • Develop Ansible playbooks for automated deployment of Beat agents across on-premises and AWS systems; utilize Terraform for safe management of production infrastructure, employing methodologies such as Infrastructure as Code within AWS environments
  • Create Elastic alerting solutions via Watcher and Kibana Alerts integrated with existing ticketing tools and MS Teams
  • Develop Machine Learning jobs to dynamically monitor and provide alerts based on specific metrics and KPIs
  • Build Elastic and AWS observability AI solutions that enable infrastructure engineering and operations teams to address production issues efficiently
  • Adhere to lifecycle processes for transitioning solutions from Development to QA to Production
  • Actively participate in collaborative group sessions, attend agile sprint daily meetings, and share progress to ensure solution development aligns with organizational requirements



Pre-Requisite Knowledge, Skills and Experience


  • Technical Degree in Information Technology
  • Experience with Elastic Cloud and AWS Managed Prometheus
  • Knowledge of installation, system tasks, data collection, network troubleshooting, data pipelines, and cluster administration
  • Proficient in Python, Bash, PowerShell, Painless, and other scripting languages
  • Extensive ELK Stack expertise, including Elasticsearch, Logstash, Kibana, Beats, Machine Learning, APM, X-Pack, and REST API integration
  • Skilled in evaluating and tuning Elastic clusters, configurations, indexing, search performance, security, and administration
  • Proficient with Prometheus, Grafana, AWS observability tools, and their performance, security, and management
  • Experienced with security integrations (Windows SAML, LDAP, Kerberos) in Elasticsearch
  • Adept with AWS services: CloudWatch, CloudTrail, Kubernetes, Docker, Lambda
  • Integrated Elastic alerting with third-party ticketing tools
  • Experienced in implementing and integrating observability AI agents and frameworks for automated analysis, incident detection, and proactive resolution across complex systems


Job Details

Role Level: Mid-Level Work Type: Full-Time
Country: Philippines City: Manila
Company Website: http://www.hedgeserv.com Job Function: Software Development
Company Industry/
Sector:
Financial Services Investment Banking And Funds And Trusts

What We Offer


About the Company

Searching, interviewing and hiring are all part of the professional life. The TALENTMATE Portal idea is to fill and help professionals doing one of them by bringing together the requisites under One Roof. Whether you're hunting for your Next Job Opportunity or Looking for Potential Employers, we're here to lend you a Helping Hand.

Report

Disclaimer: talentmate.com is only a platform to bring jobseekers & employers together. Applicants are advised to research the bonafides of the prospective employer independently. We do NOT endorse any requests for money payments and strictly advice against sharing personal or bank related information. We also recommend you visit Security Advice for more information. If you suspect any fraud or malpractice, email us at abuse@talentmate.com.


Recent Jobs
View More Jobs
Talentmate Instagram Talentmate Facebook Talentmate YouTube Talentmate LinkedIn