Job Description

Your Role

  • Develop operational strategies and act as a subject matter expert on microservices and distributed systems for FinTech applications.
  • Lead and coordinate incident response efforts, ensuring timely resolution and minimizing impact on operations.
  • Proactively diagnose and resolve issues using in-depth log analysis, and prevent recurring problems.
  • Continuously improve processes by identifying inefficiencies and applying solutions to improve service reliability.
  • Create and maintain comprehensive documentation of known issues, workarounds, and resolutions.
  • Conduct knowledge sharing sessions and workshops within the team.


Your Qualifications

  • Support critical production services: Respond to and resolve major incidents and outages, ensuring minimal service disruption.
  • Large-scale distributed systems: Work in SRE or DevOps environments, managing and optimizing highly available and resilient infrastructure in cloud platforms such as AWS or GCP.
  • Containerization: Deploy, manage, and scale containerized applications using platforms like Docker or Kubernetes in cloud environments like AWS or GCP.
  • CI/CD Pipelines: Support applications and environments that utilize continuous integration and delivery pipelines, ensuring smooth integration and deployment processes.
  • Logging and Monitoring: Utilize observability platforms such as Splunk, Prometheus, Grafana, and other APM tools to monitor system health, troubleshoot issues, and ensure uptime.
  • Incident Management: Lead incident response using tools like PagerDuty or ServiceNow, ensuring fast resolution and root cause analysis.
  • Version Control: Manage source code and collaborate on development using Git, following best practices for branching, merging, and versioning.


Plus points if you have:

  • Operations experience in e-commerce, payment systems, or financial services: Practical experience managing critical systems in industries with high availability and security requirements.
  • Proficiency in scripting languages: Strong scripting skills in languages such as Python or Shell for automation, system maintenance, and incident resolution.
  • Familiarity with SQL queries: Ability to write and execute SQL queries for troubleshooting and resolving production issues.
  • Relevant certifications: Industry-recognized certifications in key technical areas, such as cloud platforms, DevOps, or system administration.


Job Details

Role Level: Mid-Level Work Type: Full-Time
Country: Philippines City: Mandaluyong National Capital Region
Company Website: https://www.opswerks.com Job Function: Information Technology (IT)
Company Industry/
Sector:
Technology Information and Media

What We Offer


About the Company

Searching, interviewing and hiring are all part of the professional life. The TALENTMATE Portal idea is to fill and help professionals doing one of them by bringing together the requisites under One Roof. Whether you're hunting for your Next Job Opportunity or Looking for Potential Employers, we're here to lend you a Helping Hand.

Report

Disclaimer: talentmate.com is only a platform to bring jobseekers & employers together. Applicants are advised to research the bonafides of the prospective employer independently. We do NOT endorse any requests for money payments and strictly advice against sharing personal or bank related information. We also recommend you visit Security Advice for more information. If you suspect any fraud or malpractice, email us at abuse@talentmate.com.


Recent Jobs
View More Jobs
Talentmate Instagram Talentmate Facebook Talentmate YouTube Talentmate LinkedIn