Site Reliability Engineer

Job description

Company Description:

  • A global video game company

Responsibilities:

  • Ensure high reliability and availability, meeting SLAs and SLOs as measured by SLIs.
  • Monitor the system for issues and respond to incidents, ensuring quick resolution to maintain high system availability.
  • Drive incident resolution and process improvements to minimize downtime and increase operational transparency.
  • Ensure all key services are measured and monitored, and that alerts are raised when needed.
  • Develop comprehensive monitoring solutions that provide full visibility into the different platform components, using tools and services such as Kubernetes, Prometheus, Grafana, New Relic and others.
  • Support services before they go live through activities such as capacity planning, monitoring setup, logging, and production readiness reviews.
  • Engage in service capacity planning and demand forecasting, performance analysis, and system tuning.
  • Collaborate with the development teams to enhance the product's operational stability.
  • Build and drive the automation systems that maintain system health.

Requirements:

  • Minimum 6 years' experience, including some early experience in software engineering.
  • Proficiency in scripting languages such as Python and Bash. A strong understanding of Go and PHP is a plus.
  • Deep knowledge of monitoring systems such as Prometheus, Grafana, New Relic or Datadog.
  • Good understanding of continuous integration/continuous delivery processes and platforms (GitLab preferred). Experience with Helm.
  • Experience with Docker, Kubernetes, or other container orchestration systems.
  • Familiarity with infrastructure automation tools like Terraform.
  • Experience with automation, system administration, and system hardening.
  • Experience with Linux-based infrastructure and Linux/Unix administration.


To apply, please click "APPLY NOW" or email Han at wenhan.cheong@ambition.com.my quoting reference number AGP 269710. Data provided is for recruitment purposes only.

Due to the volume of applications received, we regret to inform you that only shortlisted candidates will be notified.

JTK Number: JTKSM 995 | Company Registration Number: 201301019088 (1048918-T)

If this job isn't quite right for you, but you know someone who would be great at this role, why not take advantage of our referral scheme? We offer MYR500 in shopping vouchers for every referred candidate who we place in a role. Terms & Conditions Apply. https://www.ambition.com.my/refer-a-friend