Site Reliability Engineer
Ref: JO-2510-357228
- Singapore
- DevOps & Automation, Technology
- IT
- 250 - 999 Employees
- SGD 5,000.00 - SGD 6,000.00 per month
- Environment: Hybrid
- Contract Type: Contract
- Starts: 2025-12-01
SALT is hiring a Site Reliability Engineer for a global technology client in Singapore on a 12-month, renewable contract.
Responsibilities:
- Reliability Engineering: Define and implement SLIs, SLOs, and error budgets to measure and improve service reliability.
- Cloud Infrastructure: Design, deploy, and manage infrastructure on Google Cloud Platform (GCP) or other major cloud providers.
- Kubernetes Operations: Administer and optimize GKE (Google Kubernetes Engine) clusters, ensuring high availability and performance.
- Support & Incident Management:
  - Participate in on-call rotations and handle L2/L3 support for production systems.
  - Lead incident response, root cause analysis, and postmortems.
  - Collaborate with teams to reduce MTTR and improve incident workflows.
- Automation & Tooling: Develop tools and scripts using Python, Go, or Bash to automate operational tasks and improve system efficiency.
- Monitoring & Observability: Implement and maintain monitoring, logging, and alerting systems using tools like Prometheus, Grafana, ELK, or Stackdriver.
- API Management: Build and maintain internal APIs and integrations that support platform operations and automation.
- Infrastructure as Code: Use Terraform, Helm, and GitOps practices to manage infrastructure in a scalable and repeatable manner.
Qualifications:
- 5 years of experience in SRE, DevOps, or Infrastructure Engineering roles.
- Strong hands-on experience with cloud platforms, especially GCP.
- Proficiency in scripting/programming (Python, Go, Bash).
- Deep understanding of Kubernetes, with hands-on experience in GKE.
- Solid knowledge of SQL and relational database systems.
- Experience implementing and managing SLIs/SLOs and reliability metrics.
- Familiarity with RESTful APIs and microservices architecture.
- Strong troubleshooting and debugging skills in distributed systems.
- Excellent communication and collaboration skills.
CEI No: R1659595 / EA No: 07C3147
Salt is acting as an Employment Business in relation to this vacancy.
