Lead Site Reliability Engineer

Technology

Fantastic opportunity available for a Senior/Lead SRE. This role is open to candidates based in Johannesburg or Cape Town.

This is a brilliant opportunity to spearhead a team and establish its structure.

Main Responsibilities include:

  • Work closely with the Platform & Product engineering teams to ensure that the platform, infrastructure and services are designed and optimised for availability, latency and performance
  • Own and configure observability tooling
  • Create and tune alerts to ensure we have adequate warning of impending failures, and check alerts as they are raised
  • Investigate and resolve support issues escalated from the Tech Support team
  • Lead incident response, resolution, root cause investigation, retrospective write-ups and follow-up actions so we can take every opportunity to learn, improve and make our services more resilient
  • Identify patterns in incoming incidents and document these for further investigation
  • Collaborate with other SREs and Tech Support to improve processes and share knowledge/best practice

Skills/Experience Required:

  • End-to-end delivery/automation in an SRE, Platform or DevOps team
  • Agile development practices & legacy platforms
  • Engineering background and familiarity with modern programming languages, ideally Python
  • Scripting for automation
  • Experience investigating and resolving technical issues, spanning performance, functionality and system interactions
  • GCP, AWS, Azure (ideally GCP)
  • Strong experience and knowledge of observability, both in terms of best practices and tooling implementation/use (Datadog preferable, others will be accepted)
  • Infrastructure as Code, such as Terraform or alternatives
  • CI/CD Tools (preferably GitLab)
  • Database experience and ability to understand/write SQL (MySQL/MariaDB preferable)
  • Solid understanding of Linux Operating Systems (Debian preferable)
  • Understanding of DevSecOps culture and experience delivering technical outcomes within it
  • Previous experience managing/mentoring a team
  • SaaS environment experience

Sound like an opportunity for you? Get in touch with Caitlin on cbrown@welovesalt.com

Salt is acting as an Employment Agency in relation to this vacancy.

Job Information

Job Reference: JO-2411-348852
Salary:
Salary per: annum
Job Duration:
Job Start Date: 03/02/2025
Job Industries: Technology
Job Locations: South Africa
Job Types: Permanent
