Lead Site Reliability Engineer needed at Potentiam Limited
Job title : Lead Site Reliability Engineer
Job Location : Western Cape, Cape Town
Deadline : February 10, 2025
Quick Recommended Links
- Site Reliability Engineers work tightly with Tech Support teams and product/platform engineering teams and are responsible for maximising the uptime of their platforms and clients, maintaining and enhancing their observability, responding to incidents raised and documenting/investigating/fixing the underlying root causes of these incidents. They also may work on documentation for the areas in which they specialise, in order to help upstream teams when future issues arise.
Skills & Experience
- Has previous experience leading end-to-end delivery/ automation in a SRE, Platform or DevOps team
- Is knowledgeable and comfortable with agile development practices & legacy platforms
- Comes from an engineering background, and is familiar with modern programming languages, ideally Python
- Expert level at scripting for automation
- Cloud Certifications, or demonstratable knowledge
- Is experienced in investigating and resolving technical issues, spanning performance, functionality and system interactions
- Is confident in proposing solutions to technical issues, and is able to communicate the pros and cons of said solutions
- Is capable of documenting causes of underlying issues, creating runbooks for others to follow
- Has expert level experience with public cloud providers i.e GCP, AWS, Azure (ideally GCP)
- Has strong experience and knowledge of observability, both in terms of best practices and tooling implementation/use (Datadog preferable, others will be accepted)
- Strong proficiency in using Infrastructure as Code, such as Terraform or alternatives
- CI/CD Tools (any, but preferably GitLab)
- Database experience and ability to understand/write SQL (mySQL/MariaDB preferable)
- Solid understanding of Linux Operating Systems (Debian preferable)
- Has understanding of the DevSecOps culture and experience in delivering technical outcomes within this culture
- Possesses strong communication and stakeholder management skills, with an ability to communicate complex technical topics to non-technical stakeholders
- Is comfortable with providing limited on-call cover at evenings and weekends
How to Apply for this Offer
Interested and Qualified candidates should Click here to Apply Now
- ICT jobs