SKA Mid – Compute Systems Engineer needed at South African Radio Astronomy Observatory
Job title : SKA Mid – Compute Systems Engineer
Job Location : Western Cape, Cape Town
Deadline : November 06, 2025
Quick Recommended Links
- The SKA-Mid Compute Systems Engineer supports the development, integration, and maintenance of computer hardware and related systems that enable the telescope technical and operational goals. This role focuses on implementing and maintaining reliable, secure, and well-performing infrastructure under guidance from senior staff. The SKA-Mid Compute Systems Engineer contributes to the delivery and evolution of systems by deploying, monitoring, upgrading, diagnosing and fault-finding and restoring, applying systems engineering practices, supporting deployments, participating in infrastructure planning, and helping ensure systems remain aligned with SRE requirements. By collaborating across teams, the engineer contributes to building maintainable and scalable systems, supporting both ongoing project development and sustainable steady-state operations.
Key Responsibilities:
- Install, implement and maintain computing systems and infrastructure
- Monitor the infrastructure in place
- Contribute to infrastructure planning and system integration efforts
- Assist in performance tuning and reliability improvements
- Apply basic automation and scripting to improve operations
- Support containerized environments and cloud infrastructure
- Collaborate with cross-functional teams and contribute to documentation and knowledge sharing
Key Requirements: Qualification:
- National Diploma in Information Technology, Computer Science, Software Engineering, Information Systems, Electronic Engineering or equivalent qualifications coupled with 7 years’ experience, OR
- BTech/BSc in Information Technology, Computer Science, Software Engineering, Information Systems, Electronic Engineering or equivalent qualifications coupled with 6 years’ experience, OR
- BENG/MTech in Information Technology, Computer Science, Software Engineering, Information Systems, Electronic Engineering or equivalent qualifications coupled with 4 years’ experience, OR
- MENG in Information Technology, Computer Science, Software Engineering, Information Systems, Electronic Engineering or equivalent qualifications coupled with 3 years’ experience, OR
- PhD in Information Technology, Computer Science, Software Engineering, Information Systems, Electronic Engineering or equivalent qualifications coupled with 1 years’ experience.
Experience:
- Experience working in data centres or server rooms/environments
- Experience working with server installations, monitoring and diagnoses
- Experience with hardware upgrades and repairs
- Experience working with Operating Systems, IAAS tools
- Experience in asset management practices, including maintaining asset registers, system and architectural mapping, warranty and service tracking, and related tools/processes.
- Basic experience with computer networks
- Basic experience withing with SANs and storage systems
- Experience working in international teams or initiatives that intersect with data platforms, storage, networking, and systems engineering domains.
- Hands-on experience in infrastructure design and automation, distributed systems, observability, CI/CD, container orchestration (e.g. Kubernetes), DevOps/SRE practices and cloud-native technologies (advantageous).
Knowledge:
- Strong understanding of systems engineering principles, including performance optimization and fault tolerance.
- Hands-on experience monitoring, diagnosing, and repairing (break/fix) various OEM hardware, including HPE, Dell, and Super Micro systems.
- Skilled in managing and maintaining component and spare inventories, as well as tools and workspaces for system assembly.
- Proven experience working with service levels (SLAs) and an understanding of operational frameworks such as Site Reliability Engineering (SRE), ITIL, and COBIT.
- Proficient in remote-first infrastructure management and monitoring.
- Working knowledge of networking fundamentals, including cabling and basic diagnostic procedures.
- Familiarity with containerized environments (Docker, Podman), orchestration platforms (Kubernetes, Helm), and container runtime architectures (e.g., CRI).
- Knowledge in infrastructure-as-code and CI/CD methodologies, utilizing tools such as GitLab CI, Ansible, and Terraform.
- Sound knowledge of IT security principles, especially regarding change management, physical and logical access control.
- Awareness and adherence to Health and Safety standards and best practices.
Additional Notes: Competency – Essential:
- Demonstrated ability to contribute effectively to cross-functional engineering projects and follow through on implementation plans under direction
- Hardware maintenance and support: Basic skills such as changing hardware components, e.g. Hard drives, memory modules, CPU, Motherboard
- Firmware and drivers diagnostics, configuration and updates
- Health and safety, self-care within data centres, assembly workshops and computer labs
- Tools and equipment use and management, e.g. regular cleaning, proper storage, routine maintenance, regular inspection, safe handling and usage, inventory management and asset tracking
- IT Spares Inventory and Tracking: Inventory categorization, Asset tagging and labelling, maintenance of data within the inventory management system, stock management, access control, lifecycle and warranty tracking, disposal and waste management
- Computer Infrastructure Asset Management, including tracking, maintaining, and optimizing relevant hardware and software assets across their lifecycle to ensure availability, compliance, and cost-effectiveness.
- IT Audit and documentation, including rack positions, network and server diagrams, topology maps, as well as service and support logs
- Hands-on experience in Linux systems administration, basic automation, and performance tuning, with a willingness to deepen expertise
- Proficiency in Linux command-line usage, service configuration, and troubleshooting; learning kernel and system-level tuning practices
- Ability to manage assigned tasks within an Agile environment, and collaborate effectively with teammates on sprint goals
- Effective troubleshooting skills, with a learning mindset toward root-cause analysis and improving operational resilience
Skills:
- Problem solving and analysis: Skilled in root cause analysis, systems troubleshooting, and performance bottleneck resolution.
- Communication and Collaboration: Clear articulation of technical recommendations, cross-functional stakeholder engagement, feedback integration.
- Planning and delivery: Capable of participate in Agile and Systems Engineering Processes and methodologies.
- Continuous learning: Commitment to staying current with evolving technologies in containerisation, cloud-native systems, observability, and systems automation.
- Commitment to staying current with evolving technologies in computing infrastructure, hardware, components (Storage, memory, Motherboards, Processors, I/O, GPU, HBA, NICs etc.
- Documentation and knowledge sharing: Ability to produce high-quality technical documentation and share knowledge across engineering teams.
- Teamwork: Collaborate within your team and with cross functional teams with our partners.
How to Apply for this Offer
Interested and Qualified candidates should Click here to Apply Now
- ICT jobs
Disclaimer: MRjobs.co.za is not an employer and does not directly offer jobs. We share available opportunities from verified sources to help job seekers. Please do your due diligence before applying. We are not responsible for any transactions, interviews, or outcomes from third-party employers.