KEY DUTIES AND RESPONSIBILITIES
- Design, implement and manage cloud-based and on-premises infrastructure to support mission-critical applications and services.
- Build, optimize and maintain secure CI/CD pipelines to automate build, test, deployment and rollback processes.
- Automate infrastructure provisioning, configuration, and maintenance using Infrastructure-as-Code (IaC) tools such as Terraform or Ansible.
- Implement comprehensive monitoring, logging and alerting systems (e.g. Prometheus, Grafana, ELK Stack) to ensure real-time visibility into system performance and health.
- Collaborate closely with software development, cybersecurity and IT operations teams to integrate security, reliability and compliance into every stage of the delivery lifecycle.
- Apply security best practices across infrastructure and deployment workflows, ensuring alignment with internal policies and regulatory standards (e.g. ISO 27001, NIST).
- Proactively identify, troubleshoot and resolve system issues to minimize downtime and enhance service reliability.
- Participate in agile ceremonies including sprint planning, code reviews, and incident response to support iterative delivery and cross-functional alignment.
- Maintain clear documentation of infrastructure architecture, automation scripts, and operational procedures.
- Contribute to disaster recovery, business continuity and capacity planning initiatives for critical systems.
QUALIFICATIONS AND EXPERIENCE
The ideal candidate should have:
- A Bachelor’s Degree in Software Development, Computer Science or related field.
- Professional certifications such as AWS Certified DevOps Engineer, Azure DevOps Engineer or equivalent are highly desirable.
- A project management qualification (e.g., Agile/Scrum) is preferable.
- Minimum of 3 years progressive experience in DevOps, infrastructure engineering or systems administration.
- At least 2 years hands-on experience managing CI/CD pipelines on premise and cloud environments.
SKILLS AND COMPETENCIES
- Proficiency in scripting/programming languages such as Python, Node.js, Java or Ruby.
- Strong hands-on experience with DevOps toolchains: Git, Jenkins, Docker, Kubernetes, Terraform, Ansible.
- In-depth knowledge of cloud platforms (AWS, Azure, or GCP) and container orchestration.
- Solid understanding of monitoring and observability tools (e.g., Prometheus, Grafana, ELK Stack).
- Demonstrated commitment to security by design, compliance, and operational resilience in regulated environments.
- Excellent problem-solving, communication and collaboration skills across technical and non-technical teams.
- Ability to document processes clearly and maintain operational transparency.