Cloud Reliability Engineer
Prudential plc Kuala Lumpur Full-time
Prudential’s purpose is to be partners for every life and protectors for every future. Our purpose encourages everything we do by creating a culture in which diversity is celebrated and inclusion assured, for our people, customers, and partners.
We provide a platform for our people to do their best work and make an impact on the business, and we support our people’s career ambitions. We pledge to make Prudential a place where you can Connect, Grow, and Succeed.
We are seeking a skilled Cloud Reliability Engineer to join our Cloud Reliability team, under Cloud Foundation. The ideal candidate should possess a robust background in cloud platforms, with a particular emphasis on GCP and Azure, and a preference for GCP expertise. The role demands proficiency in cloud operations, especially in system administration and site reliability engineering, coupled with a comprehensive understanding of various scripting languages and tools. This role operates within the cross-functional discipline of reliability engineering and centralized cloud operations.
The candidate must have the ability to investigate, troubleshoot, and apply a consistent engineering approach in the reliability space.
Key Responsibilities:
- Manage and maintain cloud infrastructure on GCP and Azure.
- Oversee IT operations, ensuring system stability and performance.
- Read and understand scripting languages such as Bash and PowerShell.
- Utilize infrastructure as code tools like Terraform, Ansible, and Puppet.
- Troubleshoot, debug, and investigate issues in a timely manner.
- Collaborate with development teams to implement GitOps and IaC practices.
- Perform advanced Linux system administration tasks.
- Work with basic networking tools and concepts (e.g., dig, netstat, nc, iptables).
Requirements:
- Bachelor’s degree in Computer Science, Engineering, or a related field.
- Minimum of 3 years of experience in IT operations as a system administrator.
- Proficiency with Linux and databases.
- Familiarity with Kubernetes.
- Strong understanding of Git and GitOps principles.
- Knowledge of basic networking tools and concepts.
- Experience with Python scripting is a plus.
- Excellent troubleshooting and debugging skills.
- Strong analytical and problem-solving abilities.
Preferred Qualifications:
- Experience with both GCP and Azure cloud platforms, with a preference for GCP.
- Advanced user of Linux operating systems.
- Familiarity with infrastructure as code (IaC) concepts.
Skills and Competencies:
- Strong understanding of system and network administration.
- Ability to write and understand scripts in Bash and PowerShell.
- Experience with automation tools like Terraform, Ansible, and Puppet.
- Knowledge of cloud-native tools and practices.
- Excellent communication and collaboration skills.
We encourage the same standards from our recruitment and third-party suppliers taking into account the context of grade, job and location. We also allow for reasonable adjustments to support people with individual physical or mental health requirements.
P/S: Please note that this role does not offer sponsorship and is only open to Malaysian citizens or RPT holders.