Site Reliability Engineer | Azure

Why this role is important to us:

In SaaS Operations – your future department – our strategic roadmap is summed up in one word: Scale. We need to deliver a reliable and ever evolving platform for our clients to run their business critical processes on using the client software products. To ensure the best possible position, we’re building our cloud platform operations department around the SRE principles, and you’re invited to assist us build this from the ground up.

In your daily work you will be collaborating with experienced operations engineers and all of our agile DevOps teams in our sister department SaaS Innovation. Your focus will be on technical deliveries, troubleshooting and continuous improvements to our platform, while also acting as an integral part of our feedback loop to developers both of platform and software features of the client’s software product. You will be part of a highly, collaborative, inclusive and iterative delivery environment, with colleagues that are motivated by doing things that are equally important and impressive.

What you will be responsible for:

The focus of the SRE team is to support and operate our SaaS platform for our clients, always doing this with an eye on improving our deliveries and thinking about how to use our maintenance budget in the wisest and most scalable way. As an SRE engineer with SaaS Operations, your responsibilities would include:

  • Manage and troubleshoot pipelines for client onboarding within the Azure platform.
  • Understand Platform-As-A-Service (PaaS) platform and align client requirements to existing features.
  • Identify, define and scope new features when necessary. 
  • Schedule batch jobs based on input from clients using an enterprise scheduler. 
  • Collaborate with clients to optimize scheduled jobs to be event driven using an enterprise scheduler. 
  • Participate in problem management for Azure technology platform. 
  • Provide weekend support as needed to ensure platform availability. 
  • Lead design workshops with clients, vendors and other key stakeholders, both onsite and virtually. 
  • Mentor junior colleagues to foster knowledge sharing and team development.
  • Work effectively within an Agile team to deliver complex onboarding programs in a sprint-based approach. 
  • Perform configuration Management and Disaster Recovery tasks on a new environment. 

What we value:

As a global company, people are the utmost important asset we have. We are committed to being a place for you to grow personally as well as professionally. While we value seniority and experience it is not a requirement for this position. We are looking for people with experience in development and system operations in a true cloud context – and most importantly, we are in the market for high agency professionals. Our delivery area is vast and full of opportunities, the follow is a non-exhaustive list of attributes that we look for in our applicants:

Essential Qualifications & Experience: 

  • Bachelor’s degree in Computer Science or related field. 
  • 5 years of operational experience with troubleshooting mission-critical software systems. Experience in financial services industry is advantageous. 
  • Hands-on experience with Public Cloud environments, primarily Azure, Certification is advantageous. 
  • Solid understanding/experience of networking, virtualization, storage, secrets management, messaging and serverless computing. 
  • Experience with Linux and Windows systems, ideally with experience in systems administration. 
  • Experience Using Command line (CLI) style interfaces. 
  • Proficiency in Infrastructure as Code (IaC) tools like Terraform, Ansible and ARM. 
  • Knowledge of Version Control tool such as Github. 
  • Experience/Understanding of containerization technologies (Kubernetes, Docker). 
  • Strong knowledge of monitoring and logging tools (Azure Monitor, Log Analytics, Application Insights, Grafana). 
  • Able to write basic relational database queries. 
  • Familiarity with CI/CD pipelines and Azure DevOps for deployment automation, especially Jenkins. 
  • Postman and Rest API

Essential Soft Skills: 

  • Strong problem-solving and troubleshooting skills. 
  • Excellent communication and teamwork abilities. 
  • Comfort and motivation working in a diverse environment that can be stressful at times. 
  • A methodical and structured approach to requirements. 
  • Willingness and ability to learn new technology and tools. 
Job Category: Technology
Contract Type: Full-Time
Location: BGC Taguig
Division: Technology
Assigned Recruiter: aliah.panaligan

Apply for this position

Allowed Type(s): .pdf, .doc, .docx