Job Title Here Experience Director

Job ID: 000000123SC
Location: London, UK
Area of interest: Investment Banking
Job type: Permanent - Full Time
Work style: Hybrid Working
Opening date: 27-Sept-2022 Closing Date: 12-Oct-2022
Apply now      >

Title:  Lead, Observability Platform Engineering

25049

Guangzhou, CN

Technology
Regular Employee
Hybrid
14 Apr 2025

Job Summary

As the Lead, Cloud & Container, Observability, Central Platform Development, you will play a critical role in making the internal state of the bank's application and infrastructure services visible to stakeholders for troubleshooting, performance analysis, capacity planning, and reporting through the Central Monitoring and Observability Platform. You will lead to develop the bank’s central monitoring and observability platform and tooling to enable product owners, developers, and operators to efficiently trace performance problems to their source and map their application performance to business objectives. You will lead the Predictive Monitoring and Predictive Observability and AIOps practices for the platform to enable observability platform to predict issues early.

Key Responsibilities

  • Awareness and understanding of the TTO’25 business strategy and model appropriate to the role. Support and the enablement of the Central Monitoring & Observability strategy, goals and objectives by developing prioritized features aligned to the Catalyst and Tech Simplification programmes.
  • The Monitoring & Observability Platform team is a global team ensuring the design, development, delivery & support of the bank’s central monitoring and observability services for all TTO teams (technology domains).
  • The ideal candidate will possess a deep understanding in one or more of the Observability technologies (Elastic Observability, Grafana Observability) and working as an Container Observability, Cloud Observability and Opentelemetry Lead, Data Transformation Lead or similar role, with a strong focus on enabling Opentelemetry techniques to real-world problems, enabling the design, development, implementation, and management of the tegrating advanced technological tools and techniques, with a strong focus on applying observability techniques to real-world problems. Participation in Weekend releases, overnight major incidents to help teams enable Observability Predictive Capability is a must as this is key capability for the role.
  • As the Lead, Cloud & Container, Observability, Central Platform Development, you will play a crucial role in ensuring the stability, reliability, and use of Machine learning of our applications and platform integrations, thereby enabling our organization to deliver predictive observability services to our internal stakeholders by adhering to the Enterprise SDLC (eSDLC) framework and guidelines.
  • The ability to interpret the Group’s technical and security (ICS) control requirements and information to identify potential risks and key issues based on this information and put in place appropriate controls and measures to mitigate or minimize risk to the central monitoring & observability platform delivery.

Qualifications

  • Our ideal candidate should have overall minimum of 8+ years of IT experience

  • Bachelor’s Degree in computer science or Information Systems or equivalent applicable experience
  • Proven experience (4 years) working as an Container Observability, Cloud Observability and Opentelemetry Lead, Data Transformation Lead or similar role, with a strong focus on enabling Opentelemetry techniques to real-world problems.
  • Design and develop AI-powered solutions for IT operations (AIOps) using Machine Learning techniques and rightful used models.
  • Must have experience on mentoring team in terms of creating structure to the book of work. Help team with organised product backlog.
  • Participation in Weekend releases, overnight major incidents to help teams enable Observability Predictive Capability is a must as this is key capability for the role.
  • Hands-on experience with machine learning frameworks (e.g., Grafana Tempo, Grafana Loki, Grafana Mimir, Victoriametrices etc.) and proficiency in programming languages such as Python, Hive, Spark.
  • Must have working experience on Grizzly and Observabilityy using models.
  • Enables Use of AI in responsible way and enable AI, ML technologies to identify historical trends, dynamic baselining and to drive Root cause analysis actions.
  • Addressed problems through risk management and contingency planning.
  • Software development life cycle knowledge in terms of analysis, development & testing phases.

About Standard Chartered

We're an international bank, nimble enough to act, big enough for impact. For more than 170 years, we've worked to make a positive difference for our clients, communities, and each other. We question the status quo, love a challenge and enjoy finding new opportunities to grow and do better than before. If you're looking for a career with purpose and you want to work for a bank making a difference, we want to hear from you. You can count on us to celebrate your unique talents and we can't wait to see the talents you can bring us.

Our purpose, to drive commerce and prosperity through our unique diversity, together with our brand promise, to be here for good are achieved by how we each live our valued behaviours. When you work with us, you'll see how we value difference and advocate inclusion.

Together we:

  • Do the right thing and are assertive, challenge one another, and live with integrity, while putting the client at the heart of what we do
  • Never settle, continuously striving to improve and innovate, keeping things simple and learning from doing well, and not so well
  • Are better together, we can be ourselves, be inclusive, see more good in others, and work collectively to build for the long term

What we offer

In line with our Fair Pay Charter, we offer a competitive salary and benefits to support your mental, physical, financial and social wellbeing.

  • Core bank funding for retirement savings, medical and life insurance, with flexible and voluntary benefits available in some locations.
  • Time-off including annual leave, parental/maternity (20 weeks), sabbatical (12 months maximum) and volunteering leave (3 days), along with minimum global standards for annual and public holiday, which is combined to 30 days minimum.
  • Flexible working options based around home and office locations, with flexible working patterns.
  • Proactive wellbeing support through Unmind, a market-leading digital wellbeing platform, development courses for resilience and other human skills, global Employee Assistance Programme, sick leave, mental health first-aiders and all sorts of self-help toolkits
  • A continuous learning culture to support your growth, with opportunities to reskill and upskill and access to physical, virtual and digital learning.
  • Being part of an inclusive and values driven organisation, one that embraces and celebrates our unique diversity, across our teams, business functions and geographies - everyone feels respected and can realise their full potential.
25049