
L3 Site Reliability Engineer

London, Greater London, England

Permanent (Full time)

This job has expired.
Start a new job search

Job Description

The SRE team operates separately from the main scrum team, and its main focus is maximising system uptime.

The role involves the following:

  • Investigating and fixing performance/resilience issues with SG Digital applications.

  • Conducting capacity planning and reviewing the production estate for major events.

  • Reviewing support, technical debt and technical directive tickets.

  • Reviewing development changes that will affect the performance of an application.

  • Conducting performance tests on SG Digital environments.

  • Supporting performance tests on PPB environments and providing a summary of the impact on SG Digital applications.

  • Generating weekend performance reports and stats, then running through them on a weekly SRE call with PPB.

  • Attending analysis/design meetings to provide input/sign-off on technical solutions for projects. For some technical changes, this may involve carrying out the full analysis.

  • Acting as the point of contact for PPB's development teams when input is needed on PPB projects/changes that could affect the performance of SG Digital applications.

  • Maintaining and working on the SRE backlog.

  • Resolving critical production issues, or assisting the support team with them. The SRE team is often contacted in the first instance on Slack.

  • Providing and maintaining solutions for improving real-time monitoring, e.g. Grafana, InfluxDB.


Posted 17 days ago

