See Similar Listings
Job   UK   S Yorks   Sheffield Area   Engineer -

Site Reliability Engineer | Engineer in Engineering Job in Sheffield SYK | 7210503909

This listing was posted on iSmartRecruit.

Site Reliability Engineer

Location:
Sheffield, S Yorks
Description:

Site Reliability Engineer (SRE) Role • SRE teams are expected to provide 24 x 5 (weekdays) support covering 4 time zones: UK, HK, IN, US (East Coast) which means that SREs must participate in weekend on-call rotations, and be open to flexible working hours. • Standard working is remote: 8 hours each day, 5 days per week, with 1h for lunch, exact hours to be agreed with the team lead, with flexibility to work occasionally outside of their agreed hours. • The SRE will be assigned to work in one region. Flexibility is required for this role, for example they may be asked to work 08:00-17:00 or 10:00-19:00. Start times may vary across the team and will be discussed and agreed by the team lead with each SRE. • On-call: o SREs are expected to participate in mandatory on-call weekend rotations (1 weekend in 4). Exact dates and times to be agreed with team lead to ensure coverage across all regions, o The SRE on-call is expected to acknowledge incident within 10 mins and respond appropriately. If the SRE on-call is not available for any reason, they must inform the team lead and arrange a swap with another team member, o On-call hours and any extra hours worked should be taken as time off in lieu. • Additional hours: o There are occasions when the team’s client requires 24/7 online support e.g., for major regulatory releases, where a separate support rotation and schedule may be required to accommodate the requirement, o SREs are not required to take time off in lieu following online support, it will be treated as paid overtime. Requirements • Experience as a Senior DevOps Engineer/SRE in an Agile environment. • Experience with Linux/Unix systems and Shell Scripting (BASH). • Experience with Kubernetes, preferably GKE on Prem. • Ability to program (structured and OO) with one or more high level languages, such as Python or Go. • Experience with building and managing automated CI/CD pipelines and related tools (GitLab CI/CD, Jenkins). • Experience with VMware and other virtualization platform technology. • Istio knowledge and understanding of Anthos Service Mesh. • Incident support and resolutions. • Implement and enhance monitoring, alerting, and incident response processes. • Automate manual tasks to boost efficiency and minimise human error. • Excellent analytical skills, capable of fast decision-making using sound judgement, and not afraid to explore new ideas. • Excellent interpersonal skills in dealing with customers with differing technical specializations, able to work efficiently in a high-pressure environment. • Good organizational and English communication skills are required, including prioritization of multiple projects and objectives. Desirable • Familiarity with monitoring and logging tools (Splunk, Prometheus, Datadog, Kiali). • Experience on Load Balancer and reverse Proxies (Nginx Controller / Seesaw). • Experience with containerisation technologies (Docker) and infrastructure-as-code tools (Terraform). • Kubernetes certification is desirable and considered a plus. • Knowledge of Portworx for Kubernetes Storage. • Knowledge about confluent resources like connectors, control center, replicators. • Knowledge of Kafka Stream Generator, KSQLDB, cluster federation, Spark Streams. Substitutions • It’s essential that the team maintains a resource pool from which to resource a sustainable on-call rota. • A substitute will become chargeable when the SRE to be substituted rolls off the team, which will be when the team lead has confirmed the substitution on the on-call rota. • The team lead will add the substitute to the on-call rota when they determine the substitute is able to participate as an independent member of the on-call rota, following a period (normally 8 weeks), during which the substitute will shadow their predecessor and work with other team members to ensure an effective handover. • Client may reject a substitute prior to adding them to the on-call rota if it determines that they will not perform as expected.
Posted:
March 20 on iSmartRecruit
Visit Our Partner Website
This listing was posted on another website. Click here to open: Go to iSmartRecruit
Important Safety Tips
  • Always meet the employer in person.
  • Avoid sharing sensitive personal and financial information.
  • Avoid employment offers that require a deposit or investment.

To learn more, visit the Safety Center or click here to report this listing.

More About this Listing: Site Reliability Engineer
Site Reliability Engineer is a Engineering Engineer Job located in Sheffield SYK. Find other listings like Site Reliability Engineer by searching Oodle for Engineering Engineer Jobs.