Site Reliability Engineer

Amsterdam, Noord-Holland
Vast
Voltijds

17 dagen geleden

We are seeking a Site Reliability Engineer (SRE) to help scale and secure mission-critical platforms for a leading financial institution in Amsterdam. As part of a cross-functional engineering team, you'll focus on observability, reliability, incident response and operational excellence across distributed systems.This role demands both engineering skill and operational discipline. Previous experience in a banking or regulated enterprise environment is mandatory.Responsibilities:

Build and improve monitoring, logging and alerting for high-availability systems
Support production reliability across Kubernetes, cloud and on-prem environments
Define SLOs, SLIs and error budgets in collaboration with development teams
Lead root cause analysis and incident response processes
Automate operational tasks and drive reliability through infrastructure-as-code
Contribute to playbooks, runbooks and operational readiness reviews

Requirements:

3–5 years in an SRE, DevOps or Platform Engineering role
Strong skills in observability tooling (Prometheus, Grafana, ELK, Splunk, etc.)
Experience with incident management and post-mortem analysis
Proficient with Kubernetes and infrastructure automation (Terraform, Helm)
Solid scripting (Bash, Python, Go)
Minimum 2 years in a banking or highly regulated enterprise environment
Comfortable working with InfoSec, Compliance and Risk teams

What we offer:

Full-time position with long-term (12+ month) scope
Mission-critical role with visibility across engineering and operations
Competitive salary and secondary benefits
Hybrid work setup (2–3 days onsite in Amsterdam)
Budget for tooling, training and certifications

Note: Immediate availability or short notice (≤2 weeks) required. Banking experience is a strict must.

Profi-Workers

Solliciteer