
Site Reliability Engineer
- Amsterdam, Noord-Holland
- Vast
- Voltijds
- Build and improve monitoring, logging and alerting for high-availability systems
- Support production reliability across Kubernetes, cloud and on-prem environments
- Define SLOs, SLIs and error budgets in collaboration with development teams
- Lead root cause analysis and incident response processes
- Automate operational tasks and drive reliability through infrastructure-as-code
- Contribute to playbooks, runbooks and operational readiness reviews
- 3–5 years in an SRE, DevOps or Platform Engineering role
- Strong skills in observability tooling (Prometheus, Grafana, ELK, Splunk, etc.)
- Experience with incident management and post-mortem analysis
- Proficient with Kubernetes and infrastructure automation (Terraform, Helm)
- Solid scripting (Bash, Python, Go)
- Minimum 2 years in a banking or highly regulated enterprise environment
- Comfortable working with InfoSec, Compliance and Risk teams
- Full-time position with long-term (12+ month) scope
- Mission-critical role with visibility across engineering and operations
- Competitive salary and secondary benefits
- Hybrid work setup (2–3 days onsite in Amsterdam)
- Budget for tooling, training and certifications