Your mission
Reporting directly to the CTO, you’ll join a team of two responsible for everything that runs our Adtech solution, from DevEx, CI/CD, and cloud infrastructure to incident response and production reliability.
We’re not firefighters: everyone on the team owns decisions, drives improvements, and gets plenty of room to make things better. We work proactively with developers, data engineers, and other stakeholders, not just to put out fires, but to build better, more resilient infrastructure as we go.
What you will work on:
- Work with our core tech stack: AWS (multi-account, multi-region), Terraform, EKS (Kubernetes), complex GitLab CI pipelines, Helm, RDS, S3, RabbitMQ, Lambda, Python, and more, plus observability tools like Prometheus, Loki, Datadog, and OpenTelemetry.
- Troubleshoot complex issues across distributed systems and apply SRE principles to drive root cause analysis, long-term fixes, and platform-wide reliability improvements.
- Design and implement robust backup and disaster recovery strategies for both stateless and stateful services.
- Collaborate with engineers, stakeholders, and DevOps teammates to design, evolve, and maintain a scalable and secure cloud platform.
- Continuously improve our tooling, automation, and operational workflows to reduce friction, enhance developer experience, and enable faster, safer shipping.
- Stay current with the evolving DevOps and cloud-native ecosystem, not just to grow your own skill set, but to help elevate the team’s knowledge, challenge assumptions, and introduce better ways of thinking and working.
Your profile
- You have 3–5 years of hands-on experience in DevOps, SRE, or platform engineering roles.
- You are confident working with AWS at scale, including IAM, networking, and security best practices, and you know how to balance cost, performance, and simplicity.
- You are fluent in Terraform (or OpenTofu) and use it to build clean, modular, and scalable infrastructure.
- You have solid experience managing Kubernetes (we use EKS) in production, with a strong grasp of workload security, CI/CD patterns, and day-2 operations.
- You are comfortable building and maintaining modern CI/CD pipelines using tools like GitLab CI or ArgoCD, and you care about developer experience.
- You are proficient in Python, Go, or Bash for automation, tooling, and optimising engineering workflows.
- You use observability tools such as Prometheus, Loki, OpenTelemetry, or Datadog to build insight into systems and debug issues quickly.
- You understand and apply SLIs/SLOs, and you know how to turn monitoring into actionable alerting.
- You stay calm during incidents, troubleshoot effectively, and know when to roll back or escalate.
- You communicate clearly, write good documentation, and support your decisions with reasoning.
- You are curious: you enjoy learning, questioning the status quo, and improving the platform for everyone around you.
Why us?
- Work-Life Balance: 30 days of paid vacation.
- Commuter Benefits: Public transportation tickets provided.
- Professional Development: Annual education budget of €1,500.
- Workation Opportunities: Combine work and vacation annually.
- Wellness: Access to over 7,000 gyms and spas in Germany through Wellpass.
- Catering: Monthly team lunches, daily fruit and vegetables, and a variety of beverages.
- Flexibility: Flexible working hours and hybrid work model.
- Corporate Benefits: Exclusive discounts for major brands and platforms.
- Diversity: Join an international team with diverse cultural backgrounds.
- Fun and Games: Socializing area for relaxing activities during the workday.
Please note: this is not a remote-only position. We offer a flexible hybrid model here in Hamburg, Germany: working from home on Mondays and Fridays, and coming to the office on Tuesdays, Wednesdays, and Thursdays.