Zum Hauptinhalt gehen
Erstellt am 28. Mai 2026

Senior Site Reliability Engineer

Jobgether
Germany Vollzeit
Reference: 113_728854_3252c25e-a5c4-49e0-a751-d0172ed61bc5

This position is posted by Jobgether on behalf of a partner company. We are currently looking for a Senior Site Reliability Engineer in Germany.

This is an exciting opportunity to join a fast-growing technology environment focused on building highly scalable and reliable cloud infrastructure that supports millions of frontline users worldwide. In this role, you will take ownership of critical reliability domains, helping shape the architecture, resilience strategy, and operational excellence of a modern SaaS platform. You will work within a highly collaborative Platform Squad, driving technical decisions, improving observability, and enabling engineering teams through automation and self-service infrastructure solutions. The position offers a strong mix of hands-on engineering, strategic impact, and mentorship, making it ideal for senior professionals passionate about scalability, reliability, and cloud-native technologies. You will contribute directly to high-availability systems, incident management, and platform evolution in a remote-first and innovation-driven environment.

Accountabilities:

  • Drive the architecture and continuous improvement of cloud infrastructure and Kubernetes environments designed for high availability and scalability
  • Define and implement resilience strategies, including disaster recovery, rollback mechanisms, zero-downtime deployments, and global scaling initiatives
  • Enhance observability frameworks and monitoring systems to ensure platform reliability and operational transparency
  • Improve Infrastructure as Code and self-service platform capabilities to reduce operational overhead and increase engineering efficiency
  • Lead major incident management processes, coordinate post-mortem analyses, and implement long-term reliability improvements
  • Mentor engineers within the Platform Squad through technical guidance, design reviews, and knowledge sharing initiatives
  • Collaborate with cross-functional teams to shape platform roadmaps, architectural standards, and infrastructure strategies
  • Contribute to CI/CD pipeline improvements, GitOps practices, and automation initiatives across the engineering organization

Requirements:

  • Minimum 5 years of hands-on experience in Site Reliability Engineering, Platform Engineering, DevOps, Cloud Infrastructure, or similar infrastructure-focused engineering roles
  • Proven experience building and operating high-throughput, highly available production systems at scale
  • Deep expertise with Kubernetes environments running on major cloud platforms
  • Strong experience with observability and monitoring technologies such as Prometheus, Grafana, Loki, ELK, Mimir, or similar tools
  • Solid programming skills in Go or Python, with experience developing infrastructure-related tooling and automation
  • Hands-on experience with Infrastructure as Code tools such as Terraform, Pulumi, OpenTofu, and GitOps frameworks like ArgoCD
  • Strong understanding of CI/CD pipelines, reliability engineering principles, SLIs, SLOs, and error budget methodologies
  • Demonstrated ability to lead complex technical initiatives, architecture discussions, and infrastructure transformations
  • Experience mentoring engineers and promoting engineering best practices within collaborative teams
  • Strong communication skills in English and willingness to participate in on-call rotations
  • Additional experience with service meshes, API gateways, Kubernetes operators, or highly available PostgreSQL environments is considered a plus

Benefits:

  • Remote-first work environment with flexibility to work from home across Europe
  • Opportunities for in-person collaboration through team events, workshops, and office gatherings in Germany
  • Work-life balance initiatives including wellness memberships and bike leasing programs
  • Dynamic and collaborative company culture with regular team events and culture-focused activities
  • Opportunity to contribute directly to the growth and scalability of a fast-growing tech organization
  • International remote work opportunities within the European Union
  • High-impact role with ownership, autonomy, and strong career development potential
  • Inclusive and diverse workplace focused on collaboration, innovation, and personal growth
How Jobgether works:
We use an AI-powered matching process to ensure your application is reviewed quickly, objectively, and fairly against the role's core requirements. Our system identifies the top-fitting candidates, and this shortlist is then shared directly with the hiring company. The final decision and next steps (interviews, assessments) are managed by their internal team.
We appreciate your interest and wish you the best!
Data Privacy Notice: By submitting your application, you acknowledge that Jobgether will process your personal data to evaluate your candidacy and share relevant information with the hiring employer. This processing is based on legitimate interest and pre-contractual measures under applicable data protection laws (including GDPR). You may exercise your rights (access, rectification, erasure, objection) at any time.
#LI-CL1
We may use artificial intelligence (AI) tools to support parts of the hiring process, such as reviewing applications, analyzing resumes, or assessing responses. These tools assist our recruitment team but do not replace human judgment. Final hiring decisions are ultimately made by humans. If you would like more information about how your data is processed, please contact us.

Jobbenachrichtigungen per Newsletter erhalten