Logo SysEleven GmbH

Senior Site Reliability Engineer - Kubernetes Platform

Job

  • Level
    Senior
  • Job Field
    IT, DevOps
  • Employment Type
    Full Time
  • Contract Type
    Permanent employment
  • Location
    Berlin
  • Working Model
    Full Remote, Onsite
  • Job Summary

    In this role, you will develop innovative observability solutions for the MKA platform, optimize Kubernetes controllers, automate production applications, and implement CI/CD workflows using GitOps.

    Job Technologies

    Your role in the team

    • As a Senior Site Reliability Engineer in the MetaKube Accelerator Team, you leverage modern Kubernetes and Cloud-Native technologies to maximize the reliability, scalability, and operational excellence of the MKA platform.
    • You solve complex platform challenges, develop production-ready systems, and contribute to shared ownership and continuous improvement.
    • Designing and implementing observability solutions with Prometheus, Loki, and Mimir, including defining meaningful alerts and continuously improving monitoring coverage.
    • Analysis, troubleshooting, and further development of proprietary Kubernetes controllers to ensure reliability and stability.
    • Development and maintenance of production applications with a focus on code quality, scalability, and operational readiness.
    • Operation, automation, and continuous development of the MKA platform with a focus on efficiency and maintainability.
    • Further development of internal tooling solutions to promote automation and reduce manual effort.

    This text has been machine translated. Show original

    Our expectations of you

    Qualifications

    • Good knowledge of Bash and/or Python for automation and tooling.
    • Understanding of CI/CD pipelines, ideally with Tekton-based workflows.
    • Very good German skills as well as good English skills (B2+) for technical collaboration.

    Experience

    • Experience in operating highly available, mission-critical applications in cloud and on-premises environments, including incident leadership.
    • Excellent Kubernetes skills as well as experience in cluster management.
    • Experience with GitOps principles and ArgoCD for deployment and delivery workflows.
    • Experience with Infrastructure as Code, particularly Terraform and Ansible.

    This text has been machine translated. Show original

    What we offer

    • You will gain in-depth practical Kubernetes experience and learn the internals at a level that only a few possess.
    • You have the freedom to solve challenges, share knowledge, and continuously learn — whether through team collaboration, internal show-and-tell sessions, or conferences like KubeCon or Container Days.

    This text has been machine translated. Show original

    Benefits

    Work-Life-Integration

    Topics that you deal with on the job

    Job Locations

    • Location Berlin

      Germany

    This is your employer

    SysEleven GmbH

    SysEleven GmbH

    Mit der SysEleven NEO-Methode sind wir von der Konzeptberatung, über Schulungen Ihrer Admins und DevOps bis hin zum Full Managed Betreib an Ihrer Seite. Dafür setzen wir uns intensiv mit Best-of-Breed-Technologien auseinander, damit Sie die freie Wahl haben, welche Technologie Sie einsetzen wollen.

    Description

  • Company Type
    Established Company
  • Working Model
    Full Remote, Hybrid, Onsite
  • Industry
    Internet, IT, Telecommunication
  • Logo SysEleven GmbH

    Senior Site Reliability Engineer - Kubernetes Platform

    Location
    Berlin
    Working Model
    Full Remote, Onsite
    Diversity
    Open for all genders

    More Jobs