Job
- Level
- Senior
- Job Field
- IT, DevOps
- Employment Type
- Full Time
- Contract Type
- Permanent employment
- Location
- Berlin
- Working Model
- Full Remote, Onsite
Job Summary
In this role, you will develop innovative observability solutions for the MKA platform, optimize Kubernetes controllers, automate production applications, and implement CI/CD workflows using GitOps.
Job Technologies
Your role in the team
- As a Senior Site Reliability Engineer in the MetaKube Accelerator Team, you leverage modern Kubernetes and Cloud-Native technologies to maximize the reliability, scalability, and operational excellence of the MKA platform.
- You solve complex platform challenges, develop production-ready systems, and contribute to shared ownership and continuous improvement.
- Designing and implementing observability solutions with Prometheus, Loki, and Mimir, including defining meaningful alerts and continuously improving monitoring coverage.
- Analysis, troubleshooting, and further development of proprietary Kubernetes controllers to ensure reliability and stability.
- Development and maintenance of production applications with a focus on code quality, scalability, and operational readiness.
- Operation, automation, and continuous development of the MKA platform with a focus on efficiency and maintainability.
- Further development of internal tooling solutions to promote automation and reduce manual effort.
This text has been machine translated. Show original
Our expectations of you
Qualifications
- Good knowledge of Bash and/or Python for automation and tooling.
- Understanding of CI/CD pipelines, ideally with Tekton-based workflows.
- Very good German skills as well as good English skills (B2+) for technical collaboration.
Experience
- Experience in operating highly available, mission-critical applications in cloud and on-premises environments, including incident leadership.
- Excellent Kubernetes skills as well as experience in cluster management.
- Experience with GitOps principles and ArgoCD for deployment and delivery workflows.
- Experience with Infrastructure as Code, particularly Terraform and Ansible.
This text has been machine translated. Show original
What we offer
- You will gain in-depth practical Kubernetes experience and learn the internals at a level that only a few possess.
- You have the freedom to solve challenges, share knowledge, and continuously learn — whether through team collaboration, internal show-and-tell sessions, or conferences like KubeCon or Container Days.
This text has been machine translated. Show original
Benefits
Work-Life-Integration
Topics that you deal with on the job
Job Locations
This is your employer
SysEleven GmbH
Mit der SysEleven NEO-Methode sind wir von der Konzeptberatung, über Schulungen Ihrer Admins und DevOps bis hin zum Full Managed Betreib an Ihrer Seite. Dafür setzen wir uns intensiv mit Best-of-Breed-Technologien auseinander, damit Sie die freie Wahl haben, welche Technologie Sie einsetzen wollen.
Description
- Company Type
- Established Company
- Working Model
- Full Remote, Hybrid, Onsite
- Industry
- Internet, IT, Telecommunication