Site Reliability Engineer
Berlin
15 job openings found.
Thales Group
In this role, you lead an SRE team to ensure the operational stability of GCP services. You handle incident management, SLIs/SLOs, and optimize processes through automation and knowledge management.
In this role, you will operate and develop high-availability GCP services, monitor SLI/SLO metrics, and analyze production incidents in a 24x7 environment to ensure system stability.
Smartclip
In this role, you will enhance our internal tooling, optimize our observability strategy, integrate security measures, and implement innovative solutions using cutting-edge open-source technologies.
Doctolib
In this role, you will develop the observability strategy, optimize logging, metrics, and tracing, drive large-scale reliability initiatives, and enhance incident management processes on our platform.
Ninox Software GmbH
In this role, you will develop reliable and secure systems, support teams in automating the application lifecycle, and tackle production issues through monitoring and incident analysis.
deepset GmbH
In this role, you will develop reliable infrastructure, optimize CI/CD pipelines, and promote best practices in scalability and security while working on self-hosted platforms in cloud and on-premises environments.
IONOS
In this role, you will develop high-availability security solutions, optimize DDoS mitigation measures, and automate infrastructure processes using modern programming languages and tools.
Assecor GmbH
In this role, you will develop SLIs, SLOs, and Error Budgets, automate processes, and build Observability solutions. You will also manage incidents and optimize systems through targeted performance engineering.
EVENTIM
In this role, you will take ownership of the reliability and security of Linux-based production systems, automate operational tasks, and optimize deployments to enhance efficiency.
SysEleven GmbH
In this role, you will develop innovative observability solutions for the MKA platform, optimize Kubernetes controllers, automate production applications, and implement CI/CD workflows using GitOps.
In this role, you design, build, and operate APIs for automating our products, optimize CI/CD pipelines, and manage containerized applications in Kubernetes to ensure the reliability of our services.
Digistore24
In this role, you automate infrastructure processes and ensure optimal system performance and availability by conducting system monitoring, incident management, and capacity planning.
In this role, you develop and optimize DDoS defense mechanisms, automate infrastructure processes, and manage highly available network systems while working with Nginx, IaC, and analysis tools.
1&1 Internet AG
In this role, you will plan and implement high-availability management services on Linux/K8s with your team, optimize infrastructure processes, and assist developers with automation tasks.
In this role, you will optimize and operate highly available services on Linux and K8s, develop in-house K8s operators, and support the automation of operational tasks along with infrastructure monitoring.
Receive new Site Reliability Engineer Jobs in Berlin by email.