Job
- Level: Experienced
- Job Field: IT, System
- Employment Type: Full Time
- Contract Type: Permanent employment
- Location: Cologne
- Working Model: Full Remote, Hybrid
Job Summary
In this role, you will monitor the health of complex customer environments, execute deployments and bug fixes, and support 24/7 operations by working closely with IT and OT teams.
Your role in the team
- As a Platform Operations Engineer (all genders), you are responsible for the reliable operation, monitoring, and deployment of envelio's software solutions in complex customer environments.
- You ensure that updates, bug fixes, and new versions of the Intelligent Grid Platform (IGP) are rolled out smoothly in cloud, on-premise, and operational technology (OT) environments.
- A central part of your role is the daily operation of customer systems, including monitoring system health, handling incidents, and coordinating effective incident resolution.
- You actively contribute to a stable 24/7 operation by identifying issues early, responding to incidents, and ensuring clear communication and handovers.
- You work at the interface between Engineering, Operations, and Customers.
- In close collaboration with the IT and OT teams of our clients, you help clarify operating models, understand the existing infrastructure, and ensure that our software runs reliably and securely in real network environments.
- You perform software updates, patches, and bug fixes in customer environments across cloud, on-premise, and OT infrastructures.
- You operate and maintain customer systems, ensuring a stable and secure daily operation.
- You contribute to 24/7 operations by participating in on-call duties and ensuring a quick response time in case of incidents.
- You support clients with rollouts, upgrades, and operational incidents, including outside regular business hours if necessary.
- You work directly with clients to understand their cloud environments (Kubernetes, mostly single-tenant per customer) as well as their on-premise and OT landscapes, and to define suitable operating models.
- You analyze operational issues and coordinate troubleshooting together with Development, SRE, and Security teams.
- You document customer-specific setups, operational processes, and deployment procedures.
- You contribute to improving and standardizing deployment and operational processes across clients.
- You support internal teams by providing feedback from real customer operations to inform product and engineering decisions.
Our expectations of you
Qualifications
- You operate production services on cloud infrastructure (AWS/Azure/GCP) and are familiar with typical failure modes.
- You are familiar with modern operating models such as containers and Kubernetes (or comparable) and can assess deployments in production (rollouts, rollbacks).
- You enjoy hands-on operational work, from deployments to troubleshooting in production environments.
- You have good knowledge of fundamental security concepts.
- You enjoy working closely with clients and are able to explain technical topics clearly and pragmatically.
- You are ready and able to contribute to 24/7 operations through on-call duties as part of a shared team rotation.
- You are organized, reliable, and take responsibility for operational tasks.
- You work well with software developers and can translate operational requirements into technical specifications.
- You are familiar with parts of our tech stack or confident in your ability to quickly get up to speed.
- You are fluent in both German and English, in speaking and writing.
Experience
- You have extensive experience in operating complex cloud applications and know how to reliably run services under real-world conditions.
- You have practical experience troubleshooting with Linux and networking fundamentals (logs, system status, connectivity).
- You have experience with Infrastructure-as-Code tools (Terraform).
- You have experience with monitoring and observability platforms (e.g., Datadog, Grafana, or similar).
What we offer
- Choose the work mode that suits your lifestyle: fully remote or hybrid with an office option.
- Option for remote work from abroad (up to three months per year from anywhere in the EU or the USA).
- State-of-the-art technology and modern tech stack.
- Excellent hardware equipment (16-inch MacBooks, 2 monitors at your workstation).
- 30 vacation days + 3 company holidays.
- Health support through our Urban Sports Club partnership.
- Flexible use of a monthly mobility budget (e.g., Jobrad, public transport).
- Time and budget for individual growth.
- Optional company pension scheme.
- Regular company and team events.
This is your employer
envelio GmbH
envelio GmbH, based in Cologne, is an innovative Clean-Tech software company that offers a platform for automating and digitalizing power grid planning. It supports distribution network operators in integrating renewable energy.
Description
- Company Type: Startup
- Working Model: Full Remote, Hybrid, Onsite
- Industry: Power Sector, Economy