Job
- Level: Senior
- Job Field: IT, DevOps, Back End
- Employment Type: Part Time/Full Time
- Contract Type: Permanent employment
- Location: Meppen, Schöppingen, Osnabrück, Münster (Hessen), Kiel, Verwaltungsgemeinschaft Salem, Bocholt, Gescher
- Working Model: Hybrid, Onsite
Job Summary
In this role, you will optimize data-intensive SaaS services, analyze operational risks, implement SRE practices, and enhance scalability and performance through concrete improvement measures.
Your role in the team
- Running data-intensive SaaS services reliably is not enough for you: you want to understand where operational risks arise, where scaling becomes fragile, and how services can remain sustainably observable, performant, and manageable.
- In this role, you systematically make our platform services more robust and scalable as data volume, usage, and the number of clients grow - from analyzing operational weaknesses to implementing specific improvements.
- You take operational responsibility for the maturity of data-intensive SaaS services and bring SRE/DevOps practices such as SLOs/SLIs, operational metrics, incident learning, runbooks, and continuous improvement into day-to-day engineering work.
- You design the operation and further development of our central database systems for growing data volumes, complex query patterns, and different tenant profiles - considering data modeling, data access, deployment, operational costs, and product impact together.
- You proactively address scaling with capacity planning, performance testing, tenant isolation, and appropriate strategies such as partitioning or sharding before scaling limits become customer issues.
- You think of operability as a product feature: our services should be cloud-provider-agnostic and reliably installable, diagnosable, and operable in on-premises environments and environments close to the customer - with robust migrations, diagnostic capabilities under different operating conditions, and good performance on heterogeneous infrastructure.
- You embed operational knowledge within the team, strengthen shared operational responsibility, and support the Tech Lead, Product Management, and Engineering team as a sparring partner on reliability, scaling, and operational topics.
- You pragmatically utilize AI and AIOps approaches for improved operations—such as analysis, automation, documentation, and pattern recognition—and critically assess AI-related operational requirements with regard to scaling, costs, observability, security, and tenant separation.
This text has been machine translated. Show original
Our expectations of you
Qualifications
- You are familiar with production situations such as incidents, stabilization, operational improvements, or on-call/standby models.
- You do not wait for tickets, but independently recognize operational risks and translate them into actionable improvement work.
- You understand how relational and/or analytical database systems behave in operation and have practical experience with them: performance, query behavior, migrations, data growth, monitoring, backup/restore, recovery, and scaling limits are familiar topics to you. PostgreSQL and ClickHouse are especially relevant.
- You are hands-on and independently structure ill-defined operational problems - from analysis and prioritization to improvements that you implement together with the team.
- In doing so, you can engage with the engineering team as an equal when discussing the architecture, APIs, persistence, and runtime behavior of data-intensive services.
- You enjoy working in a hybrid mode and see regular on-site presence as part of effective collaboration, especially for sparring, operational clarifications, and joint learning within the team.
Experience
- You bring senior-level experience in operating production SaaS or platform services - including real production responsibility and continuous improvement.
- You have experience with scaling in real production environments, whether driven by increasing data volume, rising usage, client growth, or heterogeneous workloads, and you evaluate how the technical, economic, and operational impacts interact.
- You bring practical DevOps/SRE experience in production environments — from CI/CD, infrastructure, and Kubernetes to cloud platforms, observability, automation, and reliability engineering.
What we offer
- Flexible working hours
- Company pension scheme
- Independent working
Benefits
Work-Life-Integration
This is your employer
d.velop AG
d.velop AG offers solutions for classic document management, digital signatures, and digital mail delivery, focusing on the digitization of business processes.
Description
- Company Type: Established Company
- Working Model: Hybrid, Onsite
- Industry: Internet, IT, Telecommunication