Job
- Level
- Junior
- Job Field
- IT, Software, DevOps
- Employment Type
- Part Time
- Contract Type
- Internship / school internship
- Location
- Hamburg, Berlin, Paderborn
- Working Model
- Hybrid, Onsite
Job Summary
In this role, you will work on integrating and maintaining a cloud-native MLOps platform, configure Kubernetes environments, and automate workflows using infrastructure-as-code and CI/CD.
Job Technologies
Your role in the team
- As a working student, you will support our team in integrating and further developing the various software, infrastructure, and ML components of our research and development platform into a functioning overall system.
- The platform monitors the security and robustness status of machine learning systems throughout their entire lifecycle — as a cloud-native system based on Kubernetes, which we are currently migrating to an OpenStack-based cloud infrastructure.
- In doing so, we work according to modern development and operations principles (Git-centric, IaC, containerization, CI/CD).
- Participation in our layered IaC architecture with OpenTofu/Terraform (modular layers for cloud infrastructure, platform services, and applications).
- Provisioning and maintenance of Kubernetes environments, potentially on an OpenStack-based cloud, including network, storage, and access configuration.
- Deployment of components via Helm, Kustomize, and K8s manifests.
- Troubleshooting the cluster (Pods, Services, Logs, Events) with kubectl.
- Building and maintaining Docker images for our services (multi-stage builds, build automation).
- Management of images in our internal container registry.
- Operation and integration of the MLOps components: Kubeflow Pipelines & Training Operator, MLflow (Experiment Tracking & Model Registry), KServe (Model Serving), MinIO (S3-compatible storage).
- Support with registering, deploying, and versioning ML models as well as setting up training and serving pipelines.
- Integration and maintenance of the streaming layer for inference logging (Kafka-compatible message broker, CloudEvents).
- Implementation and configuration of interfaces between services (REST, GraphQL, message queues, model inference protocols).
- Contributing to our ML services in Python, e.g., in model monitoring, drift detection, and the demo application.
- Support with evaluations, diagnostic plots, and method comparisons.
- Creating and executing unit, integration, and smoke/end-to-end tests.
- Automation of recurring deployment and build steps; contributing to the development of CI/CD pipelines.
- Maintenance of architecture, deployment, and runbook documentation (Markdown).
- Preparation of demo, presentation, and reproduction materials.
This text has been machine translated. Show original
Our expectations of you
Qualifications
- Enrolled student (Computer Science, Data Science, Computational Engineering, Electrical/Communication Engineering or comparable).
- Solid Python skills and enjoyment in reading and understanding foreign code.
- Basic understanding of containers (Docker) and Kubernetes.
- Proficient in Git and the command line (Linux).
- Independent, solution- and process-oriented working style as well as willingness to quickly familiarize oneself with new technologies.
- Good German and English skills, both written and spoken.
- Knowledge in the MLOps environment (MLflow, Kubeflow, KServe, MinIO).
- Interest in ML security & robustness (model attacks, model monitoring).
Experience
- Experience with Infrastructure-as-Code (Terraform/OpenTofu).
- Experience with cloud infrastructure, ideally OpenStack.
- Experience with Kafka or event-driven architectures.
- Experience with ML frameworks (e.g., PyTorch) is an advantage.
- Initial experience with agile methodologies (Scrum, Kanban).
This text has been machine translated. Show original
What we offer
- Duration of assignment: ideally 6-12 months.
- Working hours: 15-20 hours per week (more during the lecture-free period by arrangement).
- Hybrid work model with remote option.
This text has been machine translated. Show original
Benefits
Health, Fitness & Fun
Work-Life-Integration
Food & Drink
Topics that you deal with on the job
Job Locations
This is your employer
AtosIEviden
Wien, Klagenfurt, Graz, Innsbruck, Neutal, Ternitz, Salzburg, München, Wien, Wien, Leonding, Leonding, Linz, Linz, Tübingen, Ingolstadt, Stuttgart, Pforzheim, Karlsruhe, Saarbrücken, Mannheim, Fürth, Frankfurt Am Main, Wiesbaden, Köln, Düsseldorf, Kassel, Leipzig, Essen, Paderborn, Holzminden, Hannover – Laatzen, Berlin, Berlin, Meppen, Bremen, Hamburg
Atos is a leader in Infrastructure & Data Management, Business & Platform Solutions, Big Data & Security, Unified Communication & Collaboration Transactional & Payment Services and Cloud Computing.
Description
- Company Size
- 250+ Employees
- Founding year
- 2011
- Language
- German, English
- Company Type
- Established Company
- Working Model
- Hybrid, Onsite
- Industry
- Public Service, Unions, Industry, Production, Internet, IT, Telecommunication
Dev Reviews
by devworkplaces.com
Total
(3 Reviews)Engineering
3.7Culture
4.5Career Growth
3.8Workingconditions
4.0