Logo AtosIEviden

Student trainee MLOps platform & cloud-native infrastructure

New

Job

  • Level
    Junior
  • Job Field
    IT, Software, DevOps
  • Employment Type
    Part Time
  • Contract Type
    Internship / school internship
  • Location
    Hamburg, Berlin, Paderborn
  • Working Model
    Hybrid, Onsite
  • Job Summary

    In this role, you will work on integrating and maintaining a cloud-native MLOps platform, configure Kubernetes environments, and automate workflows using infrastructure-as-code and CI/CD.

    Job Technologies

    Your role in the team

    • As a working student, you will support our team in integrating and further developing the various software, infrastructure, and ML components of our research and development platform into a functioning overall system.
    • The platform monitors the security and robustness status of machine learning systems throughout their entire lifecycle — as a cloud-native system based on Kubernetes, which we are currently migrating to an OpenStack-based cloud infrastructure.
    • In doing so, we work according to modern development and operations principles (Git-centric, IaC, containerization, CI/CD).
    • Participation in our layered IaC architecture with OpenTofu/Terraform (modular layers for cloud infrastructure, platform services, and applications).
    • Provisioning and maintenance of Kubernetes environments, potentially on an OpenStack-based cloud, including network, storage, and access configuration.
    • Deployment of components via Helm, Kustomize, and K8s manifests.
    • Troubleshooting the cluster (Pods, Services, Logs, Events) with kubectl.
    • Building and maintaining Docker images for our services (multi-stage builds, build automation).
    • Management of images in our internal container registry.
    • Operation and integration of the MLOps components: Kubeflow Pipelines & Training Operator, MLflow (Experiment Tracking & Model Registry), KServe (Model Serving), MinIO (S3-compatible storage).
    • Support with registering, deploying, and versioning ML models as well as setting up training and serving pipelines.
    • Integration and maintenance of the streaming layer for inference logging (Kafka-compatible message broker, CloudEvents).
    • Implementation and configuration of interfaces between services (REST, GraphQL, message queues, model inference protocols).
    • Contributing to our ML services in Python, e.g., in model monitoring, drift detection, and the demo application.
    • Support with evaluations, diagnostic plots, and method comparisons.
    • Creating and executing unit, integration, and smoke/end-to-end tests.
    • Automation of recurring deployment and build steps; contributing to the development of CI/CD pipelines.
    • Maintenance of architecture, deployment, and runbook documentation (Markdown).
    • Preparation of demo, presentation, and reproduction materials.

    This text has been machine translated. Show original

    Our expectations of you

    Qualifications

    • Enrolled student (Computer Science, Data Science, Computational Engineering, Electrical/Communication Engineering or comparable).
    • Solid Python skills and enjoyment in reading and understanding foreign code.
    • Basic understanding of containers (Docker) and Kubernetes.
    • Proficient in Git and the command line (Linux).
    • Independent, solution- and process-oriented working style as well as willingness to quickly familiarize oneself with new technologies.
    • Good German and English skills, both written and spoken.
    • Knowledge in the MLOps environment (MLflow, Kubeflow, KServe, MinIO).
    • Interest in ML security & robustness (model attacks, model monitoring).

    Experience

    • Experience with Infrastructure-as-Code (Terraform/OpenTofu).
    • Experience with cloud infrastructure, ideally OpenStack.
    • Experience with Kafka or event-driven architectures.
    • Experience with ML frameworks (e.g., PyTorch) is an advantage.
    • Initial experience with agile methodologies (Scrum, Kanban).

    This text has been machine translated. Show original

    What we offer

    • Duration of assignment: ideally 6-12 months.
    • Working hours: 15-20 hours per week (more during the lecture-free period by arrangement).
    • Hybrid work model with remote option.

    This text has been machine translated. Show original

    Benefits

    Health, Fitness & Fun

    Work-Life-Integration

    Food & Drink

    Topics that you deal with on the job

    Job Locations

    • Location Paderborn

      Nordrhein-Westfalen

      Germany

    • Location Berlin

      Germany

    • Location Hamburg

      Germany

    This is your employer

    AtosIEviden

    AtosIEviden

    Wien, Klagenfurt, Graz, Innsbruck, Neutal, Ternitz, Salzburg, München, Wien, Wien, Leonding, Leonding, Linz, Linz, Tübingen, Ingolstadt, Stuttgart, Pforzheim, Karlsruhe, Saarbrücken, Mannheim, Fürth, Frankfurt Am Main, Wiesbaden, Köln, Düsseldorf, Kassel, Leipzig, Essen, Paderborn, Holzminden, Hannover – Laatzen, Berlin, Berlin, Meppen, Bremen, Hamburg

    Atos is a leader in Infrastructure & Data Management, Business & Platform Solutions, Big Data & Security, Unified Communication & Collaboration Transactional & Payment Services and Cloud Computing.

    Description

  • Company Size
    250+ Employees
  • Founding year
    2011
  • Language
    German, English
  • Company Type
    Established Company
  • Working Model
    Hybrid, Onsite
  • Industry
    Public Service, Unions, Industry, Production, Internet, IT, Telecommunication
  • Dev Reviews

    by devworkplaces.com

    Total

    (3 Reviews)
    4.0
    • Engineering

      3.7
    • Culture

      4.5
    • Career Growth

      3.8
    • Workingconditions

      4.0
    Show All Dev Reviews
    Logo AtosIEviden

    Student trainee MLOps platform & cloud-native infrastructure

    Location
    Hamburg, Berlin, Paderborn
    Working Model
    Hybrid, Onsite
    Diversity
    Open for all genders

    More Jobs