Logo Qimia

Data Engineer/Data Scientist

Job

  • Level
    Experienced
  • Job Field
    Data
  • Employment Type
    Full Time
  • Contract Type
    Permanent employment
  • Location
    Cologne
  • Working Model
    Onsite
  • Job Summary

    In this role, you will develop innovative AI and Machine Learning solutions, particularly Generative Deep Learning models, and implement data-driven systems using Python tools in a cloud environment.

    Job Technologies

    Your role in the team

    • Development of innovative AI and Machine Learning solutions.
    • Development of Generative Deep Learning Models: Creating, training, and fine-tuning neural networks with Python tools such as Torch, Numpy, Pandas, PySpark, Jupyter, Transformers, Datasets, Tiktoken, and Wandb.
    • Natural Language Processing (NLP) using libraries from the HuggingFace ecosystem (Python-based deep learning library for LLMs) - Transformers, Datasets, Tokenizers, and Accelerate - as well as pre-trained Hugging Face Hub models.
    • Design and development of ML systems for production-ready applications that are reliable, scalable, maintainable, and adaptable to changing business requirements.
    • Development of MLOps pipelines for automation, continuous development, evaluation, and deployment of models.
    • Development of monitoring systems for the rapid detection and resolution of issues that models may encounter in production.
    • End-to-End design and implementation of data analytics systems; this includes data collection, requirements engineering and specification, as well as the design of technical solutions based on business requirements.
    • Identification and recognition of opportunities for designing and implementing Internet-Scale Data Mining solutions in close collaboration with other Data Scientists and Data Engineers.
    • Development of ETL pipelines for large and complex datasets; processing of structured and unstructured data with Spark, Pandas, Dask, Kafka, etc.
    • Prototyping and implementation of massively scaled data analytics solutions based on big data tools (Spark, DWH, SQL, Python, and R).
    • Working with cloud platforms (AWS, Azure, and Google Cloud).

    This text has been machine translated. Show original

    Our expectations of you

    Education

    • Master's degree in Computer Science or similar quantitative programs such as Statistics, Operations Research, Bioinformatics, Mathematics, or Physics.

    Qualifications

    • Fluent in German and/or English.
    • Master's degree in Computer Science, Machine Learning, or similar technical fields. (preferred)

    Experience

    • 1 year of professional or academic experience in Machine Learning and Artificial Intelligence.
    • 1 year of relevant experience in data analysis (Statistics / Data Science).
    • Experience with one or more general-purpose programming languages, including but not limited to: Java, C / C++, Python, Scala, or R.
    • Experience with one or more of the following topics: Natural Language Processing and Understanding, Classification, Pattern Recognition, and Recommendation Systems. (preferred)
    • Experience working with large datasets, e.g., social network data, scientific data, sensor data, etc. (preferred)
    • Experience in applying machine learning to large datasets. (preferred)
    • Proven programming experience in at least one programming language such as Java, Scala, C++, or a similar object-oriented language. (preferred)

    This text has been machine translated. Show original

    What we offer

    • An inspiring and challenging work environment with a flat hierarchy and experienced, helpful colleagues.
    • A comprehensive training and education.
    • Topics we will cover in our training: Big Data Science: Python Machine Learning Libraries (NumPy, SciPy, Pandas, IPython, Scikit-Learn, PyTorch, TensorFlow, JAX, NLTK), Spark for Data Mining and Machine Learning (Spark SQL, Spark MLlib, PySpark).
    • Deep Neural Networks: Feed-Forward Neural Networks, Convolutional Networks, Recurrent Neural Networks, development of production prepares TensorFlow, JAX, and PyTorch solutions.
    • Fundamentals of Data Science and Machine Learning: Time Series and Sequential Data Processing, Supervised and Unsupervised Machine Learning, Classification.
    • Logistic Regression and Random Forest, Support Vector Machines, K-Nearest Neighbors, Naive Bayes, and Gradient Boosting.
    • Generative Deep Learning and LLMs: Training a large language model capable of generating coherent text paragraphs and achieving top performance on many language modeling benchmarks.
    • It also has reading comprehension capabilities and can perform machine translations, answer questions, and create summaries. All of this is done without task-specific training.

    This text has been machine translated. Show original

    Topics that you deal with on the job

    Job Locations

    • Location Cologne

      Nordrhein-Westfalen

      Germany

    This is your employer

    Qimia

    Qimia

    Wir sind ein junges und dynamisches Inhabergeführtes IT Unternehmen. Wir unterstützen unsere Kunden durch die Realisierung maßgeschneiderter IT-Lösungen und verbinden dabei Bereiche wie Big Data Analytics, Business Intelligence und Data Warehousing, Cloud Services, Hadoop und Spark, Software Entwicklung in Scala, Python, Java Full Stack Development, Data Migration und gleichzeitig Projekt- und Prozessmanagement.

    Description

  • Company Type
    Established Company
  • Working Model
    Hybrid, Onsite
  • Industry
    Internet, IT, Telecommunication
  • Logo Qimia

    Data Engineer/Data Scientist

    Location
    Cologne
    Working Model
    Onsite
    Diversity
    Open for all genders

    More Jobs