All Jobs
No items found.
MLOps / Cloud Engineer
Europe
Remote
Who We Are
Role Description

Project description

  • design, development, and maintenance of reliable cloud and AI/ML platforms for massive enterprise utilization
  • collaboration within a dedicated DevOps team on the development of end-to-end ML workflows and solving complex business challenges through the integration of GenAI and LLM models into a production environment
  • architecture and management of infrastructure as code with intensive use of AWS, Azure, Kubernetes, Terraformtechnologies, and advanced development in Python
  • building and optimizing CI/CD pipelines through GitHub Actions and ensuring comprehensive secure deployment
  • setting up and maintaining advanced monitoring and logging using the Prometheus, Grafana, and Loki stack (without the need for on-call support)
  • containerizing applications via Docker and managing the machine learning lifecycle using tools like Kubeflowand SageMaker
  • collaboration takes place in a 100% remote mode within the entire EU

Project requirements

  • Advanced experience with:
    • software architecture development and design in Python (minimum 5 years of experience)
    • cloud platform, primarily AWS or Azure (including services like SageMaker or Bedrock)
    • container orchestration and cluster management in Kubernetes (e.g., EKS)
    • infrastructure as code and its automation using Terraform
    • MLOps tools, especially integration and maintenance of Kubeflow
  • Experience with:
    • containerization and optimization of performance and security using Docker
    • creation and debugging of complex CI/CD pipelines, ideally via GitHub Actions
    • deployment and work with end-to-end ML workflows and integration of LLM and GenAI
    • monitoring and logging systems like Prometheus, Grafana, or Loki
  • Advanced knowledge of:
    • CI/CD principles, security practices, and DevOps culture
    • theoretical foundations and production deployment of a wide range of ML algorithms
    • model registry, model performance monitoring, and data quality monitoring
  • Advantageous:
    • experience with advanced Kubernetes features, such as operators
    • experience with the Dynatrace platform

We Expect You to Have:

Apply for this position

Our team will review your application within the next 5 days.

Uploading...
fileuploaded.jpg
Upload failed. Max size for files is 10 MB.
Send

Thank you!
We will be in touch shortly

kid giving a thumbs-up while sitting at a desktop table
Done
Oops! Something went wrong while submitting the form.