Who We Are
Role Description
Project description
- design, development, and maintenance of reliable cloud and AI/ML platforms for massive enterprise utilization
- collaboration within a dedicated DevOps team on the development of end-to-end ML workflows and solving complex business challenges through the integration of GenAI and LLM models into a production environment
- architecture and management of infrastructure as code with intensive use of AWS, Azure, Kubernetes, Terraformtechnologies, and advanced development in Python
- building and optimizing CI/CD pipelines through GitHub Actions and ensuring comprehensive secure deployment
- setting up and maintaining advanced monitoring and logging using the Prometheus, Grafana, and Loki stack (without the need for on-call support)
- containerizing applications via Docker and managing the machine learning lifecycle using tools like Kubeflowand SageMaker
- collaboration takes place in a 100% remote mode within the entire EU
Project requirements
- Advanced experience with:
- software architecture development and design in Python (minimum 5 years of experience)
- cloud platform, primarily AWS or Azure (including services like SageMaker or Bedrock)
- container orchestration and cluster management in Kubernetes (e.g., EKS)
- infrastructure as code and its automation using Terraform
- MLOps tools, especially integration and maintenance of Kubeflow
- Experience with:
- containerization and optimization of performance and security using Docker
- creation and debugging of complex CI/CD pipelines, ideally via GitHub Actions
- deployment and work with end-to-end ML workflows and integration of LLM and GenAI
- monitoring and logging systems like Prometheus, Grafana, or Loki
- Advanced knowledge of:
- CI/CD principles, security practices, and DevOps culture
- theoretical foundations and production deployment of a wide range of ML algorithms
- model registry, model performance monitoring, and data quality monitoring
- Advantageous:
- experience with advanced Kubernetes features, such as operators
- experience with the Dynatrace platform
We Expect You to Have:
Oops! Something went wrong while submitting the form.
.png)

