DevOps/Platform Engineer
Madrid, ES
|
About Bull Bull is the Atos Group brand for high-performance computing, artificial intelligence and quantum innovations with 2,500 employees. Built on an open, end-to-end and trusted foundation, Bull designs, deploys and runs hardware and software while providing strategic services that unlock enterprise value, accelerate scientific research and drive society forward. Driven by world-class R&D with 1,500 patents, manufacturing capabilities and data science, Bull enables nations and industries to fully control their AI and data, advancing progress for the benefit of the planet.
|
About Atos Group
Atos Group is a global leader in digital transformation with c. 63,000 employees and annual revenue of c. €8 billion, operating in 61 countries under two brands — Atos for services and Eviden for products. European number one in cybersecurity, cloud and high-performance computing, Atos Group is committed to a secure and decarbonized future and provides tailored AI-powered, end-to-end solutions for all industries. Atos Group is the brand under which Atos SE (Societas Europaea) operates. Atos SE is listed on Euronext Paris.
The purpose of Atos Group is to help design the future of the information space. Its expertise and services support the development of knowledge, education and research in a multicultural approach and contribute to the development of scientific and technological excellence. Across the world, the Group enables its customers and employees, and members of societies at large to live, work and develop sustainably, in a safe and secure information space
Role Description
Bull is looking for a proactive DevOps/Platform Engineer with strong experience in Kubernetes, cloud-native platforms, and distributed infrastructure. The role focuses on resource orchestration platforms, including HPC environments such as Slurm. You will join our international R&D team to help build and operate next-generation cloud and HPC platforms.
Key Responsibilities
- Design, build, and maintain CI/CD pipelines using Jenkins (Open Source) and GitLab
- Deploy and manage applications using Helm and Kustomize
- Implement and maintain monitoring and observability solutions using Prometheus, Grafana, and Datadog
- Ensure availability, performance, scalability, and reliability of systems
- Troubleshoot infrastructure, pipeline, and production incidents
- Collaborate with development and operations teams to continuously improve DevOps processes and standards.
- Automate infrastructure provisioning and configuration using HashiCorp Terraform and Ansible
- Support and operate Bull HPC environments, including workload scheduling with Slurm.
Required Skills & Experience
- Bachelor’s degree in Computer Science, Software Engineering, or a related technical field.
- 4+ years of experience building and operating cloud-native platforms, preferably in Kubernetes-based environments.
- Strong experience with Kubernetes administration and ecosystem, including cluster lifecycle management, workload scheduling, networking, and storage.
- Experience operating multi-cluster Kubernetes environments using tools such as Rancher, Fleet, OpenShift, or similar platforms.
- Experience implementing GitOps workflows using tools such as ArgoCD, FluxCD, or Fleet for infrastructure and application delivery.
- Knowledge of Infrastructure as Code and configuration management (Terraform, Ansible, Helm, Kustomize, etc.).
- Experience with CI/CD pipelines and engineering workflows (Git, Jenkins, GitLab CI, GitHub Actions, etc.).
- Experience with observability and monitoring stacks (Prometheus, Thanos, Grafana, Datadog, etc.).
- English B2+ minimum. C level desirable.
Nice to Have
- Experience integrating Kubernetes with HPC schedulers (Slurm integration or hybrid HPC/cloud environments).
- Familiarity with cluster provisioning tools (Cluster API, kubeadm, Kubespray, etc.).
- Knowledge of Kubernetes security best practices (RBAC, Pod Security Standards, secrets management).
- Solid experience with Linux system administration, including troubleshooting distributed systems and performance bottlenecks.
- It is also valuable any previous experience in innovation and pre-sales departments or in European research projects.
Location: Location: Madrid or Barcelona (or remote within Spain for other locations).
Benefits
- Flexible Work Schedule: Enjoy half-day Fridays and an intensive summer workday to help maintain a great work-life balance
- Learning & Growth: Stay at the forefront of AI and technologies, with opportunities to learn and grow in a supportive, innovative environment.
Let’s grow together.