System & Network Engineer HPC R&D (M/F)
Echirolles, FR
Eviden is an Atos Group business with an annual revenue of circa € 5 billion and a global leader in data-driven, trusted and sustainable digital transformation. As a next generation digital business with worldwide leading positions in digital, cloud, data, advanced computing and security, it brings deep expertise for all industries in more than 47 countries. By uniting unique high-end technologies across the full digital continuum with 55,000 world-class talents, Eviden expands the possibilities of data and technology, now and for generations to come.
Big Data & Security (BDS) division is the market leader in Europe on servers and super-computers segments, recognized as well for its innovations in the fields of Artificial Intelligence, cybersecurity and Quantum. Our customers are buying High Performance Computers (HPC) to study the climate change, find vaccines, work on decarbonization, or run scientific simulations.
The software R&D of BDS is based out of 5 countries, including multiple sites in France and the newly built European Research Laboratory, located in Grenoble. As part of the admin lab team, we install, administrate, maintain all the development servers used by the R&D teams working on HPC and AI, based on the latest HW technologies developed in house by Eviden and its partners (interconnect, cooling, servers,...).
We are seeking for a System & Network Engineer for HPC to join our experts team.
Your mission will be composed of:
• Systems monitoring and ensure the up and running status of all the lab infrastructure and HPC & AI clusters
• Install, update and configure the Software, Firmware and Hardware
• Evolve system and infrastructure architectures to accommodate new hardware
• Assist developers when they encounter availability issues on the Lab
• Be a driving force for the improvement of the products developed by the Software R&D team
Your profile and skills:
With a higher education degree in Systems and Networks oriented Computing, you have some experience in Linux Administration and master the following technologies/environments:
• Linux Operating System (RedHat, SUSE)
• Scripting (Python/Perl/Bash)
• Network administration (Ethernet, InfiniBand...)
• Configuration management (Puppet, Ansible)
• Virtualisation (KVM)
• Containers (Docker, Kubernetes)
• Data base (Postgresql, MySQL)
• Monitoring (Centreon, Grafana/Prometheus)
• Backup
You use version management tools (GIT) and are able to integrate into an R&D team in an Agile context.
Knowledge of HPC, Slurm, Luster and storage is a plus.
You have an appetite for hardware.
Positive and constructive by nature, you are known for your team spirit and natural leadership.
Curious and dynamic, you know how to be proactive.
Your technical english is fluent.
Let’s grow together.