HPC Application Support Engineer

Publication Date:  Oct 30, 2024
Ref. No:  513387
Location: 

Timisoara, RO

Eviden is an Atos Group business with an annual revenue of circa € 5 billion and a global leader in data-driven, trusted and sustainable digital transformation. As a next generation digital business with worldwide leading positions in digital, cloud, data, advanced computing and security, it brings deep expertise for all industries in more than 53 countries. By uniting unique high-end technologies across the full digital continuum with 57,000 world-class talents, Eviden expands the possibilities of data and technology, now and for generations to come.

 

HPC Application Support Engineer:

 

An Application Support Engineer in a High-Performance Computing (HPC) environment is responsible for maintaining and optimizing the performance of software applications and systems used in computational tasks. Their role involves ensuring applications run efficiently on HPC infrastructures, troubleshooting issues, and providing technical support to key users.

 

This role is critical for leveraging HPC resources to achieve optimal computational performance and support advanced research and development activities.

 

Role Expectations:

  • Deploy, configure, and maintain software stack HPC applications to ensure optimal performance.
  • Keep HPC software stack updated, including libraries, compilers, and application dependencies including also diagnose and resolve issues related to software applications and system performance.
  • Fine-tune HPC applications to leverage HPC resources efficiently, ensuring maximum performance and resource utilization.
  • Support and maintain technology standards, processes and policies related to on prem/cloud Infrastructure in scope.
  • Contribute to international projects by providing consultancy regarding HPC infrastructure architectures (on premises and cloud).
  • Suggest system changes in accordance with documented SOPs.
  • Produce and maintain appropriate documentation and diagrams describing system setups and overall inventory.

 

Capabilities and Expertise:

  • Fortran and C/C++, MPI (OpenMPI, Intel MPI, MVAPICH, ...)
  • Parallel programming expertise.
  • Various compilers (Intel, GNU, PGI, ...)
  • HPC libraries (MKL, fftw, HDF5, ...)
  • Experience with automated software configuration like Ansible or Chef.
  • Knowledge of CPU/GPU architectures (Intel, AMD, ARM, Nvidia, ...) will greatly help.
  • Strong working knowledge with Linux server operating systems.
  • Strong Scripting skills Bash shell/Python.

             

  Nice to have:

  • Initial experience in parallel debuggers (DDT, TotalView, ...).
  • Experience with the development and use of HPC applications in the technical-scientific field.
  • Experience with submit Jobs in Schedulers like Slurm, LSF, GridEngine, etc.

 

What we offer: 

  • Training and certifications: Ongoing in-depth training on current and emerging products and technologies. 
  • Hybrid schedule. 
  • Relocation bonus for candidates outside the West Area.  
  • Extra benefits: Fixed sum available yearly to spend on vacation, travel, dentist, tech gadgets, etc. 
  • Performance related bonus.
  • Extra vacation days.

 

 

Let's grow together.