HPC Application Support Engineer

Publication Date:  Sep 2, 2024
Ref. No:  513387
Location: Timisoara, RO

Eviden is an Atos Group business with an annual revenue of circa €5 billion. It is a global leader in data-driven, trusted and sustainable digital transformation. As a next-generation digital business with leading worldwide positions in digital, cloud, data, advanced computing and security, it brings deep expertise to all industries in more than 53 countries. By uniting unique high-end technologies across the full digital continuum with 57,000 world-class talents, Eviden expands the possibilities of data and technology, now and for generations to come.

 

 

An Application Support Engineer in a High-Performance Computing (HPC) environment is responsible for maintaining and optimizing the performance of software applications and systems used in computational tasks. Their role involves ensuring applications run efficiently on HPC infrastructures, troubleshooting issues, and providing technical support to key users. This role is critical for leveraging HPC resources to achieve optimal computational performance and support advanced research and development activities.

 

Role Expectations:

  • Deploy, configure, and maintain the HPC software stack and applications to ensure optimal performance.

  • Keep the HPC software stack up to date, including libraries, compilers, and application dependencies.

  • Diagnose and resolve issues related to software applications and system performance.

  • Fine-tune applications to leverage HPC resources efficiently, ensuring maximum performance and resource utilization.

  • Design and implement benchmarking tests to measure the performance of applications and systems.

  • Analyze benchmark results to identify performance bottlenecks and areas for improvement.

  • Support and maintain technology standards, processes and policies related to the on-premises and cloud infrastructure in scope.

  • Contribute to international projects by providing consultancy regarding HPC infrastructure architectures (on premises and cloud).

  • Suggest system changes in accordance with documented SOPs.

  • Produce and maintain appropriate documentation and diagrams describing hardware setups and overall inventory.

 

Capabilities and Expertise:

  • Strong working knowledge of Linux server operating systems.

  • Strong scripting skills (Bash shell, Python).

  • Fortran and C/C++, MPI (OpenMPI, Intel MPI, MVAPICH).

  • Parallel debuggers (DDT, TotalView).

  • Various compilers (Intel, GNU, PGI).

  • HPC libraries (MKL, FFTW, HDF5).

  • Experience with configuration automation tools such as Ansible or Chef.

  • Knowledge of CPU/GPU architectures (Intel, AMD, ARM, NVIDIA) is a strong advantage.

 

Nice to have:

  • Initial experience in parallel programming, ideally in the field of high-performance computing

  • Experience with the development and use of HPC applications in the technical-scientific field

 

What we offer: 

  • Training and certifications: ongoing in-depth training on current and emerging products and technologies.

  • Hybrid schedule. 

  • Relocation bonus for candidates outside West Area.  

  • Extra benefits: Fixed sum available yearly to spend on vacation, travel, dentist, tech gadgets, etc. 

  • Performance-related bonus.

  • Extra vacation days.

 

Let's grow together.