đ Are you already in Poland or ready to relocate? If YES this is the project for YOU!
Our benefits:đŠââď¸ Private Medical Care in Luxmed and Life Insuranceđď¸ââď¸ Multisport Cardđ¨âđ§âđŚ Paid referralsđ Self-learning librariesđŤ Relocation package for seniors and assistance during all process...and MORE!
đ Location: Remote in Poland
Project Description:Luxoft is looking for an individual to join a hardworking team developing Deep Learning and High-Performance Computing GPU kernels on the AMD Radeon Open Compute (ROCm) platform for MIOpen and Composable Kernel, AMD's Deep Learning primitives libraries which provides highly optimized implementations of different operators.⢠https://github.com/ROCmSoftwarePlatform⢠https://github.com/ROCmSoftwarePlatform/composable_kernel⢠https://github.com/ROCmSoftwarePlatform/MIOpen
The successful person will be an experienced GPU-compute programmer with an eye towards hardware-aware performance optimizations.
Responsibilities:The ideal candidate will be responsible for writing high performance GPU kernels for Machine Learning and Deep Learning Library: MIOpen and Composable Kernel⢠They will be porting and optimizing algorithms for new GPU hardware⢠Perform code reviews, building unit tests, authoring detailed documentation related to their work, and working with on-site and off-shore teams to deliver the software solutions on schedule.⢠They will play a key role in all phases of the software development including system requirements analysis, coordinating feature design and development across functional and organization boundaries.
Mandatory Skills Description:* Strong programming skills in modern C++ (templates, compile-time optimizations)* In-depth knowledge about one of the parallel programming technologies CUDA/HIP/OpenCL/SYCL etc.* Experience in parallel computing on multi-core/multi-node architectures (GPU/DL accelerators, computer clasters)* Experience with parallel programming techniques and optimizations* Understanding of Linear Algebra routines on tensors ("general algorithms" knowledge, not necessarily Linear Algebra)* Good teamwork and interpersonal skills required* Ability to work independently and within complementary teams
Nice-to-Have Skills:* Strong programming skills in modern C++ (templates, compile-time optimizations)* In-depth knowledge about one of the parallel programming technologies CUDA/HIP/OpenCL/SYCL etc.* Experience in parallel computing on multi-core/multi-node architectures (GPU/DL accelerators, computer clasters)* Experience with parallel programming techniques and optimizations* Understanding of Linear Algebra routines on tensors ("general algorithms" knowledge, not necessarily Linear Algebra)* Good teamwork and interpersonal skills required* Ability to work independently and within complementary teams