Project Description:
Luxoft is searching for talented developers with GPU compute and performance profiling experience to join the rapidly growing team.
We are seeking an experienced individual proficient in HIP / ROCm applications to join our team. The primary responsibility of this role will be to lead the effort in porting CUDA kernels to HIP. The candidate should possess a strong background in GPU computing, parallel programming, and a deep understanding of CUDA or HIP frameworks. Additionally, familiarity with optimization techniques is highly desirable.
Responsibilities:
• Help port CUDA kernels to HIP (the main task).
• Collaborate with development teams to optimize and enhance GPU-accelerated applications.
• Debug, profile, and fine-tune code for performance improvements.
• Stay updated with the latest advancements in GPU architectures and programming models.
Mandatory Skills:
• CUDA or HIP
• GPGPU
• C/C++
• Python
• One of AI/ML/DL/NN/NLP/Computer Vision
Mandatory Skills Description:
• Proficiency with C++ and GPU Assembler
• Proficiency in CUDA or HIP / ROCm programming
• Solid understanding of GPU architectures, parallel programming models, and optimization techniques
• Strong problem-solving skills and the ability to work in a collaborative environment
Nice-to-Have Skills Description:
• Linux
• CPU Intrinsics (AVX/SSE)
• GPU Assembler
Languages:
• English: B2 Upper Intermediate