Would you be interested in joining the Luxoft team? If so, this opportunity could be the right fit for you! 👋
Luxoft is one of the major software services companies world-wide. We deliver professional software services in multiple business verticals such as finance, automotive, and digital transformation.
What do we offer our Employees?
👩⚕️ LuxMed health insurance (dental care & travel insurance included), life insurance🧗🏻♀️ MyBenefit program (Multisport card or Multicafeteria)💸 Paid referrals (also for external referrers)🌎 Global relocation and Internal Mobility Program (switching between projects, locations)🎓 LuxTalent platform (webinars, training, courses, paid certificates)... and many more! 💥
Project DescriptionLuxoft is looking for an individual to join a hardworking team developing Deep Learning and High-Performance Computing GPU kernels on the AMD Radeon Open Compute (ROCm) platform for MIOpen and Composable Kernel, AMD's Deep Learning primitives libraries which provides highly optimized implementations of different operators.
• https://github.com/ROCmSoftwarePlatform• https://github.com/ROCmSoftwarePlatform/composable_kernel• https://github.com/ROCmSoftwarePlatform/MIOpen
The successful person will be an experienced GPU-compute programmer with an eye towards hardware-aware performance optimizations.
ResponsibilitiesThe ideal candidate will be responsible for writing high performance GPU kernels for Machine Learning and Deep Learning Library: MIOpen and Composable KernelThey will be porting and optimizing algorithms for new GPU hardwarePerform code reviews, building unit tests, authoring detailed documentation related to their work, and working with on-site and off-shore teams to deliver the software solutions on schedule.They will play a key role in all phases of the software development, including system requirements analysis, coordinating feature design and development across functional and organization boundaries.
Mandatory Skills DescriptionStrong programming skills in modern C++ (templates, compile-time optimizations)In-depth knowledge about CUDA/HIP and or OpenCLExperience in parallel computing on GPUs or HW accelerators and/or HPC (High Performance Computation)Detailed knowledge about GPU/accelerators hardware architecture from computational perspectiveExtensive experience with parallel programming techniques and optimizationsUnderstanding of Linear Algebra routines on tensorsExperience using version control software such as GitBasic understanding of Linux internals, Servers, and DebuggingBasic knowledge of software development lifecycle, SW practices including debug, test, revision control, documentation, and bug trackingGood teamwork and interpersonal skills requiredAbility to work independently and within complementary teams
Nice-to-Have SkillsDemonstrate flexibility, strong motivation and a proven track record of meeting results-oriented deadlines.Familiarity with deep neural network machine learning technologies, architectures and modern machine learning programming frameworksExperience working with and developing virtualization containers and package managers for code deployment
LanguagesEnglish - B2 Upper Intermediate