Dice is the leading career destination for tech experts at every stage of their careers. Our client, IStream Solutions Inc, is seeking the following. Apply via Dice today!
Lead/Senior Engineer - Generative AI Product Engineering (FULLY REMOTE)
Our mission at Client is to create trustworthy, reliable and human-in-the-loop AI systems, changing banking for good. For years, Client has been leading the industry in using machine learning to create real-time, intelligent, automated customer experiences. From informing customers about unusual charges to answering their questions in real time, our applications of AI & ML are bringing humanity and simplicity to banking. Because of our investments in public cloud infrastructure and machine learning platforms, we are now uniquely positioned to harness the power of AI. We are committed to building world-class applied science and engineering teams and continue delivering our industry-leading capabilities with breakthrough product experiences and scalable, high-performance AI infrastructure. At Client, you will help bring the transformative power of emerging AI capabilities to reimagine how we serve our customers and businesses who have come to love the products and services we build.
We are looking for an experienced Lead Generative AI Engineer to help build and maintain APIs and SDKs to train, fine-tune and access AI models at scale. You will work as part of our Enterprise AI team and build systems that will enable our users to work with Large Language Models (LLMs) and Foundation Models (FMs), using our public cloud infrastructure. You will work with a team of world-class AI engineers and researchers to design and implement key API products and services that enable real-time customer-facing applications. Examples of projects you will work on include:
Architect, build and deploy well-managed core APIs and SDKs, including orchestration SDKs, to access LLMs and our proprietary FMs for training, fine-tuning and prompting tasks.
Design APIs for performance, real-time applications, scale, ease of use and governance automation.
Develop application-specific interfaces that leverage LLMs and FMs to continue to enhance the associate and customer experience.
Enable our users to build new GenAI capabilities.
Develop tools and processes to monitor API access patterns and operational health.
Design and implement AI safety and guardrails in the API layer working closely with researchers.
Basic Qualifications:
Bachelor's degree in Computer Science, Computer Engineering or a technical field
At least 6 years of experience designing and building data-intensive solutions using distributed computing and cache optimization techniques
At least 6 years of experience programming with Python, Go, Scala, or Java
At least 1 year of experience building, scaling, and optimizing training or inferencing systems for deep neural networks
Preferred Qualifications:
Familiarity with building large-scale AI products or platforms for NLP, speech, computer vision, or recommendation systems serving millions of users.
Ability to move fast in an environment that is at times ambiguous, with competing priorities and deadlines.
Experience at tech and product-driven companies/startups preferred.
Ability to iterate rapidly with researchers and engineers to improve a product experience while building the foundational capabilities.
Familiarity with deploying large neural network models in demanding production environments.
Experience with API security, observability, cloud access control and privacy best practices.
At this time, Client will not sponsor a new applicant for employment authorization for this position.
Overview:
At Client, we are creating trustworthy and reliable AI systems, changing banking for good. For years, Client has been leading the industry in using machine learning to create real-time, AI-powered customer experiences. Our investments in technology infrastructure and world-class talent along with our deep experience in machine learning position us to be at the forefront of enterprises leveraging AI. From informing customers about unusual charges to answering their questions in real time, our applications of AI & ML are bringing humanity and simplicity to banking. We are committed to building world-class applied science and engineering teams and continue delivering our industry-leading capabilities with breakthrough product experiences and scalable, high-performance AI infrastructure. At Client, you will help bring the transformative power of emerging AI capabilities to reimagine how we serve our customers and businesses who have come to love the products and services we build.
Team Description:
The Intelligent Foundations and Experiences (IFX) team is at the center of bringing our vision for AI at Client to life. We work hand-in-hand with our partners across the company to advance the state of the art in science and AI engineering, and we build and deploy proprietary solutions that are central to our business and deliver value to millions of customers. Our AI models and platforms empower teams across Client to enhance their products with the transformative power of AI, in responsible and scalable ways for the highest leverage impact.
In this role, you will:
Partner with a cross-functional team of engineers, research scientists, technical managers, and product managers to deliver AI-powered products that change how employees work and how customers interact with Client.
Design, develop, test, deploy, and support AI software components including foundation model training, large language model inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability.
Leverage a broad stack of open-source and SaaS AI technologies such as AWS UltraClusters, Hugging Face, VectorDBs, NeMo Guardrails, PyTorch, and more.
Invent and introduce state-of-the-art LLM optimization techniques to improve the performance, scalability, cost, latency, and throughput of large-scale production AI systems.
Contribute to the technical vision and the long-term roadmap of foundational AI systems at Client.
Generative AI Developer/Architect