What you’ll be doing :
Assisting field business development in guiding the customer build / extend their GPU infrastructures.
Help customers automate workload management, GPU monitoring, system updates and availability.
Be an industry leader with vision on integrating NVIDIA technology into HPC and Deep Learning architectures to support various applications.
You will engage with customers to develop a keen understanding of their data centers and technical needs and help integrate our GPU ecosystem in their existing environments.
You will strategically partner with lighthouse customers and industry-specific hardware infrastructures and other solution partners targeting our computing platform.
Be an internal champion and help improve the NVIDIA GPU systems provisioning tools and automation software for Deep Learning and HPC among the NVIDIA technical community.
What we need to see :
MS or PhD in Engineering, Mathematics, Physics, or Computer Science.
3+ years of work related experience in hardware provisioning and software development or Machine learning or high performance computing, familiarity with CUDA is highly desirable.
Deep understanding of operating systems, computer networks, High Performance Applications and Deep Learning frameworks (Tensorflow, CNTK, Pytorch etc).
Familiarity with resource scheduling managers (Slurm (preferred), LSF, etc).
Demonstrated ability to script in bash and python.
Experience working with containers technology : Docker, Singularity, LXC.
Experience working with orchestration technologies like Kubernetes.
Experience with automating configuration management, infrastructure, and application deployments in a toolset such as Puppet, Chef, Ansible.
Exposure to GPU Computing and CUDA programming.
Experience working with supercomputing and technical computing customers.
Capable of working in a rapidly changing environment without losing focus.
Ability to multitask effectively in a dynamic environment.
Strong analytical and problem solving skills.
Strong time-management and organization skills for coordinating multiple initiatives, priorities and implementations of new technology and products into very complex projects.
Strong written and oral communications skills with the ability to effectively collaborate with management and engineering.
Ways to stand out from the crowd :
Experience working with large GPU installations.
Experience managing DGX servers.
Experience with NVIDIA software for cluster management and provisioning such as nvsm, dcgm and DeepOps.
Specialty skills in large scale computing and cluster computing, datacenter design.
NVIDIA is widely considered to be one of the technology world’s most desirable employers. We have some of the most brilliant and talented people in the world working for us.
If you're creative and autonomous, we want to hear from you!