CUDA Engineer — AI Frameworks

NVIDIA·Santa Clara, CA

AI EngineeringSeniorFull-time
$200K-$350KPosted 2 weeks ago

About the Role

NVIDIA is seeking a CUDA Engineer to optimize deep learning frameworks for GPU acceleration. You will work on kernel optimization, memory management, and performance tuning for training and inference workloads on NVIDIA's latest GPU architectures.

Requirements

  • 5+ years of CUDA/C++ programming experience
  • Deep understanding of GPU architecture and parallel computing
  • Experience optimizing deep learning kernels
  • Strong background in computer architecture
  • BS/MS/PhD in Computer Science or Electrical Engineering

Nice to Have

  • Experience with TensorRT or Triton inference server
  • Contributions to PyTorch or TensorFlow GPU backends
  • Knowledge of mixed-precision training techniques
  • Experience with NVIDIA's Hopper or Blackwell architectures

Benefits

Stock grants in a leading semiconductor company
Comprehensive benefits package
Employee stock purchase plan
On-site gym and cafeteria
Patent bonuses
Education reimbursement

Skills

CUDAC++GPU ComputingDeep LearningPerformance OptimizationPython

Related Jobs

Preparing for Your AI Career?

Vincony has all 400+ AI models in one place — compare responses, AI debate, Image/Video/Voice generator, and 20 more tools to help you learn and build with AI.

Visit Vincony.com