CUDA Engineer — AI Frameworks

NVIDIA · Santa Clara, CA

AI Engineering · Senior · Full-time

$200K-$350K · Posted 2 weeks ago

About the Role

NVIDIA is seeking a CUDA Engineer to optimize deep learning frameworks for GPU acceleration. You will work on kernel optimization, memory management, and performance tuning for training and inference workloads on NVIDIA's latest GPU architectures.

Requirements

5+ years of CUDA/C++ programming experience
Deep understanding of GPU architecture and parallel computing
Experience optimizing deep learning kernels
Strong background in computer architecture
BS/MS/PhD in Computer Science or Electrical Engineering

Nice to Have

Experience with TensorRT or Triton Inference Server
Contributions to PyTorch or TensorFlow GPU backends
Knowledge of mixed-precision training techniques
Experience with NVIDIA's Hopper or Blackwell architectures

Benefits

Stock grants in a leading semiconductor company
Comprehensive benefits package
Employee stock purchase plan
On-site gym and cafeteria
Patent bonuses
Education reimbursement

Skills

CUDA · C++ · GPU Computing · Deep Learning · Performance Optimization · Python
