ML Compiler Engineer

Cohere·Toronto, Canada

AI EngineeringSeniorFull-timeRemote
CA$170K-CA$280KPosted 2 months ago

About the Role

Cohere is hiring an ML Compiler Engineer to optimize model inference through compiler techniques. You will develop custom optimizations for transformer models, implement efficient kernel generation, and work on cross-platform model compilation.

Requirements

  • 5+ years of compiler or systems engineering experience
  • Strong C++ and Python skills
  • Experience with LLVM, MLIR, or similar frameworks
  • Understanding of GPU architectures and parallel computing
  • Experience with model compilation or optimization

Nice to Have

  • Experience with Triton or custom CUDA kernels
  • Background in auto-tuning or performance optimization
  • Familiarity with quantization-aware compilation
  • Experience with multiple hardware backends (NVIDIA, AMD, custom)

Benefits

Equity in a well-funded AI company
Comprehensive benefits
Remote-first culture
Home office budget
Conference and travel budget
Flexible PTO

Skills

CompilersLLVMC++CUDAModel OptimizationPython

Related Jobs

Preparing for Your AI Career?

Vincony has all 400+ AI models in one place — compare responses, AI debate, Image/Video/Voice generator, and 20 more tools to help you learn and build with AI.

Visit Vincony.com