20 个优质engineer — high performance gpu communication职位（正在招聘！）

Deep Learning Performance Architect, CUTLASS DSL
NVIDIA —上海市
- 全职
12天
Manager, Formal Verification
NVIDIA —上海市
- 全职
Senior SOC Design Engineer
NVIDIA —上海市
- 全职
22089-AI Application SW Engineer
Mettler-Toledo (Changzhou) Measurement Technology Ltd. —常州市
快速申请
SI R&D/PRODUCT DVL ENGINEER
TE Connectivity —上海市
Senior Software Engineer, NCCL
NVIDIA —上海市
- 全职
SR FIELD APPLICATION ENGINEER
TE Connectivity —上海市
Principal Software Engineer
Cadence Design Systems —北京市
- 全职
ASIC Floorplan Design Engineer
NVIDIA —上海市
- 全职
Senior System Software Engineer, Robotics
NVIDIA —上海市
- 全职
12天
Senior System Software Engineer - AI Performance and Efficiency Tools
NVIDIA —上海市
- 全职
STAFF R&D/PRODUCT DVL ENGINEER
TE Connectivity —上海市
Senior Solutions Architect, GPU System
NVIDIA —上海市
- 全职
VFX Artist - The Sims
Electronic Arts —上海市
Applications Engineering, Sr Staff Engineer
Synopsys —北京市
Senior Staff AI Software System Design Engineer
Advanced Micro Devices, Inc —上海市
11天
Senior Field Engineer (Controls)
Overview Corporation —成都市
- 全职
快速申请
Staff Machine Learning Engineer, ML Infrastructure - Online
Unity Technologies —上海市
5天
Senior Supplier Quality Engineer - PCB
NVIDIA —深圳市
- 全职
Solutions Architect – Accelerated Computing Libraries TPM
NVIDIA —上海市
- 全职

我想收到 engineer — high performance gpu communication 的最新职位提醒

一旦登录您的账户，即表明您同意 SimplyHired 的服务条款和我们的Cookie 协议及隐私政策。

Deep Learning Performance Architect, CUTLASS DSL

NVIDIA -
上海市

立即申请

职位详情

全职
12 天前

完整职位描述

Are you passionate about programming languages, compiler technology, and GPU performance? Do you want to help shape the future of high-performance kernel development for AI? We are looking for outstanding engineers to build CUTLASS DSL, a Python-native language for GPU kernel development, along with the MLIR dialects and lowering passes behind it. In this role, you will also help accelerate kernel compilation while delivering performance comparable to CUTLASS C++, enabling efficient hardware-software co-design for NVIDIA's next generation of AI platforms.

What you'll be doing:

Design, develop, and optimize CUTLASS DSL, a Python-native language for high-performance GPU kernel development
Build and advance the MLIR dialects, lowering passes, and code generation flows that power the CUTLASS DSL stack
Drive innovations that improve kernel compilation speed while maintaining performance on par with CUTLASS C++
Collaborate closely with architecture, research, software product teams, and the open-source community to bring cutting-edge optimizations into real products

What we need to see:

MS, PhD, or equivalent experience in Computer Science, Software Engineering, or a related field
2+ years of relevant work experience
Excellent programming skills in Python and strong proficiency in C++
Hands-on experience with DSLs, compilers, or code generation systems
Strong command of the MLIR/LLVM stack, including IR design and pass optimization
Strong communication skills and the ability to thrive in a highly collaborative environment

Ways to stand out from the crowd:

Deep understanding of the CUDA GPU programming model, GPU microarchitecture, and performance analysis and optimization techniques
Familiarity with key high-performance computing abstractions such as Layout, Tile, MMA, and TMA in the CuTe ecosystem

立即申请

完善您的搜索

engineer — high performance gpu communication 个职位

Deep Learning Performance Architect, CUTLASS DSL

Manager, Formal Verification

Senior SOC Design Engineer

22089-AI Application SW Engineer

SI R&D/PRODUCT DVL ENGINEER

Senior Software Engineer, NCCL

SR FIELD APPLICATION ENGINEER

Principal Software Engineer

ASIC Floorplan Design Engineer

Senior System Software Engineer, Robotics

Senior System Software Engineer - AI Performance and Efficiency Tools

STAFF R&D/PRODUCT DVL ENGINEER

Senior Solutions Architect, GPU System

VFX Artist - The Sims

Applications Engineering, Sr Staff Engineer

Senior Staff AI Software System Design Engineer

Senior Field Engineer (Controls)

Staff Machine Learning Engineer, ML Infrastructure - Online

Senior Supplier Quality Engineer - PCB

Solutions Architect – Accelerated Computing Libraries TPM

我想收到 engineer — high performance gpu communication 的最新职位提醒

Related Searches

求职者工具

雇主工具

浏览

保持联系