AI Summary
Powered by ClaudeYou will design, operate, and debug large-scale GPU infrastructure used for distributed training and inference, working directly with customers pushing the limits of modern AI systems. What Youâll Own GPU Cluster Architecture: Design and evolve multi-provider, multi-region GPU compute clusters optimized for large-scale training.
Job description
You will design, operate, and debug large-scale GPU infrastructure used for distributed training and inference, working directly with customers pushing the limits of modern AI systems. What Youâll Own
GPU Cluster Architecture: Design and evolve multi-provider, multi-region GPU compute clusters optimized for large-scale training.
Get a weekly digest of similar roles
Save this search for Senior Site Reliability Engineer AI Infrastructure in San Francisco around $0–$0 and get the strongest matches every week.
Privacy-first. Unsubscribe anytime.