Director of Site Reliability Engineering, AI Infrastructure
oracle -
Santa Clara, CA, United States |
2024-07-31 06:25:25
Director of Site Reliability Engineering, AI Infrastructure job opportunities 2025, Director of Site Reliability Engineering, AI Infrastructure Jobs 2025, Director of Site Reliability Engineering, AI Infrastructure job opening 2025, Director of Site Reliability Engineering, AI Infrastructure job vacancies 2025, Director of Site Reliability Engineering, AI Infrastructure job descriptions 2025, Director of Site Reliability Engineering, AI Infrastructure job listing 2025 Oracle job opportunities 2025, Oracle Jobs 2025, Oracle job opening 2025, Oracle job vacancies 2025, Oracle job descriptions 2025, Oracle job listing 2025 Santa Clara, CA, United States job opportunities 2025, Santa Clara, CA, United States Jobs 2025, Santa Clara, CA, United States job opening 2025, Santa Clara, CA, United States job vacancies 2025, Santa Clara, CA, United States job descriptions 2025, Santa Clara, CA, United States job listing 2025, Australia Postal Service Jobs 2025, Australia Postal Service job opportunities 2025, Australia Postal Service job opening 2025, Australia Postal Service job vacancies 2025, Australia Postal Service job descriptions 2025, Australia Postal Service job listing 2025
For more information please click the link below
- MS or BS in Computer Science, or equivalent experience.
- 5+ years of experience managing technology teams.
- 10+ years of software engineering experience
- Proven experience as a Director of Site Reliability Engineering or a similar leadership role, with a track record of successfully managing and scaling SRE teams.
- Strong knowledge of cloud infrastructure, distributed systems, and network architecture.
- Demonstrated ability to manage and prioritize multiple projects and initiatives in a fast-paced, dynamic environment.
- Excellent problem-solving and troubleshooting skills, with the ability to analyze complex systems and identify areas for improvement.
- Strong leadership and communication skills, with the ability to effectively collaborate with cross-functional teams and influence decision-making at all levels of the organization.
- Experience in Nvidia training technologies (CUDA, NCCL).
- Working familiarity with networking protocols (TCP/IP, UDP, HTTP) and standard network architectures.
- Strong technical knowledge in distributed systems, high performance computing, and GPU systems.
- Experience in AI model training infrastructure
Career Level - M4
Director of Site Reliability Engineering, AI Infrastructure job opportunities 2025, Director of Site Reliability Engineering, AI Infrastructure Jobs 2025, Director of Site Reliability Engineering, AI Infrastructure job opening 2025, Director of Site Reliability Engineering, AI Infrastructure job vacancies 2025, Director of Site Reliability Engineering, AI Infrastructure job descriptions 2025, Director of Site Reliability Engineering, AI Infrastructure job listing 2025 Oracle job opportunities 2025, Oracle Jobs 2025, Oracle job opening 2025, Oracle job vacancies 2025, Oracle job descriptions 2025, Oracle job listing 2025 Santa Clara, CA, United States job opportunities 2025, Santa Clara, CA, United States Jobs 2025, Santa Clara, CA, United States job opening 2025, Santa Clara, CA, United States job vacancies 2025, Santa Clara, CA, United States job descriptions 2025, Santa Clara, CA, United States job listing 2025, Australia Postal Service Jobs 2025, Australia Postal Service job opportunities 2025, Australia Postal Service job opening 2025, Australia Postal Service job vacancies 2025, Australia Postal Service job descriptions 2025, Australia Postal Service job listing 2025
For more information please click the link below