Engineering Director: Developer GPU Platform (San Jose) Job at AMD, San Jose, CA

QmQ4cnZnQzQwTVJMdkhLVTlScTFIem5oRlE9PQ==
  • AMD
  • San Jose, CA

Job Description

Overview

This range is provided by AMD. Your actual pay will be based on your skills and experience talk with your recruiter to learn more.

WHAT YOU DO AT AMD CHANGES EVERYTHING At AMD, our mission is to build great products that accelerate next-generation computing experiencesfrom AI and data centers, to PCs, gaming and embedded systems. Grounded in a culture of innovation and collaboration, we believe real progress comes from bold ideas, human ingenuity and a shared passion to create something extraordinary. When you join AMD, youll discover the real differentiator is our culture. We push the limits of innovation to solve the worlds most important challengesstriving for execution excellence, while being direct, humble, collaborative, and inclusive of diverse perspectives. Join us as we shape the future of AI and beyond.

Together, we advance your career.

The Team

Our team, AIG SHARK, aims to build AI software solutions that are unified, performant, flexible, and customizable while also being the catalyst of AI software development across AMD. Open source is at the heart of everything we do and is critical to our success. We believe in the power of the community and together we can build a great stack that benefits everyone. We embrace a fast-paced, agile approach, leveraging modern development practices to deliver value quickly. Being a part of AMD and AIG, it also means we have a wide portfolio of hardware and customers to support realized product excellence and business impacts.

The Role

We are looking for a hands-on Engineering Director to help drive the future of how we manage GPUs within AMD. We are building a self-service GPU platform and while we have many of the lego pieces needed to bootstrap core aspects of it, foundational components need to be built from the ground up and all of them need to be integrated into a holistic platform. We have secured funding to help build it out we just need someone to help drive the ship. Note, this is an internal developer platform and a core part of this job is not only building the platform but working with tenants to help scale their workflows to be scale-ready.

The Impact You Will Have

  • Shift capacity management of our fleet from assigning teams to machines through a manual process to a streamlined self-service model leveraging virtual currency across organizations.
  • Drive up the effective utilization of our overall fleet of GPU machines to accelerate software innovation within AMD.
  • Direct partner strategy on lego pieces to incorporate buy vs build decisions over time.
  • Grow developers within AMD to shift to cloud-native workflows.
  • Grow leaders to own major components of a distributed platform stack.

The Person

The ideal candidate will have familiarity with building internal developer platforms for GPUs and expertise in fleet and capacity management within a large enterprise. They will not only rely on their leaders but also be capable of being hands-on with the design of the systems they manage.

Key Responsibilities

  • Architect the overall platform and identify key integration points where we can integrate technologies of the larger ecosystem vs. build our own.
  • Build out small- to medium-sized teams to close missing gaps needed to build the overall platform being hands-on as needed to bootstrap those efforts.
  • Define and track key performance metrics for fleet utilization, availability and reliability.
  • Work closely with developers to understand their pain points and challenges with development at scale.

Required Qualifications

  • 10+ years of experience in software engineering
  • 5+ years of software engineering management
  • Expertise in building internal development platforms
  • Expertise in Go/Python

Preferred Qualifications

  • Expertise with the challenges of molding developer workflows to be cloud-native
  • Expertise with GPU developer platforms

Benefits offered are described: AMD benefits at a glance.

AMD does not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services. AMD and its subsidiaries are equal opportunity, inclusive employers and will consider all applicants without regard to age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and/or third-party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status, or any other characteristic protected by law. We encourage applications from all qualified candidates and will accommodate applicants needs under the respective laws throughout all stages of the recruitment and selection process.

#J-18808-Ljbffr

Job Tags

Full time, Flexible hours, Shift work,

Similar Jobs

Infinity Staffing Professionals

Electrical Controls Engineer Job at Infinity Staffing Professionals

 ...Job Description Job Description Electrical Controls Engineer Responsibilities The Electrical Controls Engineer is responsible...  ...Legally authorized to work in the United States (no visa sponsorship available) Preferred Qualifications Bachelors degree in... 

Goodwill of North Georgia

Remote Accounts Payable & AR Specialist Job at Goodwill of North Georgia

A nonprofit organization is seeking an Accounts Payable Specialist in Decatur, Georgia. This role involves processing invoices, maintaining...  ...skills. The position may require work in an office environment or remotely, offering flexibility in work conditions.#J-18808-Ljbffr... 

Clean Harbors

Emergency Response Class A CDL Equipment Operator Job at Clean Harbors

 ...Clean Harbors in Kaukana, WI is seeking an Emergency Response Class A CDL Equipment Operator to operate a variety of heavy and light duty trucks/work equipment at our customer sites; some of the vehicles operated include vacuum trucks, Cuscos, guzzlers, and roll offs... 

Raytheon

RF Hardware Design & Test Engineer Job at Raytheon

 ...RF Hardware Design & Test Engineer at Raytheon summary: The RF Hardware Design & Test Engineer designs, analyzes, and tests RF and microwave components, circuits, and subsystems, supporting the development of radar receiver and signal processing technologies. Responsibilities... 

Ardán, Inc., A Community of Companies

Title Curative Specialist - Grid151 Job at Ardán, Inc., A Community of Companies

 ...POSITION SUMMARY: This role requires an experienced Title Curative Specialist relying on their understanding of title underwriting standards and title insurer underwriting guidelines to determine condition of title, and insurability while considering risk and liability...