The University of Texas at Austin

October 8, 2022, Filed Under: 2022 Fall Seminar, Seminars

Accelerating the Pace of AWS Inferentia Chip Development, From Concept to End Customer Use

Speaker: Randy Huang, Amazon (AWS)

Date: October 18, 2022 at 3:30pm

Location: EER 3.646

Abstract: In this talk, I will detail the process and the decisions we made to bring AWS Inferentia from a one-page press release to general availability. Our process starts by working backward from customers and asking how we can bring real benefits to their use cases. We will show that by separating one-way from two-way door decisions, we can navigate technical and strategic decisions at AWS velocity and bring a deep-learning accelerator to the marketplace quickly.

Bio: Randy is a principal engineer of Inferentia and Trainium, custom chips designed by AWS to enable highly cost-effective, low-latency inference and training performance at any scale. Prior to joining AWS, he led the architecture group at Tabula, designing and building three-dimensional field-programmable gate arrays (3-D FPGAs). Randy received his Ph.D. from the University of California, Berkeley.

