Artwork

Indhold leveret af Conviction. Alt podcastindhold inklusive episoder, grafik og podcastbeskrivelser uploades og leveres direkte af Conviction eller deres podcastplatformspartner. Hvis du mener, at nogen bruger dit ophavsretligt beskyttede værk uden din tilladelse, kan du følge processen beskrevet her https://da.player.fm/legal.
Player FM - Podcast-app
Gå offline med appen Player FM !

Speed will win the AI computing battle with Tuhin Srivastava from Baseten

38:32
 
Del
 

Manage episode 408090527 series 3444082
Indhold leveret af Conviction. Alt podcastindhold inklusive episoder, grafik og podcastbeskrivelser uploades og leveres direkte af Conviction eller deres podcastplatformspartner. Hvis du mener, at nogen bruger dit ophavsretligt beskyttede værk uden din tilladelse, kan du følge processen beskrevet her https://da.player.fm/legal.

At a time when users are being asked to wait unthinkable seconds for AI products to generate art and answers, speed is what will win the battle heating up in AI computing. At least according to today’s guest, Tuhin Srivastava, the CEO and co-founder of Baseten which gives customers scalable AI infrastructures starting with interference. In this episode of No Priors, Sarah, Elad, and Tuhin discuss why efficient code solutions are more desirable than no code, the most surprising use cases for Baseten, and why all of their jobs are very defensible from AI.

Show Links:

Sign up for new podcasts every week. Email feedback to show@no-priors.com

Follow us on Twitter: @NoPriorsPod | @Saranormous | @EladGil | @tuhinone

Show Notes:

(0:00) Introduction

(1:19) Capabilities of efficient code enabled development

(4:11) Difference in training inference workloads

(6:12) AI product acceleration

(8:48) Leading on inference benchmarks at Baseten

(12:08) Optimizations for different types of models

(16:11) Internal vs open source models

(19:01) timeline for enterprise scale

(21:53) Rethinking investment in compute spend

(27:50) Defensibility in AI industries

(31:30) Hardware and the chip shortage

(35:47) Speed is the way to win in this industry

(38:26) Wrap

  continue reading

99 episoder

Artwork
iconDel
 
Manage episode 408090527 series 3444082
Indhold leveret af Conviction. Alt podcastindhold inklusive episoder, grafik og podcastbeskrivelser uploades og leveres direkte af Conviction eller deres podcastplatformspartner. Hvis du mener, at nogen bruger dit ophavsretligt beskyttede værk uden din tilladelse, kan du følge processen beskrevet her https://da.player.fm/legal.

At a time when users are being asked to wait unthinkable seconds for AI products to generate art and answers, speed is what will win the battle heating up in AI computing. At least according to today’s guest, Tuhin Srivastava, the CEO and co-founder of Baseten which gives customers scalable AI infrastructures starting with interference. In this episode of No Priors, Sarah, Elad, and Tuhin discuss why efficient code solutions are more desirable than no code, the most surprising use cases for Baseten, and why all of their jobs are very defensible from AI.

Show Links:

Sign up for new podcasts every week. Email feedback to show@no-priors.com

Follow us on Twitter: @NoPriorsPod | @Saranormous | @EladGil | @tuhinone

Show Notes:

(0:00) Introduction

(1:19) Capabilities of efficient code enabled development

(4:11) Difference in training inference workloads

(6:12) AI product acceleration

(8:48) Leading on inference benchmarks at Baseten

(12:08) Optimizations for different types of models

(16:11) Internal vs open source models

(19:01) timeline for enterprise scale

(21:53) Rethinking investment in compute spend

(27:50) Defensibility in AI industries

(31:30) Hardware and the chip shortage

(35:47) Speed is the way to win in this industry

(38:26) Wrap

  continue reading

99 episoder

Alle episoder

×
 
Loading …

Velkommen til Player FM!

Player FM is scanning the web for high-quality podcasts for you to enjoy right now. It's the best podcast app and works on Android, iPhone, and the web. Signup to sync subscriptions across devices.

 

Hurtig referencevejledning

Lyt til dette show, mens du udforsker
Afspil