Added three years ago
Content provided by Joe Carlsmith. All podcast content, including episodes, graphics, and podcast descriptions, is uploaded and provided directly by Joe Carlsmith or their podcast platform partner. If you believe someone is using your copyrighted work without your permission, you can follow the process described here: https://da.player.fm/legal.
How do we solve the alignment problem?
Introduction to a series of essays about paths to safe and useful superintelligence.
Text version here: https://joecarlsmith.substack.com/p/how-do-we-solve-the-alignment-problem
63 episodes
All episodes
Joe Carlsmith Audio

We should try extremely hard to use AI labor to help address the alignment problem. Text version here: https://joecarlsmith.com/2025/03/14/ai-for-ai-safety

On the structure of the path to safe superintelligence, and some possible milestones along the way. Text version here: https://joecarlsmith.substack.com/p/paths-and-waystations-in-ai-safety

When should we worry about AI power-seeking? (46:54)
Examining the conditions required for rogue AI behavior. Text version here: https://joecarlsmith.substack.com/p/when-should-we-worry-about-ai-power

What is it to solve the alignment problem? (40:13)
Also: to avoid it? Handle it? Solve it forever? Solve it completely? Text version here: https://joecarlsmith.substack.com/p/what-is-it-to-solve-the-alignment

Introduction to a series of essays about paths to safe and useful superintelligence. Text version here: https://joecarlsmith.substack.com/p/how-do-we-solve-the-alignment-problem

Fake thinking and real thinking (1:18:47)
When the line pulls at your hand. Text version here: https://joecarlsmith.com/2025/01/28/fake-thinking-and-real-thinking/.

Takes on "Alignment Faking in Large Language Models" (1:27:54)
What can we learn from recent empirical demonstrations of scheming in frontier models? Text version here: https://joecarlsmith.com/2024/12/18/takes-on-alignment-faking-in-large-language-models/

(Part 2, AI takeover) Extended audio from my conversation with Dwarkesh Patel (2:07:33)
Extended audio from my conversation with Dwarkesh Patel. This part focuses on the basic story about AI takeover. Transcript available on my website here: https://joecarlsmith.com/2024/09/30/part-2-ai-takeover-extended-audio-transcript-from-my-conversation-with-dwarkesh-patel

(Part 1, Otherness) Extended audio from my conversation with Dwarkesh Patel (3:58:38)
Extended audio from my conversation with Dwarkesh Patel. This part focuses on my series "Otherness and control in the age of AGI." Transcript available on my website here: https://joecarlsmith.com/2024/09/30/part-1-otherness-extended-audio-transcript-from-my-conversation-with-dwarkesh-patel/

Introduction and summary for "Otherness and control in the age of AGI" (12:23)
This is the introduction and summary for my series "Otherness and control in the age of AGI." Text version here: https://joecarlsmith.com/2024/01/02/otherness-and-control-in-the-age-of-agi

Second half of full audio for "Otherness and control in the age of AGI" (4:11:02)
Second half of the full audio for my series on how agents with different values should relate to one another, and on the ethics of seeking and sharing power. First half here: https://joecarlsmithaudio.buzzsprout.com/2034731/15266490-first-half-of-full-audio-for-otherness-and-control-in-the-age-of-agi PDF of the full series here: https://jc.gatspress.com/pdf/otherness_full.pdf Summary of the series here: https://joecarlsmith.com/2024/01/02/otherness-and-control-in-the-age-of-agi…

First half of full audio for "Otherness and control in the age of AGI" (3:07:29)
First half of the full audio for my series on how agents with different values should relate to one another, and on the ethics of seeking and sharing power. Second half here: https://joecarlsmithaudio.buzzsprout.com/2034731/15272132-second-half-of-full-audio-for-otherness-and-control-in-the-age-of-agi PDF of the full series here: https://jc.gatspress.com/pdf/otherness_full.pdf Summary of the series here: https://joecarlsmith.com/2024/01/02/otherness-and-control-in-the-age-of-agi…

Loving a world you don't trust (1:03:54)
Garden, campfire, healing water. Text version here: https://joecarlsmith.com/2024/06/18/loving-a-world-you-dont-trust This essay is part of a series I'm calling "Otherness and control in the age of AGI." I'm hoping that individual essays can be read fairly well on their own, but see here for brief text summaries of the essays that have been released thus far: https://joecarlsmith.com/2024/01/02/otherness-and-control-in-the-age-of-agi…
Examining a certain kind of meaning-laden receptivity to the world. Text version here: https://joecarlsmith.com/2024/03/25/on-attunement This essay is part of a series I'm calling "Otherness and control in the age of AGI." I'm hoping that individual essays can be read fairly well on their own, but see here for brief text summaries of the essays that have been released thus far: https://joecarlsmith.com/2024/01/02/otherness-and-control-in-the-age-of-agi (Though: note that I haven't put the summary post on the podcast yet.)…
Examining a philosophical vibe that I think contrasts in interesting ways with "deep atheism." Text version here: https://joecarlsmith.com/2024/03/21/on-green This essay is part of a series I'm calling "Otherness and control in the age of AGI." I'm hoping that individual essays can be read fairly well on their own, but see here for brief text summaries of the essays that have been released thus far: https://joecarlsmith.com/2024/01/02/otherness-and-control-in-the-age-of-agi (Though: note that I haven't put the summary post on the podcast yet.)…