The Philosopher Teaching AI to Be Good

Episode 222 · Feb 14, 08:30 AM

AI company Anthropic has a new, values-oriented “constitution” that they’re feeding their chatbot, Claude. Amanda Askell, the company’s in-house philosopher, joins Offline to talk about what it means to teach ethics to an LLM, whether the AI skews more human or more robot, and how she is training Claude to make its own judgements. Breaking with other AI models—and social media’s attention obsession—Amanda is trying to teach Claude not to be sycophantic or engagement-driven, but a kind soul who may, one day, be considered sentient.

For a closed-captioned version of this episode, click here. For a transcript of this episode, please email transcripts@crooked.com and include the name of the podcast.

The Philosopher Teaching AI to Be Good

Share

Subscribe

Share

Subscribe

Next

Zuckerberg Takes the Stand, Pete Hegseth vs. AI, and Max-Maxxing with Max Fisher

Top episodes

Healing Our Broken Brains

Trump's Memeification of War

Adam Friedland Just Wants to Understand

Sorry, your browser isn't supported.

Page load failed