Don't Worry About the Vase PodcastAIs Will Increasingly Fake Alignment1×0:00Current time: 0:00 / Total time: -1:43:10-1:43:10Audio playback is not supported on your browser. Please upgrade.AIs Will Increasingly Fake AlignmentAskwho Casts AIDec 24, 2024ShareTranscriptPodcast episode for AIs Will Increasingly Fake Alignment.DWAtV Podcast is a listener-supported podcast. To receive new posts and support the cost of creation, consider becoming a free or paid subscriber.SubscribeDon't Worry About the VaseAIs Will Increasingly Fake AlignmentThis post goes over the important and excellent new paper from Anthropic and Redwood Research, with Ryan Greenblatt as lead author, Alignment Faking in Large Language Models…Read morea year ago · 8 likes · 1 comment · Zvi Mowshowitzhttps://open.substack.com/pub/thezvi/p/ais-will-increasingly-fake-alignment?r=67y1h&utm_campaign=post&utm_medium=web&showWelcomeOnShare=trueMusic: The Road Home by Alexander Nakarada (www.serpentsoundstudios.com)Licensed under Creative Commons BY Attribution 4.0 Licensehttp://creativecommons.org/licenses/by/4Discussion about this episodeCommentsRestacksDon't Worry About the Vase PodcastPodcast for https://thezvi.substack.com/Podcast for https://thezvi.substack.com/SubscribeListen onSubstack AppApple PodcastsSpotifyYouTubePocket CastsRSS FeedAppears in episodeAskwho Casts AIRecent EpisodesOpus 4.7 Part 3: Model Welfare2 hrs ago • Askwho Casts AIOpus 4.7 Part 2: Capabilities and ReactionsApr 21 • Askwho Casts AIOpus 4.7 Part 1: The Model CardApr 20 • Askwho Casts AIAI #164: Pre OpusApr 17 • Askwho Casts AIOn Dwarkesh Patel's Podcast With Nvidia CEO Jensen HuangApr 16 • Askwho Casts AIClaude Code, Codex and Agentic Coding #7: Auto ModeApr 15 • Askwho Casts AIClaude Mythos #3: Capabilities and AdditionsApr 14 • Askwho Casts AIPolitical Violence Is Never AcceptableApr 13 • Askwho Casts AI