DWAtV Podcast
Don't Worry About the Vase Podcast
AIs Will Increasingly Fake Alignment
0:00
-1:43:10

AIs Will Increasingly Fake Alignment

Podcast episode for AIs Will Increasingly Fake Alignment.

DWAtV Podcast is a listener-supported podcast. To receive new posts and support the cost of creation, consider becoming a free or paid subscriber.

Don't Worry About the Vase
AIs Will Increasingly Fake Alignment
This post goes over the important and excellent new paper from Anthropic and Redwood Research, with Ryan Greenblatt as lead author, Alignment Faking in Large Language Models…
Read more

https://open.substack.com/pub/thezvi/p/ais-will-increasingly-fake-alignment?r=67y1h&utm_campaign=post&utm_medium=web&showWelcomeOnShare=true

Music: The Road Home by Alexander Nakarada (www.serpentsoundstudios.com)

Licensed under Creative Commons BY Attribution 4.0 License

http://creativecommons.org/licenses/by/4

Discussion about this episode

User's avatar

Ready for more?