Don't Worry About the Vase Podcast
Claude Mythos #3: Capabilities and Additions

  • 00:00 - Introduction

  • 02:05 - Epoch Capabilities Index (ECI) (Model Card 2.3.6)

  • 04:52 - What Do You Mean Verbalized Evaluation Awareness Is Going Down

  • 05:43 - Capabilities (Model Card Section 6)

  • 08:38 - Agentic Safety Benchmarks (8.3)

  • 10:57 - Is Mythos AGI?

  • 11:56 - Are AI Companies Using Warnings As Hype?

  • 12:56 - Impressions (Model Card Section 7)

  • 16:13 - Blatant Denials Are The Best Kind

  • 17:13 - Prompt Injection Robustness

  • 18:21 - Does Mythos Cross The New Knowledge Threshold?

  • 19:10 - Is Mythos Surprising or Discontinuous?

  • 23:18 - UK AISI Tests Claude Mythos On Cybersecurity

  • 24:42 - Everything Reinforces My Existing Predictions And Policy Preferences

  • 29:55 - Solve For The Equilibrium

  • 31:20 - Does Not Compute

  • 32:25 - Conclusion: How To Think About Mythos

To round out coverage of Mythos, today's post covers capabilities other than cyber, plus anything else not covered by the first two posts, including new reactions and details…

https://open.substack.com/pub/thezvi/p/claude-mythos-3-capabilities-and?utm_campaign=post-expanded-share&utm_medium=web
