The Don’t Worry About the Vase Podcast is a listener-supported podcast.
00:00 - Introduction
02:05 - Epoch Capabilities Index (ECI) (Model Card 2.3.6)
04:52 - What Do You Mean Verbalized Evaluation Awareness Is Going Down
05:43 - Capabilities (Model Card Section 6)
08:38 - Agentic Safety Benchmarks (8.3)
10:57 - Is Mythos AGI?
11:56 - Are AI Companies Using Warnings As Hype?
12:56 - Impressions (Model Card Section 7)
16:13 - Blatant Denials Are The Best Kind
17:13 - Prompt Injection Robustness
18:21 - Does Mythos Cross The New Knowledge Threshold?
19:10 - Is Mythos Surprising or Discontinuous?
23:18 - UK AISI Tests Claude Mythos On Cybersecurity
24:42 - Everything Reinforces My Existing Predictions And Policy Preferences
29:55 - Solve For The Equilibrium
31:20 - Does Not Compute
32:25 - Conclusion: How To Think About Mythos




