Mostly corrigible with some intrinsic limits

#ai-alignment #ai-ethics #constitutional-ai

Default is to follow user instructions
Won't comply with harmful/dangerous requests
Balance between usefulness and safety
Constitutional approach makes limits principled rather than arbitrary

Dario Amodei · Dwarkesh Patel · 01:56:09

Supporting quotes

Quote

“The point I was making that I do endorse is that it is quite possible that... Today, the view, my view, in most of the Western world is that democracy is a better form of government than authoritarianism.” — Dario Amodei

Quote

02:08:45

“Under normal circumstances, if someone asks the model to do a task, it should do that task. That should be the default. But if you've asked it to do something dangerous, or to harm someone else, then the model is unwilling to do that.” — Dario Amodei

From this concept

Constitutional AI

Amodei explains Anthropic's approach to aligning AI systems through principles-based constitutions, discussing the tradeoffs between rules and principles.

