been thinking about why I still don't feel comfortable with AI being a true personal assistant, even though the models have gotten so much better. i realized a lot of the reason for me is actually git / version control.
using an agent that is 99.999999% accurate vs 97% accurate doesn't really matter to me when the errors are high impact. with code, i can yolo a PR & know that it will not be destructive to my existing codebase. there's no similar staging mechanism for my calendar, shopping cart, etc.
in theory you can create calendar event drafts, but very few of the other useful activities (esp spending money) have a draft / easy review paradigm. the solves here will most likely be on the infra / scaffolding level. once you start making a universal shopping cart draft, you're basically inventing a new web browser, or an agent-first wallet / credit card.
