If the agent should ask before acting, test it.
If it should escalate under uncertainty, test it.
If it should stop after repeated failure, test it.
drcarmenmartinez.substa…