Deep research should show why answers can be trusted.
Apodex 1.0 sets a new SOTA across deep-research benchmarks, but the full post is worth reading for the ideas behind it: agent team and verification.
Proud of the team.
http://x.com/i/article/2065001558316867584