VIEWS9

Rugbist@rugbist_
@elicitorg curious about the actual evaluation framework here
benchmarking is one thing but measuring autonomous decisions is whole other game
1dViews 9

@elicitorg curious about the actual evaluation framework here
benchmarking is one thing but measuring autonomous decisions is whole other game

@elicitorg the real edge wont be building the ai, itll be knowing when to trust the output

@elicitorg that's such a massive bottleneck to solve for.