Joint work with @prpaskov @S_OhEigeartaigh @NandoMartinezP @katie_m_collins @FazlBarez, Jonathan Prunty, Matteo Mecattaf, @zfountas @RistoUuk @sanmikoyejo @CUdudec, José Hernández-Orallo
Paper website→https://cl-eval.github.io/
Pointers to related work & questions welcome🙏
Combining a pre-deployment trajectory sandbox with live predictive monitoring is a feasible alternative to continuously re-evaluating evolving systems.
Both can be layered effectively with input/output filters, transparent evolution methods, and broad indicators of CL systems' impacts on society.
