4/ The key technical idea is conformal risk control.
Instead of treating judge scores as perfectly reliable, CARE uses a small labeled calibration set to select thresholds that control the expected risk of missed errors.
The target risk level is chosen by the system builder.
