Maksym Andriushchenko of the ELLIS Institute Tübingen releases PostTrainBench to evaluate how well autonomous AI agents can post-train models · Digg