Maksym Andriushchenko and fellow researchers release InferenceBench, a benchmark that evaluates AI agents on open-ended optimization of OpenAI-compatible LLM servers using latency and throughput metrics on H100 GPUs