Our first commercial TTS model was optimized for WER and SSIM because that’s what research had taught us over years to be the standard metrics. The first customer feedbacks we had unveiled the huge blind spots of these metrics, in particular on naturalness, rhythm, emphasis, question intonation, etc. Now our internal eval has dozens of criteria monitored on each model.
Original post
Matt Turck#1497
Neil Zeghidour@neilzegh
9:56 AM · Jun 6, 2026 · 6.2K Views
Sentiment
Sentiment building, check back later.
Cluster Engagement

