Today we're bringing Mercury 2 to @Baseten.
Mercury 2 delivers over 1,000 tokens per second for customers on @NVIDIA GPUs with the reliability and scale enterprise teams need.
Read more to see how @augmentcode is using Mercury 2 in production reducing costs by 90% and latency by 82%. More customer stories across coding agents, real-time voice, and enterprise search dropping soon.
http://x.com/i/article/2065085903345754113



