LLM APIs need to return cost information in their response alongside tokens
literally everyone is using models[dot]dev data to approximate this - we see so many reqs to its api
but this is just sticker pricing, won't reflect discounts, etc so it doens't really work