What is Inference?

Inference is the process of using a trained model to process new inputs and produce outputs. Every time you chat with ChatGPT, that’s inference. Inference costs are typically charged per token, with input and output tokens priced differently. Optimizing inference costs is key to AI app profitability.
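To make per-token pricing concrete, here is a minimal Python sketch of the arithmetic for a single request. The function name, its parameters, and the prices in the usage example are hypothetical placeholders, not any provider's actual API or rates. The sketch assumes prices are quoted per million tokens; check your provider's pricing page for real figures and units.

```python
def estimate_inference_cost(
    input_tokens: int,
    output_tokens: int,
    input_price_per_million: float,
    output_price_per_million: float,
) -> float:
    """Estimate the cost of one request, in the same currency as the prices.

    Input and output tokens are billed at separate rates (assumed here to be
    quoted per million tokens).
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")

    input_cost = input_tokens / 1_000_000 * input_price_per_million
    output_cost = output_tokens / 1_000_000 * output_price_per_million
    return input_cost + output_cost


# Hypothetical prices for illustration only: 1.00 per million input tokens,
# 3.00 per million output tokens.
cost = estimate_inference_cost(
    input_tokens=1_000,
    output_tokens=500,
    input_price_per_million=1.00,
    output_price_per_million=3.00,
)
print(f"Estimated cost per request: {cost:.4f}")
```

Multiplying the per-request estimate by expected request volume gives a rough monthly figure, which shows where shorter prompts or shorter outputs would save the most.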