Performance Optimization

Get the most out of our APIs with these tips.

Batch Requests

Cut latency by sending multiple prompts at once:

data = {"prompts": ["Tell me a story.", "Write a poem."]}
response = requests.post(url, json=data, headers=headers)

Save time and credits by caching responses: