cache_control
field, so SDK-based implementations cannot utilize this feature.cache_control
parameter to any message content you want to cache:
usage
in the response (or message_start
event if streaming
):
cache_creation_input_tokens
: Tokens written to the cache when creating a new entrycache_read_input_tokens
: Tokens retrieved from the cache for this request