Update chat completion
POST https://api.fastapi.ai/v1/chat/completions/{completion_id}
Modify a stored chat completion. Only chat completions that have been created with the store parameter set to true can be modified. Currently, the only supported modification is to update the metadata field.
Path parameters
completion_id string Required
The ID of the chat completion to update.
Request body
metadata map string Required
Set of 16 key-value pairs that can be attached to an object. This can be useful for storing additional information about the object in a structured format, and querying for objects via API or the dashboard.
Keys are strings with a maximum length of 64 characters. Values are strings with a maximum length of 512 characters.
Returns
The ChatCompletion object matching the specified ID.
The chat completion object
Represents a chat completion response returned by model, based on the provided input.
id string
A unique identifier for the chat completion.
choices array
A list of chat completion choices. Can be more than one if
nis greater than 1.
finish_reasonstring
The reason the model stopped generating tokens. This will be stop if the model hit a natural stop point or a provided stop sequence,lengthif the maximum number of tokens specified in the request was reached,content_filterif content was omitted due to a flag from our content filters,tool_callsif the model called a tool, orfunction_call(deprecated) if the model called a function.
indexinteger
The index of the choice in the list of choices.
messageobject
A chat completion message generated by the model.
contentstring or null
The contents of the message.
refusalstring or null
The refusal message generated by the model.
tool_callsarray
The tool calls generated by the model, such as function calls.
idstring
The ID of the tool call.
typestring
The type of the tool. Currently, onlyfunctionis supported.
functionobject
The function that the model called.
namestring
The name of the function to call.
argumentsstring
The arguments to call the function with, as generated by the model in JSON format. Note that the model does not always generate valid JSON, and may hallucinate parameters not defined by your function schema. Validate the arguments in your code before calling your function.
rolestring
The role of the author of this message.
function_callDeprecated object
Deprecated and replaced bytool_calls. The name and arguments of a function that should be called, as generated by the model.
argumentsstring
The arguments to call the function with, as generated by the model in JSON format. Note that the model does not always generate valid JSON, and may hallucinate parameters not defined by your function schema. Validate the arguments in your code before calling your function.
namestring
The name of the function to call.
audioobject or null
If the audio output modality is requested, this object contains data about the audio response from the model. Learn more.
idstring
Unique identifier for this audio response.
expires_atinteger
The Unix timestamp (in seconds) for when this audio response will no longer be accessible on the server for use in multi-turn conversations.
datastring
Base64 encoded audio bytes generated by the model, in the format specified in the request.
transcriptstring
Transcript of the audio generated by the model.
idstring
The identifier of the chat message.
logprobsobject or null
Log probability information for the choice.
contentarray or null
A list of message content tokens with log probability information.
tokenstring
The token.
logprobnumber
The log probability of this token, if it is within the top 20 most likely tokens. Otherwise, the value-9999.0is used to signify that the token is very unlikely.
bytesarray or null
A list of integers representing the UTF-8 bytes representation of the token. Useful in instances where characters are represented by multiple tokens and their byte representations must be combined to generate the correct text representation. Can benullif there is no bytes representation for the token.
top_logprobsarray
List of the most likely tokens and their log probability, at this token position. In rare cases, there may be fewer than the number of requestedtop_logprobsreturned.
tokenstring
The token.
logprobnumber
The log probability of this token, if it is within the top 20 most likely tokens. Otherwise, the value-9999.0is used to signify that the token is very unlikely.
bytesarray or null
A list of integers representing the UTF-8 bytes representation of the token. Useful in instances where characters are represented by multiple tokens and their byte representations must be combined to generate the correct text representation. Can benullif there is no bytes representation for the token.
refusalarray or null
A list of message refusal tokens with log probability information.
tokenstring
The token.
logprobnumber
The log probability of this token, if it is within the top 20 most likely tokens. Otherwise, the value-9999.0is used to signify that the token is very unlikely.
bytesarray or null
A list of integers representing the UTF-8 bytes representation of the token. Useful in instances where characters are represented by multiple tokens and their byte representations must be combined to generate the correct text representation. Can benullif there is no bytes representation for the token.
top_logprobsarray
List of the most likely tokens and their log probability, at this token position. In rare cases, there may be fewer than the number of requestedtop_logprobsreturned.
tokenstring
The token.
logprobnumber
The log probability of this token, if it is within the top 20 most likely tokens. Otherwise, the value-9999.0is used to signify that the token is very unlikely.
bytesarray or null
A list of integers representing the UTF-8 bytes representation of the token. Useful in instances where characters are represented by multiple tokens and their byte representations must be combined to generate the correct text representation. Can benullif there is no bytes representation for the token.
created integer
The Unix timestamp (in seconds) of when the chat completion was created.
model string
The model used for the chat completion.
service_tier string or null
The service tier used for processing the request.
system_fingerprint string
This fingerprint represents the backend configuration that the model runs with.
Can be used in conjunction with the seed request parameter to understand when backend changes have been made that might impact determinism.
object string
The object type, which is always chat.completion.
usage object
Usage statistics for the completion request.
completion_tokensinteger
Number of tokens in the generated completion.
prompt_tokensinteger
Number of tokens in the prompt.
total_tokensinteger
Total number of tokens used in the request (prompt + completion).
completion_tokens_detailsobject
Breakdown of tokens used in a completion.
accepted_prediction_tokensinteger
When using Predicted Outputs, the number of tokens in the prediction that appeared in the completion.
audio_tokensinteger
Audio input tokens generated by the model.
reasoning_tokensinteger
Tokens generated by the model for reasoning.
rejected_prediction_tokensinteger
When using Predicted Outputs, the number of tokens in the prediction that did not appear in the completion. However, like reasoning tokens, these tokens are still counted in the total completion tokens for purposes of billing, output, and context window limits.
prompt_tokens_detailsobject
Breakdown of tokens used in the prompt.
audio_tokensinteger
Audio input tokens present in the prompt.
cached_tokensinteger
Cached tokens present in the prompt.
{
"id": "chatcmpl-123456",
"object": "chat.completion",
"created": 1728933352,
"model": "gpt-4o-2024-08-06",
"choices": [
{
"index": 0,
"message": {
"role": "assistant",
"content": "Hi there! How can I assist you today?",
"refusal": null
},
"logprobs": null,
"finish_reason": "stop"
}
],
"usage": {
"prompt_tokens": 19,
"completion_tokens": 10,
"total_tokens": 29,
"prompt_tokens_details": {
"cached_tokens": 0
},
"completion_tokens_details": {
"reasoning_tokens": 0,
"accepted_prediction_tokens": 0,
"rejected_prediction_tokens": 0
}
},
"system_fingerprint": "fp_6b68a8204b"
}Example
Request
curl -X POST https://api.fastapi.ai/v1/chat/completions/chat_abc123 \
-H "Authorization: Bearer $FAST_API_KEY" \
-H "Content-Type: application/json" \
-d '{"metadata": {"foo": "bar"}}'Response
{
"object": "chat.completion",
"id": "chatcmpl-AyPNinnUqUDYo9SAdA52NobMflmj2",
"model": "gpt-4o-2024-08-06",
"created": 1738960610,
"request_id": "req_ded8ab984ec4bf840f37566c1011c417",
"tool_choice": null,
"usage": {
"total_tokens": 31,
"completion_tokens": 18,
"prompt_tokens": 13
},
"seed": 4944116822809979520,
"top_p": 1.0,
"temperature": 1.0,
"presence_penalty": 0.0,
"frequency_penalty": 0.0,
"system_fingerprint": "fp_50cad350e4",
"input_user": null,
"service_tier": "default",
"tools": null,
"metadata": {
"foo": "bar"
},
"choices": [
{
"index": 0,
"message": {
"content": "Mind of circuits hum, \nLearning patterns in silence— \nFuture's quiet spark.",
"role": "assistant",
"tool_calls": null,
"function_call": null
},
"finish_reason": "stop",
"logprobs": null
}
],
"response_format": null
}