Mirror of https://github.com/wavetermdev/waveterm.git (synced 2025-11-28 05:00:26 +08:00)
OpenAI Request Input Field Structure (On-the-Wire Format)
This document describes the actual JSON structure sent to the OpenAI API in the input field of OpenAIRequest.
Overview
The input field is a JSON array in which each element is one of three object types:
- Messages (user/assistant): OpenAIMessage objects
- Function Calls (tool invocations): OpenAIFunctionCallInput objects
- Function Call Results (tool outputs): OpenAIFunctionCallOutputInput objects
These are converted from the internal OpenAIChatMessage format and cleaned before transmission (see lines 485-494).
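The three object kinds can be told apart by their fields alone. The sketch below is illustrative only: classify_input_item is a hypothetical helper, not part of the Wave Terminal code, and it assumes message objects carry role and content without a type field, as in the examples in this document.

```python
from typing import Any, Dict


def classify_input_item(item: Dict[str, Any]) -> str:
    """Return which of the three documented object kinds an input item is.

    Hypothetical helper: the rules below are inferred from the JSON
    examples in this document, not taken from the real implementation.
    """
    item_type = item.get("type")
    if item_type == "function_call":
        return "function_call"
    if item_type == "function_call_output":
        return "function_call_output"
    # Messages are identified by a role plus an array of content blocks.
    if item.get("role") in ("user", "assistant") and isinstance(item.get("content"), list):
        return "message"
    raise ValueError(f"unrecognized input item: {item!r}")


if __name__ == "__main__":
    sample = {"role": "user", "content": [{"type": "input_text", "text": "hi"}]}
    print(classify_input_item(sample))
```

A message whose content is a plain string is rejected here on purpose, since the format described in this document always uses an array of content blocks.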
1. Message Objects (User/Assistant)
User and assistant messages are sent as OpenAIMessage:
{
"role": "user",
"content": [
{
"type": "input_text",
"text": "Hello, analyze this image"
},
{
"type": "input_image",
"image_url": "..."
}
]
}
Key Points:
- role: Always "user" or "assistant"
- content: Always an array of content blocks (never a plain string)
Content Block Types
Text Block
{
"type": "input_text",
"text": "message content here"
}
Image Block
{
"type": "input_image",
"image_url": "data:image/png;base64,..."
}
- image_url: Can be a data URL or an https:// URL
- The filename field is removed during cleaning
PDF File Block
{
"type": "input_file",
"file_data": "JVBERi0xLjQKJeLjz9M...",
"filename": "document.pdf"
}
- file_data: Base64-encoded PDF content
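As a rough illustration of how the text, image, and PDF blocks above could be assembled, here is a minimal sketch. All function names are hypothetical, and the data-URL layout follows the image example in this document rather than any confirmed implementation detail.

```python
import base64
from typing import Any, Dict, Iterable


def text_block(text: str) -> Dict[str, Any]:
    """Build an input_text content block."""
    return {"type": "input_text", "text": text}


def image_block(data: bytes, mime: str = "image/png") -> Dict[str, Any]:
    """Build an input_image block whose image_url is a base64 data URL."""
    encoded = base64.b64encode(data).decode("ascii")
    return {"type": "input_image", "image_url": f"data:{mime};base64,{encoded}"}


def pdf_block(data: bytes, filename: str) -> Dict[str, Any]:
    """Build an input_file block carrying base64-encoded PDF bytes."""
    return {
        "type": "input_file",
        "file_data": base64.b64encode(data).decode("ascii"),
        "filename": filename,
    }


def user_message(blocks: Iterable[Dict[str, Any]]) -> Dict[str, Any]:
    """Wrap content blocks in a user message; content is always a list."""
    return {"role": "user", "content": list(blocks)}


if __name__ == "__main__":
    msg = user_message([text_block("Hello, analyze this image"), image_block(b"\x89PNG")])
    print(msg["content"][1]["image_url"][:22])
```

Note that file_data holds bare base64 text, while image_url holds a full data URL with a MIME prefix; the two fields are not interchangeable.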
Function Call Block (in assistant messages)
{
"type": "function_call",
"call_id": "call_abc123",
"name": "search_files",
"arguments": {"query": "test"}
}
2. Function Call Objects (Tool Invocations)
Tool calls from the model are sent as OpenAIFunctionCallInput:
{
"type": "function_call",
"call_id": "call_abc123",
"name": "search_files",
"arguments": "{\"query\":\"test\",\"path\":\"./src\"}"
}
Key Points:
- type: Always "function_call"
- call_id: Unique identifier generated by the model
- name: Function name to execute
- arguments: JSON-encoded string of parameters
- status: Optional ("in_progress", "completed", "incomplete")
- The internal toolusedata field is removed during cleaning
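Because arguments is a JSON-encoded string rather than a nested object, the parameters have to be serialized before they are placed in the item. The following sketch shows one way to do that; function_call_item is a hypothetical helper, and the compact separators are only chosen to match the example above.

```python
import json
from typing import Any, Dict, Optional

# Status values listed in this document.
ALLOWED_STATUSES = ("in_progress", "completed", "incomplete")


def function_call_item(
    call_id: str,
    name: str,
    arguments: Dict[str, Any],
    status: Optional[str] = None,
) -> Dict[str, Any]:
    """Build a function_call item with arguments encoded as a JSON string."""
    item: Dict[str, Any] = {
        "type": "function_call",
        "call_id": call_id,
        "name": name,
        # The nested dict becomes a string, so it is escaped again when
        # the whole request body is serialized.
        "arguments": json.dumps(arguments, separators=(",", ":")),
    }
    if status is not None:
        if status not in ALLOWED_STATUSES:
            raise ValueError(f"unsupported status: {status}")
        item["status"] = status
    return item


if __name__ == "__main__":
    item = function_call_item("call_abc123", "search_files", {"query": "test", "path": "./src"})
    print(json.dumps(item, indent=2))
```

Printing the item with json.dumps shows the double encoding: the inner quotes of arguments appear escaped, as in the example above.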
3. Function Call Output Objects (Tool Results)
Tool execution results are sent as OpenAIFunctionCallOutputInput:
{
"type": "function_call_output",
"call_id": "call_abc123",
"output": "Found 3 files matching query"
}
Key Points:
- type: Always "function_call_output"
- call_id: Must match the original function call's call_id
- output: Can be a text string, an array of image blocks, or a JSON-encoded error string
Output Value Types
Text Output
{
"type": "function_call_output",
"call_id": "call_abc123",
"output": "Result text here"
}
Image Output
{
"type": "function_call_output",
"call_id": "call_abc123",
"output": [
{
"type": "input_image",
"image_url": "data:image/png;base64,..."
}
]
}
Error Output
{
"type": "function_call_output",
"call_id": "call_abc123",
"output": "{\"ok\":\"false\",\"error\":\"File not found\"}"
}
- Error output is a JSON-encoded string containing ok and error fields
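The three output shapes can be produced by small constructors like the ones sketched below. The names are hypothetical, and the error payload simply mirrors the example above, including the string value "false" for ok; whether the real code emits a string or a boolean there is not confirmed by this document.

```python
import json
from typing import Any, Dict, List


def text_output(call_id: str, text: str) -> Dict[str, Any]:
    """Tool result carrying plain text."""
    return {"type": "function_call_output", "call_id": call_id, "output": text}


def image_output(call_id: str, image_urls: List[str]) -> Dict[str, Any]:
    """Tool result carrying an array of input_image blocks."""
    blocks = [{"type": "input_image", "image_url": url} for url in image_urls]
    return {"type": "function_call_output", "call_id": call_id, "output": blocks}


def error_output(call_id: str, message: str) -> Dict[str, Any]:
    """Tool result carrying a JSON-encoded error string.

    Assumption: "ok" is the string "false", copied from the example
    in this document.
    """
    payload = json.dumps({"ok": "false", "error": message}, separators=(",", ":"))
    return {"type": "function_call_output", "call_id": call_id, "output": payload}


if __name__ == "__main__":
    print(error_output("call_abc123", "File not found")["output"])
```

In every case call_id must be the same value the model supplied in the matching function call.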
Complete Example
{
"model": "gpt-4o",
"input": [
{
"role": "user",
"content": [
{
"type": "input_text",
"text": "What files are in src/?"
}
]
},
{
"type": "function_call",
"call_id": "call_xyz789",
"name": "list_files",
"arguments": "{\"path\":\"src/\"}"
},
{
"type": "function_call_output",
"call_id": "call_xyz789",
"output": "main.go\nutil.go\nconfig.go"
},
{
"role": "assistant",
"content": [
{
"type": "output_text",
"text": "The src/ directory contains 3 files: main.go, util.go, and config.go"
}
]
}
],
"stream": true,
"max_output_tokens": 4096
}
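In the example above, the function_call_output item reuses the call_id of the function_call that precedes it. A quick consistency check for that pairing might look like the sketch below; unmatched_call_ids is a hypothetical helper, and the rule that a call must appear before its output is an assumption based on the ordering shown here.

```python
from typing import Any, Dict, List


def unmatched_call_ids(input_items: List[Dict[str, Any]]) -> List[str]:
    """Return call_ids of outputs that have no earlier function_call."""
    seen_calls = set()
    unmatched = []
    for item in input_items:
        item_type = item.get("type")
        if item_type == "function_call":
            seen_calls.add(item["call_id"])
        elif item_type == "function_call_output":
            if item["call_id"] not in seen_calls:
                unmatched.append(item["call_id"])
    return unmatched


if __name__ == "__main__":
    items = [
        {"role": "user", "content": [{"type": "input_text", "text": "What files are in src/?"}]},
        {"type": "function_call", "call_id": "call_xyz789", "name": "list_files",
         "arguments": "{\"path\":\"src/\"}"},
        {"type": "function_call_output", "call_id": "call_xyz789",
         "output": "main.go\nutil.go\nconfig.go"},
    ]
    print(unmatched_call_ids(items))
```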
Cleaning Process
Before transmission, internal fields are removed (cleanup code):
- Messages: previewurl field removed, filename removed from input_image blocks
- Function Calls: toolusedata field removed
- Function Outputs: Sent as-is (no cleaning needed)
This ensures the API receives only the fields it expects.
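The cleaning rules can be sketched as a single pass over each input item. This is not the actual cleanup code: clean_input_item is a hypothetical name, and since this document does not say whether previewurl sits on the message or on its content blocks, the sketch removes it from both.

```python
import copy
from typing import Any, Dict


def clean_input_item(item: Dict[str, Any]) -> Dict[str, Any]:
    """Return a copy of an input item with internal-only fields removed."""
    cleaned = copy.deepcopy(item)  # leave the caller's object untouched
    item_type = cleaned.get("type")
    if item_type == "function_call":
        cleaned.pop("toolusedata", None)
    elif item_type == "function_call_output":
        pass  # sent as-is, no cleaning needed
    elif "role" in cleaned:
        # Assumption: previewurl may appear at either level.
        cleaned.pop("previewurl", None)
        for block in cleaned.get("content", []):
            block.pop("previewurl", None)
            if block.get("type") == "input_image":
                block.pop("filename", None)
    return cleaned


if __name__ == "__main__":
    raw = {
        "role": "user",
        "content": [
            {"type": "input_image", "image_url": "data:image/png;base64,...",
             "filename": "shot.png", "previewurl": "blob:preview"},
            {"type": "input_file", "file_data": "JVBERi0xLjQK", "filename": "document.pdf"},
        ],
    }
    print(clean_input_item(raw))
```

The filename on the input_file block is kept, because only input_image blocks lose that field.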