🚀Promptflow 1.13.0 has released! Try new feature: tracing interaction with LLMs.

OpenAI GPT-4V#

Introduction#

OpenAI GPT-4V tool enables you to leverage OpenAI’s GPT-4 with vision, also referred to as GPT-4V or gpt-4-vision-preview in the API, to take images as input and answer questions about them.

Prerequisites#

Create OpenAI resources

Sign up account OpenAI website Login and Find personal API key
Get Access to GPT-4 API

To use GPT-4 with vision, you need access to GPT-4 API. Learn more about How to get access to GPT-4 API

Connection#

Setup connections to provisioned resources in prompt flow.

Type	Name	API KEY
OpenAI	Required	Required

Inputs#

Name	Type	Description	Required
connection	OpenAI	the OpenAI connection to be used in the tool	Yes
model	string	the language model to use, currently only support gpt-4-vision-preview	Yes
prompt	string	The text prompt that the language model will use to generate it’s response.	Yes
max_tokens	integer	the maximum number of tokens to generate in the response. Default is 512.	No
temperature	float	the randomness of the generated text. Default is 1.	No
stop	list	the stopping sequence for the generated text. Default is null.	No
top_p	float	the probability of using the top choice from the generated tokens. Default is 1.	No
presence_penalty	float	value that controls the model’s behavior with regards to repeating phrases. Default is 0.	No
frequency_penalty	float	value that controls the model’s behavior with regards to generating rare phrases. Default is 0.	No
detail	string	control over how the model processes the image and generates its textual understanding, default is “auto”. Read more	No

Outputs#

Return Type	Description
string	The text of one response of conversation