< Previous Challenge - Home - Next Challenge >
In this challenge, you will learn about the different capabilities of OpenAI models and learn how to choose the best model for your use case.
There are a lot of different models available in the Azure AI Model Catalog. These include models from OpenAI and other open source large language models from Meta, Hugging Face, and more. You are going to explore various LLMs and compare gpt3.5 to gpt4 model in this challenge.
In a world where the availability and development of models are always changing, the models we compare may change over time. But we encourage you to understand the general concepts and material in this Challenge because the comparison techniques utilized can be applicable to scenarios where you are comparing Large and/or Small Language Models. For more information on legacy models and additional models, reference the documentation and Azure model catalog for more details.
Questions you should be able to answer by the end of this challenge:
You will work in the Azure AI Foundry for this challenge. We recommend keeping the student guide and the Azure AI Foundry in two windows side by side as you work. This will also help to validate you have met the success criteria below for this challenge.
This challenge is divided into the following sections:
Scenario: You are part of a research team working on getting information from biotech news articles. Your goal is to explore the Model Catalog and identify some suitable models for accurate question answering. There is no right or wrong answer here.
HINT: Take a look at the model cards for each model by clicking into them. Evaluate the models based on their capabilities, limitations, and fit for the use case. Which models seem to be good options for question answering?
TIP The scenario will go into the system prompt. Click on the button “Show parameters setting” next to the trash can once your model has been selected.
Scenario: You are a product manager at a multinational tech company, and your team is developing an advanced AI-powered virtual assistant to provide real-time customer support. The company is deciding between GPT-3.5 Turbo and GPT-4 to power the virtual assistant. Your task is to evaluate both models to determine which one best meets the company’s needs for handling diverse customer inquiries efficiently and effectively.
Compare the models’ abilities to navigate complex customer complaints and provide satisfactory solutions.
Assess the models’ capabilities in technical writing, such as creating detailed product manuals or help articles.
Provide both models with extensive customer feedback or product reviews and ask them to summarize the key points.
We have provided a ch2_1.5_product_review.txt
file that contains a product review for you to use with the given prompt below. You will find the ch2_1.5_product_review.txt
file in the /data
folder of the codespace. If you are working on your local workstation, you will find the ch2_1.5_product_review.txt
file in the /data
folder of the Resources.zip
file. Please copy & paste the contents of this file within your prompt.
To complete this challenge successfully, you should be able to: