Challenge 02 - OpenAI Models & Capabilities

< Previous Challenge - Home - Next Challenge >

Introduction

In this challenge, you will learn about the different capabilities of OpenAI models and learn how to choose the best model for your use case.

There are a lot of different models available in the Azure AI Model Catalog. These include models from OpenAI and other open source large language models from Meta, Hugging Face, and more. You are going to explore various LLMs and compare gpt3.5 to gpt4 model in this challenge.

In a world where the availability and development of models are always changing, the models we compare may change over time. But we encourage you to understand the general concepts and material in this Challenge because the comparison techniques utilized can be applicable to scenarios where you are comparing Large and/or Small Language Models. For more information on legacy models and additional models, reference the documentation and Azure model catalog for more details.

Description

Questions you should be able to answer by the end of this challenge:

You will work in the Azure AI Foundry for this challenge. We recommend keeping the student guide and the Azure AI Foundry in two windows side by side as you work. This will also help to validate you have met the success criteria below for this challenge.

This challenge is divided into the following sections:

2.1 Model Discovery

Scenario: You are part of a research team working on getting information from biotech news articles. Your goal is to explore the Model Catalog and identify some suitable models for accurate question answering. There is no right or wrong answer here.

Student Task 2.1

HINT: Take a look at the model cards for each model by clicking into them. Evaluate the models based on their capabilities, limitations, and fit for the use case. Which models seem to be good options for question answering?

2.2 Model Benchmarking

Student Task 2.2

2.3 Model Comparison

Student Task 2.3

TIP The scenario will go into the system prompt. Click on the button “Show parameters setting” next to the trash can once your model has been selected.

Scenario: You are a product manager at a multinational tech company, and your team is developing an advanced AI-powered virtual assistant to provide real-time customer support. The company is deciding between GPT-3.5 Turbo and GPT-4 to power the virtual assistant. Your task is to evaluate both models to determine which one best meets the company’s needs for handling diverse customer inquiries efficiently and effectively.

Student Task 2.3.1: Complex Problem Solving

Compare the models’ abilities to navigate complex customer complaints and provide satisfactory solutions.

Student Task 2.3.2: Creative and Technical Writing

Assess the models’ capabilities in technical writing, such as creating detailed product manuals or help articles.

Student Task 2.3.3: Long Form Content Understanding

Provide both models with extensive customer feedback or product reviews and ask them to summarize the key points.

We have provided a ch2_1.5_product_review.txt file that contains a product review for you to use with the given prompt below. You will find the ch2_1.5_product_review.txt file in the /data folder of the codespace. If you are working on your local workstation, you will find the ch2_1.5_product_review.txt file in the /data folder of the Resources.zip file. Please copy & paste the contents of this file within your prompt.

Success Criteria

To complete this challenge successfully, you should be able to:

Learning Resources