[HAI5016] Week 4: JSON and API's

Posted Sep 26, 2024

HAI5016 Week 4 - Human-AI interaction in Data Science (by Dall-e)

By Pim Kamphuis

3 min read

This week’s class is about API’s and our dear friend JSON. After the student’s presentations, we will deploy our Azure OpenAI instance and request increase of the 1K token quota that Microsoft enforces on student subscriptions by default.

Before following the instructions below, make sure that your Azure for Students Subscription is active.

1. Deploy an Azure OpenAI instance in Azure

Open Azure OpenAI studio via https://oai.azure.com/
Create an Azure OpenAI resource:
- Subscription: Azure for Students
- Resource group: click Create new -> HAI5016
- Region: Keep the East US option selected
- Name: Make a name starting with firsthai- and then some random characters. Like firsthai-hrn8c29
- Pricing tier: Select the Standard S0 option
- For (2) network, (3) Tags and (4) Review + Submit, just click Next and Create
  Azure OpenAI resources are constrained by regional quotas. In the event of a quota limit being reached in the exercise, there’s a possibility you may need to create another resource in a different region. Select randomly from Australia East, Canada East, East US, East US 2, France Central, Japan East, North Central US, Sweden Central, Switzerland North or UK South.

2. Deploy a chat completion and embedding model in Azure OpenAI Studio

We’re going to use two Large Language models in this course: (1) a text embedding model to vectorize the text in our own data so it can be indexed efficiently for use in grounding prompts and (2) a GPT model that your application can use to generate responses to prompts that are grounded in our data.

Open the Azure OpenAI Studio landing page
On the Azure OpenAI Studio landing page, find Deployments link under the Shared Resources section in the left navigation menu. Click + Deploy Model and select Deploy base model.
Find the Chat Completion model gpt-4o-mini and click Confirm.
Deploy the model with the following settings:

Deployment name: gpt-4o-mini
Model version: Select the latest version (2024-07-18 as the moment of writing)
Deployment type: Global Standard
Content filter: Default or Default V2
Enable dynamic quota: Enabled
Here we will select a model that has a good balance between pricing and performance. For the pricing details of the various models, see Azure OpenAI pricing

Then, create a new base model deployment of the text-embedding-3-small model with the following settings:

Deployment name: text-embedding-3-small
Model: text-embedding-3-small
Model version: The default version
Deployment type: Standard
Tokens per minute rate limit: 120K
Content filter: Default
Enable dynamic quota: Enabled

3. Request a quota increase (updated Apr 13th, 2026)

Because it seems that student subscriptions have limited deployment options by default, we have to request for Quota Increase in order to use the OpenAI (or Anthropic) models that I have planned to use in the exercises. The quota request process can take up to 24-48 hours, so make sure to do this as soon as possible.

Go to the landing page of your Microsoft Foundry at https://ai.azure.com/
Make sure that New Foundry is selected
Click on Operate in the top menu
Click on Quota in the left menu
Click on the Request quota button in the top right corner of the page.

Fill in the quota request form with the following information:

Your first name
Your last name
Company Email: use your @g.skku.edu or @skku.edu email address
Company Name: Sungkyunkwan University
Company Address: 25 , Sungkyunkwan-Ro
Company City: Seoul
Company Postal Code: 110-745
Company Country: South Korea
Subscription Id: This is the ID of your Azure for Students subscription. This ID can be found in the URL-bar of your Azure OpenAI Studio tab, or find it in the subscriptions blade on Azure Portal.

Make sure to copy the correct and full subscription ID, as the quota request will be rejected if the subscription ID is incorrect.
Justification: here you can write

   Need to deploy a GPT model to follow along the tutorial which is used in my class (https://learn.microsoft.com/en-us/azure/foundry/quickstarts/get-started-code)

Model type: Azure OpenAI
Model Deployment Quota or Fine Tuning Quota: Model Deployment
Quota Request Type: Global Standard
Global Standard Region: Select the region where you deployed your Azure Foundry resource. You can find this in your azure portal or in the project settings on top of the Foundry page.
Global Standard Model: gpt-5-nano
Global Standard Quota: 400

References

Manage and increase quotas for resources with Microsoft Foundry (Foundry projects)

SKKU, HAI5016

This post is licensed under CC BY 4.0 by the author.

1. Deploy an Azure OpenAI instance in Azure

2. Deploy a chat completion and embedding model in Azure OpenAI Studio

3. Request a quota increase (updated Apr 13th, 2026)

Fill in the quota request form with the following information:

References

Trending Tags