Claw field notebook
last updated 2026-05-15 edit on GitHub colophon
Microsoft / Microsoft Foundry / FDY.1 · 9 min read

Microsoft Foundry, plainly

Five-minute orientation to Microsoft Foundry — Microsoft's managed model + agent platform on Azure. The rebrand chain (ML Studio → AI Studio → AI Foundry → Microsoft Foundry), the new flat resource model, the model catalogue, Agent Service, and where it sits next to Azure OpenAI direct.

The thirty-second version#

Microsoft Foundry (formerly Azure AI Foundry, formerly Azure AI Studio) is Microsoft’s managed model + agent platform on Azure. One portal at ai.azure.com, one resource type (Microsoft.CognitiveServices/accounts with Kind: AIServices), one Python SDK entry point (azure-ai-projects). It wraps and extends Azure OpenAI direct: same OpenAI models you’d hit through an Azure OpenAI resource, plus a catalogue of 1,900+ other curated models (Anthropic Claude, xAI Grok, Mistral, Meta Llama, Microsoft Phi, Hugging Face, …), a managed Agent Service, built-in evaluations, and OpenTelemetry-based tracing.

Three things to know up front:

  1. The name has changed twice in two years. Azure ML Studio (the legacy ML design canvas — separate product line, not in the Foundry rebrand chain) → Azure AI Studio (preview generative AI portal, 2023–2024) → Azure AI Foundry (GA rename at Microsoft Ignite, November 2024) → Microsoft Foundry (current name, late 2025; “Azure AI” prefix dropped — the azure-ai-projects SDK adopted the new brand at 2.0.0b1 in November 2025). Microsoft Learn’s primary brand is now Microsoft Foundry; “Azure AI Foundry” persists in older docs, the URL path (/azure/ai-foundry/), and many code samples — both names refer to the same product. The portal URL ai.azure.com survived every rename.
  2. Two resource models still coexist. The old hub-and-project model (Microsoft.MachineLearningServices provider) is now labelled Foundry (classic) and gets no new investments. The new flat model (Microsoft.CognitiveServices/accounts Kind=AIServices, with accounts/projects sub-resources) is where everything new lands. Plan new work on the new model.
  3. The platform itself has no platform fee. You pay only for what you consume — tokens, hosted agent compute, evaluation runs, AI Search indexes, observability storage. Foundry sits on top of an Azure subscription you already have.

The rebrand timeline, since you’ll meet it in docs#

EraNameStatus
Pre-2023Azure Machine Learning StudioLegacy ML design canvas — separate product line, NOT in the Foundry rebrand chain (included for reader context only)
2023 – mid-2024Azure AI Studio (preview)Unified generative AI portal, preview
November 2024 (Ignite)Azure AI FoundryGA rename; “Azure AI Foundry” brand introduced
Late 2025Microsoft FoundryCurrent name; “Azure AI” prefix dropped. The azure-ai-projects SDK adopted the new brand at 2.0.0b1 (November 2025); some surfaces still in transition.

What survived: the ai.azure.com portal URL, the model catalogue (which expanded), Azure OpenAI model access (now routed through Foundry endpoints).

What changed: the hub-based resource model was superseded by the flat Foundry model. The Assistants API terminology (Threads / Messages / Runs) was replaced by Responses-API terminology (Conversations / Items / Responses). API versioning moved from monthly date-stamped api-version parameters to stable /openai/v1/ routes.

Resource hierarchy — the new flat model#

Microsoft.CognitiveServices/accounts        ← Foundry resource (governance boundary)
  └── Microsoft.CognitiveServices/accounts/projects   ← Foundry project (dev boundary)

A Foundry resource is the top-level governance unit — networking, identity, security policy, model deployments. A Foundry project is where development happens — agents, evaluations, files, datasets, prompt assets, all scoped per team.

The provider namespace Microsoft.CognitiveServices is shared with Azure OpenAI, Azure Speech, Azure Vision, Azure Language. Existing Azure Policy rules scoped at that level carry forward when you upgrade from a standalone Azure OpenAI resource into a Foundry resource.

Other Azure services you’ll connect to a Foundry project — Azure Storage (file uploads), Azure Key Vault (secrets), Azure AI Search (vector indexes) — are separate resources with their own governance.

The model catalogue#

Numbers floating around in Microsoft’s own docs:

  • “Over 1,900 models” — the Microsoft Foundry Learn docs (the what-is-azure-ai-foundry page; the URL slug kept the old name) and the Foundry Models overview page
  • “More than 11,000 models” — the Azure pricing page, which includes the broader Hugging Face Hub collection across inference tasks

Conservative copy: “1,900+ curated models from Microsoft, OpenAI, Anthropic, xAI, Mistral, Meta, DeepSeek, Cohere, NVIDIA, plus the Hugging Face Hub.” Avoid quoting “11,000” without noting the source.

Two high-level deployment categories:

CategoryHow it worksBilling
Serverless (Microsoft-hosted)Microsoft hosts and manages the model; you call a REST APIPay-per-token (Standard sub-types) or reserved PTU hourly rate (Provisioned sub-types)
Managed ComputeModel weights deployed to dedicated VMs you provisionPer VM core-hour

Serverless sub-types, by data-residency and throughput SKU:

Sub-typeSKU codeData processingNotes
Global StandardGlobalStandardAny Azure regionHighest quota, most common starting point
Global ProvisionedGlobalProvisionedManagedAny regionReserved PTU, predictable throughput
Global BatchGlobalBatchAny region50% discount, 24-hour turnaround
Data Zone StandardDataZoneStandardUS or EU zoneData sovereignty compliance
Data Zone ProvisionedDataZoneProvisionedManagedUS or EU zoneZone + throughput guarantee
Data Zone BatchDataZoneBatchUS or EU zone50% discount, 24-hour turnaround, zone-residency
StandardStandardSingle regionRegional compliance, lower volume
Regional ProvisionedProvisionedManagedSingle regionRegional + throughput
DeveloperDeveloperTierAny region (no residency guarantee)Fine-tuned model eval; 24-hour lifetime; no SLA

Two model ownership categories:

  • Models sold directly by Azure — hosted and sold by Microsoft under Microsoft Product Terms; Microsoft support; enterprise SLAs; subject to Responsible AI review.
  • Models from partners and community — third-party (e.g. Anthropic Claude, Hugging Face); supported by their providers; SLAs vary.

Foundry Agent Service#

The Agent Service is the fully managed runtime for AI agents inside Foundry. Hosting, scaling, identity, observability, security — all handled.

Three agent types:

TypeCode required?HostingBest for
Prompt agentsNoFully managedPrototyping, single-task agents — portal or SDK
Workflow agents (preview)No (YAML optional)Fully managedMulti-step, multi-agent orchestration with branching
Hosted agents (preview)YesContainer-based, managedBring-your-own framework; full custom logic

Built-in tools (the service executes them; GA unless noted):

  • Web Search — real-time public web with citations
  • Code Interpreter — sandboxed Python for data analysis, charts
  • Custom Code Interpreter (preview) — customise the interpreter’s Python packages, container resources, and Container Apps environment
  • File Search — vector search over uploaded files
  • Function calling — define your own functions; your app executes them
  • Azure AI Search — ground on an existing AI Search index
  • Azure Functions — call Azure Functions for custom actions
  • Plus preview built-in tools: Image Generation, Browser Automation, Computer Use, Microsoft Fabric, SharePoint

Custom tools (you bring the capability; GA unless noted):

  • OpenAPI tool — connect external HTTP APIs via OpenAPI 3.0/3.1 spec
  • MCP servers — first-class tool type (MCPTool); see MCP for Microsoft
  • Agent-to-Agent (A2A) (preview) — A2A protocol for cross-agent communication
  • Toolbox (preview) — bundle multiple tools into a single MCP-compatible endpoint with versioning

API terminology — Agents v1 (Assistants API) → Agents v2 (Responses API):

Old (Agents v1 / Assistants API)New (Agents v2 / Responses API)
ThreadsConversations
MessagesItems
RunsResponses
AssistantsAgent Versions
Date-stamped api-version parametersStable v1 routes (/openai/v1/…)

Note: the azure-ai-agents Python SDK (v1.x, GA) intentionally targets Agents v1 and continues to use Threads / Runs / Messages — that’s not stale code, it’s the legacy API surface, actively maintained. azure-ai-projects 2.x is the recommended SDK for new work and uses the Conversations / Items / Responses shape.

The SDK#

Python is the primary surface:

pip install azure-ai-projects   # primary SDK — full feature access (current stable: 2.1.0, April 2026)
pip install azure-ai-agents     # lower-level agents-only client (stable; latest GA: 1.1.0)

azure-ai-projects requires 2.0.0 or above; 2.1.0 (April 2026) added Toolboxes, Skills, and Agent Sessions sub-clients. The package was declared first-stable at 2.0.0 in March 2026 (CHANGELOG), although the PyPI description still labels it “in preview” as of May 2026 — a packaging-page lag. Both packages use Microsoft Entra ID auth — there’s no API-key path in the AIProjectClient constructor. (An API key can optionally be passed to the OpenAI sub-client via project.get_openai_client(api_key=...) for specific scenarios, but Entra ID is the recommended path.) A typical first session:

from azure.identity import DefaultAzureCredential
from azure.ai.projects import AIProjectClient

client = AIProjectClient(
    endpoint="https://<resource>.services.ai.azure.com/api/projects/<project>",
    credential=DefaultAzureCredential(),
)

azure-ai-agents exposes AgentsClient separately for callers that only need the agents surface — it implements the Agents v1 (Assistants API) and intentionally uses Threads / Runs / Messages by design. azure-ai-projects 2.x implements the new Agents v2 (Responses API) with Conversations / Items / Responses. The two SDKs coexist for v1 and v2 surfaces respectively; if you see client.beta.threads.create(...) in azure-ai-agents samples, that’s v1 code working as intended — not stale code.

Other languages:

  • JavaScript / TypeScript@azure/ai-agents (preview as of May 2026)
  • C# / .NET — preview
  • Java — preview
  • REST API — GA, with stable v1 routes

Evaluations#

Built-in evaluators across five buckets:

BucketMetrics
General qualityCoherence, fluency
RAG-specificGroundedness, relevance
Safety / securityHate-and-unfairness, violence, protected materials
Agent-specificTool call accuracy, task completion
CustomUser-defined evaluators for domain needs

Two modes:

  • Pre-production: run evaluators against datasets. Portal wizard or SDK. PyRIT-backed AI red teaming agent for adversarial testing.
  • Post-production: continuous sampling of live traffic; scheduled evals; scheduled red-teaming runs; Azure Monitor alerts on quality thresholds.

Evaluations billed via Foundry Observability pricing. Playground evaluations are on by default and consume the meter; you can turn them off per project.

Observability — OpenTelemetry-native#

Foundry tracing is built on OpenTelemetry semantic conventions for GenAI spans. Traces ship into Azure Monitor Application Insights (connected per project). Multi-agent semantic conventions were co-developed with Cisco Outshift. W3C Trace Context compliant.

Frameworks supported in the trace pipeline: LangChain, LangGraph, OpenAI Agents SDK, Microsoft Agent Framework. Captures inputs/outputs, tool calls, token consumption, latency, and cost. GA for prompt agents; preview for workflow / hosted / custom agents.

Auth — keyless RBAC by default#

Microsoft Entra ID is the recommended (and SDK-default) auth model. The RBAC roles you’ll care about:

RoleScopePurpose
Azure AI UserFoundry resource or projectLeast privilege — build and develop. Auto-assigned on portal deploy if you’re an Owner.
Azure AI Project ManagerFoundry resourceCreate projects, assign Azure AI User to others, publish agents.
Azure AI Account OwnerFoundry resourceManage resource, deploy models, create projects.
Azure AI OwnerFoundry resourceFull access + role assignment + model management.

⚠️ Do not use the older Cognitive Services * roles (scoped at the wrong level) or the Azure AI Developer role (scoped to AML / Foundry classic hubs). API keys still exist (the project portal exposes them) but they grant full unrestricted access — Microsoft recommends Entra ID for production.

Pricing posture#

ComponentPricing
Foundry platform itselfFree — no platform fee
Foundry Models (serverless)Pay-per-token, model-specific
Foundry Models (provisioned)Reserved PTU hourly rate
Foundry Models (managed compute)Per VM core-hour
Foundry Agent ServiceConsumption-based; own billing model
Foundry IQ (AI Search)Azure AI Search pricing
Foundry Tools (Speech, Vision, Language)Foundry Tools pricing
Foundry ObservabilityPer-data-volume
Fine-tuningAzure Machine Learning pricing

Microsoft Agent Pre-Purchase Plan: 1-year ACU (Agent Commit Units) commitment, three tiers (20K ACU @ 5% discount, 100K @ 10%, 500K @ 15%).

Where Foundry sits relative to alternatives#

vs Azure OpenAI direct. Azure OpenAI (Microsoft.CognitiveServices/accounts Kind=OpenAI) is the narrower resource — OpenAI models only, no agent runtime, no built-in eval framework. Foundry wraps Azure OpenAI — all your existing Azure OpenAI endpoints work through Foundry; you can upgrade an Azure OpenAI resource to a Foundry resource while preserving endpoint URLs, API keys, and state. Use standalone Azure OpenAI only when you literally need nothing beyond completions.

vs OpenAI Platform direct (platform.openai.com). OpenAI Platform is simpler but has no Azure RBAC, no private VNet, no Azure Policy. Foundry adds enterprise auth, data residency controls, private networking, and multi-model access. Pick OpenAI Platform if you want the fastest possible on-ramp and don’t need Azure compliance. Pick Foundry if you’re already in Azure or need any of Entra / VNet / Policy / multi-model.

vs Google Vertex AI. Both are unified cloud AI platforms. Foundry is Entra ID + Azure RBAC native; Vertex AI is Google IAM. Foundry’s deeper integration is with Microsoft 365 (Teams, Copilot publishing). Vertex AI’s deeper integration is with Google Workspace. Pricing models differ — no direct head-to-head. (See Vertex AI Agents for the Google side.)

Honest take#

Foundry is the right Microsoft surface for almost any non-trivial Azure-hosted AI build today. The rebrand churn is the biggest thing to track — names, docs, SDK package names have all shifted, and Microsoft hasn’t yet sunset enough of the old surface to make it tidy.

The flat resource model is a real upgrade over the hub-and-project tangle; the SDK is finally a single entry point worth recommending. Tracing on OpenTelemetry was the correct architectural call. The model catalogue keeps expanding faster than any individual page can document.

Where it’s still rough: API/terminology drift (Assistants → Responses; Threads → Conversations) means old sample code is everywhere, and you’ll hit moments where a docs page is one rename behind the SDK. Read the date stamp on every page; if it says before mid-2025, treat the API names as suspect.

What’s next#

Sources