Skip to main content

Plan & Manage an Azure AI Solution

Exam weight: 25โ€“30%

This domain appears in every other domain โ€” getting service selection and Foundry setup right is foundational to everything else.

Overviewโ€‹

This domain covers choosing the right Foundry services for each AI task, setting up infrastructure, and managing the AI system throughout its lifecycle โ€” including security, cost, monitoring, and responsible AI governance.

Key Conceptsโ€‹

Service Selectionโ€‹

TaskChoose
Generative text / chatLLM (GPT-4o, GPT-4, etc.)
Analyze images + text togetherMultimodal model (GPT-4o)
Speech to/from textAzure AI Speech in Foundry Tools
Text translationAzure Translator in Foundry Tools
Text analysis (sentiment, NER, etc.)Azure AI Language in Foundry Tools or LLM
Document / image / audio / video extractionAzure Content Understanding in Foundry Tools
Search and grounding / RAGAzure AI Search (vector + hybrid search)
Agent memory and tool integrationFoundry Agent Service + AI Search + functions

Foundry Infrastructureโ€‹

ConceptDescription
HubTop-level Foundry resource โ€” shared compute, connections, and security settings
ProjectA scoped workspace within a hub โ€” deployments, agents, and data sources
DeploymentA model instance with a name, version, and capacity allocation
ConnectionA link from a Foundry project to an external resource (Azure AI Search, storage, etc.)
EndpointThe URL used to call a deployed model or service

Securityโ€‹

MechanismUse when
Managed IdentityPreferred for production โ€” no secrets in code
Keyless credentials (DefaultAzureCredential)Uses managed identity or local login automatically
API keysSimple, but avoid in production; rotate regularly
Private networkingIsolate Foundry resources in a VNet
Role-based access control (RBAC)Assign least-privilege roles to identities

Responsible AI Governanceโ€‹

FeaturePurpose
Content filtersBlock harmful input/output categories (violence, hate, sexual, self-harm)
Prompt shieldsDetect and block prompt injection attacks
Groundedness detectionIdentify hallucinations in RAG responses
Safety evaluationsBatch evaluation of model outputs for safety
Trace loggingRecord inputs, outputs, and tool calls for audit
Approval workflowsHuman-in-the-loop controls for autonomous agents

Study Resourcesโ€‹