🎉 Lab Complete: Insurance Claims Processing (ClaimSight Insurance)

Congratulations — you've built, instrumented, evaluated, and deployed a production-ready multi-agent AI system from scratch. Here's what you accomplished.

Recap

| # | Challenge | What You Built |
|---|-----------|----------------|
| 0 | Setup | Provisioned a Microsoft Foundry Resource, project, GPT model deployment, Log Analytics workspace, and Application Insights instance via a single `deploy.sh` script |
| 1 | Build Agents | Created a Claims Triage Agent (assesses document completeness, fraud risk, and policy coverage) and a Claims Decision Agent (recommends approve, fast-track, flag for investigation, or deny, with supporting rationale) |
| 2 | Monitor | Enabled OpenTelemetry GenAI tracing: every model call, tool invocation, and token count is captured as a distributed trace in Application Insights |
| 3 | Evaluate | Ran systematic LLM-as-judge evaluations across the full claims dataset, producing repeatable coherence and fluency scores you can version-track across prompt changes |
| 4 | Production Workflow | Wired both agents into an orchestrated pipeline in the Foundry portal: a stable, testable endpoint with run history that adjusters can inspect and audit |

Skills you practiced

- Designing agent system prompts with clear role boundaries and constraints
- Grounding agents in real claims data via tool calls (function calling)
- Distributed tracing for AI systems with OpenTelemetry
- LLM-as-judge evaluation with the Azure AI Evaluation SDK
- Multi-agent orchestration in the Foundry portal

Next Steps

Want to take the ClaimSight system further? Here are some directions:

- Add more agents: a Document Extraction agent that parses uploaded PDFs, or a Fraud Pattern agent that cross-references claim history across policyholders
- Connect real data: replace the static claims_data.json with queries against a live policy management system or document store
- Improve evaluation: add task-specific evaluators (e.g., "did the agent correctly flag a claim with a fraud score above 0.7?") alongside the generic coherence scores
- Set up CI/CD: run your evaluation dataset automatically on every prompt change using GitHub Actions, and fail the build if quality scores drop below a threshold
- Explore fine-tuning: use your traced claim decisions as training data to fine-tune a smaller model for the initial triage step
- Try another scenario: the Factory and Call Center scenarios cover predictive maintenance and customer support using the same lifecycle
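
Two of the directions above, a task-specific evaluator and a CI quality gate, can be sketched together in plain Python. This is an illustration under stated assumptions, not code from the lab: the `FraudFlagEvaluator` class, the `fraud_score` and `decision` field names, the `flag_for_investigation` label, and the 0.9 minimum pass rate are all hypothetical. The evaluator follows the common pattern of a callable that returns a dictionary of scores, which is how code-based custom evaluators are typically written for the Azure AI Evaluation SDK, but the sketch itself does not import the SDK.

```python
from typing import Iterable


class FraudFlagEvaluator:
    """Checks whether high-fraud-risk claims were flagged for investigation.

    Hypothetical evaluator: field names and the flag label are assumptions.
    """

    def __init__(self, threshold: float = 0.7,
                 flag_label: str = "flag_for_investigation"):
        self.threshold = threshold
        self.flag_label = flag_label

    def __call__(self, *, fraud_score: float, decision: str, **kwargs) -> dict:
        # The check only applies to claims scored above the threshold.
        applicable = fraud_score > self.threshold
        flagged = decision == self.flag_label
        # Claims at or below the threshold are out of scope and count as passing.
        correct = 1.0 if (not applicable or flagged) else 0.0
        return {"fraud_flag_applicable": applicable, "fraud_flag_correct": correct}


def pass_rate(rows: Iterable[dict], evaluator: FraudFlagEvaluator) -> float:
    """Fraction of applicable rows where the agent flagged the claim."""
    results = [evaluator(**row) for row in rows]
    applicable = [r for r in results if r["fraud_flag_applicable"]]
    if not applicable:
        return 1.0  # nothing to check, so nothing failed
    return sum(r["fraud_flag_correct"] for r in applicable) / len(applicable)


def quality_gate(rate: float, minimum: float = 0.9) -> int:
    """Map a pass rate to a process exit code: 0 passes, 1 fails."""
    return 0 if rate >= minimum else 1


# Made-up sample rows standing in for evaluated agent outputs.
sample_rows = [
    {"fraud_score": 0.85, "decision": "flag_for_investigation"},
    {"fraud_score": 0.92, "decision": "approve"},
    {"fraud_score": 0.10, "decision": "approve"},
]
rate = pass_rate(sample_rows, FraudFlagEvaluator())
exit_code = quality_gate(rate)
print(f"fraud flag pass rate: {rate:.2f}, exit code: {exit_code}")
```

In a CI job, a script like this would end with `sys.exit(exit_code)`; a GitHub Actions step fails when its command exits with a non-zero code, which is what turns a quality drop into a failed build.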

Clean Up Azure Resources

Important: The resources deployed in Challenge 0 incur Azure costs while they exist. Delete them when you're done.

What gets deleted

The resource group `foundry-hackathon-rg-<suffix>` and everything inside it:
- Microsoft Foundry Resource + project
- GPT model deployment
- Log Analytics workspace
- Application Insights instance

Option 1: Script

Run the cleanup script from the repo root:

bash claims/cleanup.sh

The script reads the .env file written by deploy.sh so it knows exactly which resource group to target. It asks for confirmation before deleting.
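
The lookup described above can be pictured with a short Python sketch. This is not the contents of `cleanup.sh`: the key name `RESOURCE_GROUP`, the helper names, and the sample suffix `example` are hypothetical, and the real .env written by deploy.sh may use different keys. The sketch parses simple KEY=VALUE lines and builds an `az group delete` command without running it. It leaves out `--yes`, so the Azure CLI would still prompt for confirmation.

```python
import shlex


def parse_env(text: str) -> dict:
    """Parse simple KEY=VALUE lines, skipping blank lines and # comments."""
    values = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        # Drop surrounding whitespace and optional quotes around the value.
        values[key.strip()] = value.strip().strip('"').strip("'")
    return values


def build_delete_command(env: dict, key: str = "RESOURCE_GROUP") -> list:
    """Build the az CLI argument list for deleting the lab resource group."""
    group = env[key]  # KeyError here means the assumed key name is wrong
    # Guard against deleting anything other than the lab's resource group.
    if not group.startswith("foundry-hackathon-rg-"):
        raise ValueError(f"unexpected resource group name: {group!r}")
    return ["az", "group", "delete", "--name", group]


# Made-up .env contents for illustration only.
sample_env = 'RESOURCE_GROUP="foundry-hackathon-rg-example"\n# other settings\n'
command = build_delete_command(parse_env(sample_env))
print(shlex.join(command))
```

The prefix check is a design choice for the sketch: reading the target from a file is convenient, but a stale or hand-edited file should not be able to point the delete at an unrelated resource group.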

Option 2: Azure Portal

1. Go to portal.azure.com
2. Search for Resource groups
3. Find foundry-hackathon-rg-<suffix>
4. Click Delete resource group and confirm

Option 3: Azure CLI

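# Replace <suffix> with the value shown in your .env file.
# --yes skips the confirmation prompt; --no-wait returns immediately while deletion continues in the background.
az group delete --name foundry-hackathon-rg-<suffix> --yes --no-wait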