Skip to content

🎉 Lab Complete — Insurance Claims Processing (ClaimSight Insurance)

Congratulations — you've built, instrumented, evaluated, and deployed a production-ready multi-agent AI system from scratch. Here's what you accomplished.


Recap

# Challenge What You Built
0 Setup Provisioned a Microsoft Foundry Resource, project, GPT model deployment, Log Analytics workspace, and Application Insights instance via a single deploy.sh script
1 Build Agents Created a Claims Triage Agent (assesses document completeness, fraud risk, and policy coverage) and a Claims Decision Agent (recommends approve, fast-track, flag for investigation, or deny — with supporting rationale)
2 Monitor Enabled OpenTelemetry GenAI tracing — every model call, tool invocation, and token count is captured as a distributed trace in Application Insights
3 Evaluate Ran systematic LLM-as-judge evaluations across the full claims dataset, producing repeatable coherence and relevance scores you can version-track across prompt changes
4 Production Workflow Wired both agents into an orchestrated pipeline in the Foundry portal — a stable, testable endpoint with run history that adjusters can inspect and audit

Skills you practiced

  • Designing agent system prompts with clear role boundaries and constraints
  • Grounding agents in real claims data via tool calls (function calling)
  • Distributed tracing for AI systems with OpenTelemetry
  • LLM-as-judge evaluation with the Azure AI Evaluation SDK
  • Multi-agent orchestration in the Foundry portal

Next Steps

Want to take the ClaimSight system further? Here are some directions:

  • Add more agents — a Document Extraction agent that parses uploaded PDFs, or a Fraud Pattern agent that cross-references claim history across policyholders
  • Connect real data — replace the static claims_data.json with a live policy management system or document storage query
  • Improve evaluation — add task-specific evaluators (e.g., "did the agent correctly flag a claim with a fraud score above 0.7?") alongside the generic coherence scores
  • Set up CI/CD — run your evaluation dataset automatically on every prompt change using GitHub Actions and fail the build if quality scores drop below a threshold
  • Explore fine-tuning — use your traced claim decisions as training data to fine-tune a smaller model for the initial triage step
  • Try another scenario — the Factory and Call Center scenarios cover predictive maintenance and customer support using the same lifecycle

Clean Up Azure Resources

Important: The resources deployed in Challenge 0 incur Azure costs while they exist. Delete them when you're done.

What gets deleted

  • The resource group foundry-hackathon-rg-<suffix> and everything inside it:
  • Microsoft Foundry Resource + project
  • GPT model deployment
  • Log Analytics workspace
  • Application Insights instance

Option 1 — Script

Run the cleanup script from the repo root:

bash claims/cleanup.sh

The script reads the .env file written by deploy.sh so it knows exactly which resource group to target. It asks for confirmation before deleting.

Option 2 — Azure Portal

  1. Go to portal.azure.com
  2. Search for Resource groups
  3. Find foundry-hackathon-rg-<your-suffix>
  4. Click Delete resource group and confirm

Option 3 — Azure CLI

# Replace <suffix> with the value shown in your .env file
az group delete --name foundry-hackathon-rg-<suffix> --yes --no-wait