Prerequisites and Build Validation
[!NOTE] This guide expands on the Prerequisites section of the main contributing guide.
Tools, Azure access, and build validation requirements for contributing to the Physical AI Toolchain.
Required Tools
Install these tools before contributing:
| Tool | Minimum Version | Installation |
|---|---|---|
| Terraform | 1.9.8 | https://developer.hashicorp.com/terraform/install |
| TFLint | 0.61.0 | https://github.com/terraform-linters/tflint |
| Azure CLI | 2.65.0 | https://learn.microsoft.com/cli/azure/install-azure-cli |
| kubectl | 1.31 | https://kubernetes.io/docs/tasks/tools/ |
| Helm | 3.16 | https://helm.sh/docs/intro/install/ |
| Node.js/npm | 20+ LTS | https://nodejs.org/ |
| Python | 3.11+ | https://www.python.org/downloads/ |
| shellcheck | 0.10+ | https://www.shellcheck.net/ |
| uv | latest | https://docs.astral.sh/uv/ |
| Go | 1.24+ | https://go.dev/dl/ |
| golangci-lint | 2.11+ | https://golangci-lint.run/welcome/install/ |
| Docker | latest | https://docs.docker.com/get-docker/ (with NVIDIA Container Toolkit) |
| OSMO CLI | latest | https://developer.nvidia.com/osmo |
| terraform-docs | 0.21.0 | https://github.com/terraform-docs/terraform-docs/releases |
| hve-core | latest | https://github.com/microsoft/hve-core |
Azure Access Requirements
Deploying this architecture requires Azure subscription access with specific permissions and quotas:
Subscription Roles
Contributorrole for resource group creation and managementUser Access Administratorrole for managed identity assignment
GPU Quota
- Request GPU VM quota in your target region before deployment
- Architecture uses
Standard_NC24ads_A100_v4(24 vCPU, 220 GB RAM, 1x A100 80GB GPU) - Check quota:
az vm list-usage --location <region> --query "[?name.value=='standardNCadsA100v4Family']" - Request increase through Azure Portal → Quotas → Compute
Regional Availability
- Verify GPU VM availability in target region: https://azure.microsoft.com/global-infrastructure/services/?products=virtual-machines
- Architecture validated in
eastus,westus2,westeurope
NVIDIA NGC Account
Training workflows use NVIDIA GPU Operator and Isaac Lab, which require NGC credentials:
- Create account: https://ngc.nvidia.com/signup
- Generate API key: NGC Console → Account Settings → Generate API Key
- Store API key in Azure Key Vault or Kubernetes secret (deployment scripts provide guidance)
Cost Awareness
Full deployment validation incurs Azure costs. Understand cost structure before deploying:
GPU Virtual Machines
Standard_NC24ads_A100_v4: ~$3.06/hour per VM (pay-as-you-go)- 8-hour validation session: ~$25
- 40-hour work week: ~$125
Managed Services
- AKS control plane:
$0.10/hour ($73/month) - Log Analytics workspace: ~$2.76/GB ingested
- Storage accounts: ~$0.02/GB (block blob, hot tier)
- Azure Container Registry: Basic tier ~$5/month
Cost Optimization
- Use
terraform destroyimmediately after validation - Automate cleanup with
-auto-approveflag - Monitor costs: Azure Portal → Cost Management + Billing
- Set budget alerts to prevent overruns
Estimated Costs
- Quick validation (deploy + verify + destroy): ~$25-50
- Extended development session (8 hours): ~$50-100
- Monthly development (40 hours): ~$200-300
Build and Validation Requirements
Tool Version Verification
Verify tool versions before validating:
# Terraform
terraform version # >= 1.9.8
# TFLint (Terraform linter)
tflint --version # >= 0.61.0
# Azure CLI
az version # >= 2.65.0
# kubectl
kubectl version --client # >= 1.31
# Helm
helm version # >= 3.16
# Node.js (for documentation linting)
node --version # >= 20
# Python (for training scripts)
python --version # >= 3.11
# shellcheck (for shell script validation)
shellcheck --version # >= 0.10
# uv (Python package manager)
uv --version
# Go
go version # >= 1.24
# golangci-lint
golangci-lint version # >= 2.11
# Docker with NVIDIA Container Toolkit
docker --version
nvidia-ctk --version
# OSMO CLI
osmo --version
# terraform-docs
terraform-docs --version # >= 0.21.0
# hve-core (VS Code extension — verify via extensions list)
code --list-extensions | grep -i hve-core
Validation Commands
Run these commands before committing:
Terraform:
# Format check (required)
terraform fmt -check -recursive infrastructure/terraform/
# Initialize and validate (required for infrastructure changes)
cd infrastructure/terraform/
terraform init
terraform validate
# Lint Terraform configurations (required for infrastructure changes)
tflint --recursive infrastructure/terraform/
Shell Scripts:
# Lint all shell scripts (required)
shellcheck deploy/**/*.sh scripts/**/*.sh
Go:
# Lint Go modules (required for Go changes)
npm run lint:go
# Test Go modules (required for Go changes)
npm run test:go
Documentation:
# Install dependencies (first time only)
npm install
# Lint markdown (required for documentation changes)
npm run lint:md
VS Code Configuration
The workspace is configured with python.analysis.extraPaths pointing to src/, enabling imports like:
from training.utils import AzureMLContext, bootstrap_azure_ml
Select the .venv/bin/python interpreter in VS Code for IntelliSense support.
The workspace .vscode/settings.json also configures Copilot Chat to load instructions, prompts, and chat modes from hve-core:
| Setting | hve-core Paths |
|---|---|
chat.modeFilesLocations | ../hve-core/.github/chatmodes, ../hve-core/copilot/beads/chatmodes |
chat.instructionsFilesLocations | ../hve-core/.github/instructions, ../hve-core/copilot/beads/instructions |
chat.promptFilesLocations | ../hve-core/.github/prompts, ../hve-core/copilot/beads/prompts |
These paths resolve when hve-core is installed as a peer directory or via the VS Code Extension. Without hve-core, Copilot still functions but shared conventions, prompts, and chat modes are unavailable.
For a complete list of available agents, prompts, and skills, see Copilot Artifacts.
Related Documentation
- Contributing Guide - Main contributing guide with all sections
- Deployment Validation - Validation levels and testing templates
- Cost Considerations - Component costs, budgeting, regional pricing