Contribute to HealthAgentBench

Add realistic healthcare tasks to the benchmark

HealthAgentBench grows through clinically meaningful tasks that test agent behavior in real healthcare workflows. We welcome new task proposals, datasets, evaluation ideas, and domain collaborations that broaden the benchmark.

What to Share

Send us a short description of the task, the healthcare workflow it represents, available data or environment constraints, and how success should be evaluated.