AIOpsLab

A Holistic Framework to Design, Develop, and Evaluate AI Agents for Enabling Autonomous Clouds

News

🆕 [3/2025] Join our tutorial "AIOpsLab in Action: An Open Platform for AIOps Research" at FSE'25 in Norway, June 2025 🎉 [Link]

🆕 [2/2025] "AIOpsLab: A Holistic Framework to Evaluate AI Agents for Enabling Autonomous Clouds" was accepted at MLSys'25 🎉 [Link]

🆕 [12/2024] Microsoft Research features AIOpsLab in their latest blog post! 🌐 [Link]

🆕 [12/2024] Our code is now live on GitHub! 🚀 [Link]

🆕 [10/2024] Our vision paper "Building AI Agents for Autonomous Clouds: Challenges and Design Principles" was accepted at SoCC'24 🎉 [Link]

About

AIOpsLab is a holistic framework for designing, developing, and evaluating autonomous AIOps agents. It also supports building reproducible, standardized, interoperable, and scalable benchmarks. AIOpsLab can deploy microservice cloud environments, inject faults, generate workloads, and export telemetry data, while orchestrating these components and providing interfaces for interacting with and evaluating agents. In addition, AIOpsLab ships with a built-in benchmark suite of problems for evaluating AIOps agents in an interactive environment, and the suite can be extended to meet user-specific needs.

The Orchestrator coordinates interactions between the system components and serves as the Agent-Cloud Interface (ACI). Agents engage with the Orchestrator to solve tasks: they receive a problem description, instructions, and the relevant APIs. The Orchestrator generates diverse problems using the Workload and Fault Generators and injects them into the applications it deploys. Each deployed service is observable, exposing telemetry such as metrics, traces, and logs. Agents act through the Orchestrator, which executes their actions and updates the service's state. The Orchestrator then evaluates the final solution using metrics predefined for the task.
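The interaction loop described above can be illustrated with a toy, self-contained sketch. It does not use AIOpsLab's actual API: `ToyOrchestrator`, `KeywordAgent`, the `get_logs` and `submit` APIs, the service names, and the log lines are all hypothetical stand-ins, chosen only to show the flow of context, actions, observations, and final evaluation.

```python
from dataclasses import dataclass
from typing import Dict, Optional, Tuple

Action = Tuple[str, str]  # (API name, argument); a hypothetical action format


@dataclass
class ToyOrchestrator:
    """Toy stand-in for the Orchestrator; NOT AIOpsLab's real interface."""
    description: str
    instructions: str
    logs: Dict[str, str]       # fake telemetry, keyed by service name
    faulty_service: str        # ground truth, never shown to the agent
    submitted: Optional[str] = None
    steps: int = 0

    def get_logs(self, service: str) -> str:
        return self.logs.get(service, "no such service")

    def submit(self, answer: str) -> str:
        self.submitted = answer
        return "submission received"

    def run(self, agent, max_steps: int = 10) -> dict:
        apis = {"get_logs": self.get_logs, "submit": self.submit}
        # The first observation carries the problem context and the API names.
        observation = f"{self.description}\n{self.instructions}\nAPIs: {sorted(apis)}"
        while self.steps < max_steps and self.submitted is None:
            name, arg = agent(observation)
            self.steps += 1
            # The orchestrator, not the agent, executes the chosen action.
            observation = apis[name](arg) if name in apis else f"unknown API: {name}"
        # Evaluate the final solution with simple predefined metrics.
        return {"success": self.submitted == self.faulty_service, "steps": self.steps}


class KeywordAgent:
    """Naive agent: reads each service's logs and submits the first with an ERROR."""

    def __init__(self, services):
        self.pending = list(services)
        self.current = None

    def __call__(self, observation: str) -> Action:
        if self.current is not None and "ERROR" in observation:
            return ("submit", self.current)
        if not self.pending:
            return ("submit", "unknown")
        self.current = self.pending.pop(0)
        return ("get_logs", self.current)


orchestrator = ToyOrchestrator(
    description="A request path in the application is failing.",
    instructions="Identify the faulty service and submit its name.",
    logs={
        "frontend": "INFO request served",
        "geo": "ERROR connection refused",
        "profile": "INFO request served",
    },
    faulty_service="geo",
)
result = orchestrator.run(KeywordAgent(["frontend", "geo", "profile"]))
print(result)
```

In this sketch the agent never touches the service directly: every action is returned to the orchestrator, which executes it, counts the step, and hands back the next observation. The step count and success flag stand in for the task-specific metrics mentioned above.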

BibTeX


    @inproceedings{shetty2024building,
        title = {Building AI Agents for Autonomous Clouds: Challenges and Design Principles},
        author = {Shetty, Manish and Chen, Yinfang and Somashekar, Gagan and Ma, Minghua and Simmhan, Yogesh and Zhang, Xuchao and Mace, Jonathan and Vandevoorde, Dax and Las-Casas, Pedro and Gupta, Shachee Mishra and Nath, Suman and Bansal, Chetan and Rajmohan, Saravan},
        year = {2024},
        booktitle = {Proceedings of the 15th ACM Symposium on Cloud Computing (SoCC'24)},
    }
    @misc{chen2024aiopslab,
        title = {AIOpsLab: A Holistic Framework to Evaluate AI Agents for Enabling Autonomous Clouds},
        author = {Chen, Yinfang and Shetty, Manish and Somashekar, Gagan and Ma, Minghua and Simmhan, Yogesh and Mace, Jonathan and Bansal, Chetan and Wang, Rujia and Rajmohan, Saravan},
        year = {2024},
        howpublished = {arXiv}
    }