Abstract
EXP-Bench is a novel benchmark designed by the Curie team to systematically evaluate AI agents on complete research experiments sourced from influential AI publications. Given a research question and incomplete starter code, EXP-Bench challenges AI agents to formulate hypotheses, design and implement experimental procedures, execute them, and analyze the results. The dataset curates 461 AI research tasks drawn from 51 top-tier AI research papers.
Figure 1: EXP-Bench high-level overview.
EXP-Bench evaluates AI agents on research experiment tasks.
Figure 2: One AI research task example from ICLR 2024 MogaNet.
Citation
Patrick Tser Jern Kon, Jiachen Liu, Xinyi Zhu, Qiuyi Ding, Jingjia Peng, Jiarong Xing, Yibo Huang, Yiming Qiu, Jayanth Srinivasa, Myungjin Lee, Mosharaf Chowdhury, Matei Zaharia, and Ang Chen. “EXP-Bench: Can AI Conduct AI Research Experiments?” arXiv, 2025.