Downloads: 8
United States | Computer Science and Information Technology | Volume 14 Issue 2, February 2025 | Pages: 714 - 716
Enhancing Incident Response with AI - Assisted Runbooks: A Framework for Smarter Troubleshooting
Abstract: Site Reliability Engineering (SRE) teams rely on runbooks to help them troubleshoot and manage incidents, but traditional runbooks can be rigid, outdated, and hard to maintain - especially in fast - evolving tech environments. This paper explores how integrating AI - assisted runbooks, powered by structured prompts, can make incident response faster, more efficient, and more reliable. We present a framework that uses Large Language Models (LLMs) and structured prompting to create flexible, context - aware troubleshooting guides. Techniques like few - shot prompting, chain - of - thought reasoning, and self - refinement are key to this approach.
Keywords: Prompt Engineering, Site Reliability Engineering, AI - assisted Runbooks, Large Language Models, Incident Management, Observability
How to Cite?: Binoj Melath Nalinakshan Nair, "Enhancing Incident Response with AI - Assisted Runbooks: A Framework for Smarter Troubleshooting", Volume 14 Issue 2, February 2025, International Journal of Science and Research (IJSR), Pages: 714-716, https://www.ijsr.net/getabstract.php?paperid=SR25212002547, DOI: https://dx.doi.org/10.21275/SR25212002547
Received Comments
No approved comments available.