Enhancing Service Reliability with Graph Reinforcement Learning: Real-Time Dependency Mapping and Failure Prediction
International Journal of Science and Research (IJSR)

International Journal of Science and Research (IJSR)
Call for Papers | Fully Refereed | Open Access | Double Blind Peer Reviewed

ISSN: 2319-7064


Downloads: 5 | Views: 113 | Weekly Hits: ⮙1 | Monthly Hits: ⮙3

Research Paper | Computer Science and Information Technology | United States of America | Volume 14 Issue 3, March 2025 | Popularity: 5.1 / 10


     

Enhancing Service Reliability with Graph Reinforcement Learning: Real-Time Dependency Mapping and Failure Prediction

Nishant Nisan Jha


Abstract: In large-scale distributed systems with numerous workflows and microservices, traditional service dependency mapping approaches rely on static graphs that fail to capture real-time changes, leading to delayed incident detection and prolonged downtime. This research explores Graph Reinforcement Learning (GRL) as a dynamic solution for modeling inter-service dependencies and predicting failure propagation in real time. By leveraging real-time telemetry data and historical incidents, GRL continuously updates dependency graphs, reducing Mean Time to Detect (MTTD) and Mean Time to Recover (MTTR). The paper further discusses implementation challenges, including computational complexity and scalability, and proposes solutions such as hierarchical clustering and distributed processing. The findings suggest that GRL significantly enhances system resilience, making it a valuable tool for modern reliability engineering.


Keywords: Graph Reinforcement Learning, service reliability, failure prediction, site reliability engineering, dynamic dependency mapping


Edition: Volume 14 Issue 3, March 2025


Pages: 346 - 353


DOI: https://www.doi.org/10.21275/SR25308025322


Please Disable the Pop-Up Blocker of Web Browser

Verification Code will appear in 2 Seconds ... Wait



Text copied to Clipboard!
Nishant Nisan Jha, "Enhancing Service Reliability with Graph Reinforcement Learning: Real-Time Dependency Mapping and Failure Prediction", International Journal of Science and Research (IJSR), Volume 14 Issue 3, March 2025, pp. 346-353, https://www.ijsr.net/getabstract.php?paperid=SR25308025322, DOI: https://www.doi.org/10.21275/SR25308025322

Top