DC AIFeb 10

KORAL: Knowledge Graph Guided LLM Reasoning for SSD Operational Analysis

Mayur Akewar, Sandeep Madireddy, Dongsheng Luo, Janki Bhimani

arXiv:2602.10246v12.31 citationsHas Code

Originality Incremental advance

AI Analysis

This addresses the problem of SSD operational analysis for datacenter operators and system administrators by providing an automated, explainable tool, though it is incremental as it combines existing LLM and KG techniques in a new application.

The paper tackles the difficulty of diagnosing SSD performance and reliability due to fragmented data and limited insights from existing methods by introducing KORAL, a framework that integrates LLMs with knowledge graphs to generate expert-level analysis, reducing manual effort and improving transparency.

Solid State Drives (SSDs) are critical to datacenters, consumer platforms, and mission-critical systems. Yet diagnosing their performance and reliability is difficult because data are fragmented and time-disjoint, and existing methods demand large datasets and expert input while offering only limited insights. Degradation arises not only from shifting workloads and evolving architectures but also from environmental factors such as temperature, humidity, and vibration. We present KORAL, a knowledge driven reasoning framework that integrates Large Language Models (LLMs) with a structured Knowledge Graph (KG) to generate insights into SSD operations. Unlike traditional approaches that require extensive expert input and large datasets, KORAL generates a Data KG from fragmented telemetry and integrates a Literature KG that already organizes knowledge from literature, reports, and traces. This turns unstructured sources into a queryable graph and telemetry into structured knowledge, and both the Graphs guide the LLM to deliver evidence-based, explainable analysis aligned with the domain vocabulary and constraints. Evaluation using real production traces shows that the KORAL delivers expert-level diagnosis and recommendations, supported by grounded explanations that improve reasoning transparency, guide operator decisions, reduce manual effort, and provide actionable insights to improve service quality. To our knowledge, this is the first end-to-end system that combines LLMs and KGs for full-spectrum SSD reasoning including Descriptive, Predictive, Prescriptive, and What-if analysis. We release the generated SSD-specific KG to advance reproducible research in knowledge-based storage system analysis. GitHub Repository: https://github.com/Damrl-lab/KORAL

View on arXiv PDF Code

Similar