Jacob Schmier is a Senior Technologist in Firmware Engineering at SanDisk, developing enterprise SSD firmware for next‑generation platforms. His work centers on implementation and technical leadership driving performance, reliability, and efficiency in data‑center storage.
AI workloads are driving unprecedented power demand in modern data centers, with rack densities exceeding 100 kW and rapidly rising global consumption. While GPUs dominate most power discussions, NVMe SSDs remain a significant and often under monitored contributor to rack level energy use. The NVMe 2.3 specification addresses this gap through two complementary capabilities: TP4199, which standardizes 1 second power measurement, lifetime energy reporting, programmable thresholds, and persistent power logs; and TP4210, which adds rail level voltage measurement, threshold triggered asynchronous events, and persistent voltage history to expose power integrity issues invisible in aggregate data.
This session demonstrates these features in practice. A TP4199 based case study compares measured versus estimated SSD power under high power workloads, highlighting reporting differences. A complementary TP4210 aligned scenario shows how per rail voltage telemetry and logged events can reveal issues hidden when relying solely on aggregate measurements. Together, these examples show how standardized SSD level telemetry improves visibility for managing power intensive AI deployments.