Return to search results
NLR HPC Eagle GPU Node Metrics
Ganglia node metrics and iLO (Integrated Lights Out) power data captured from six representative Eagle GPU nodesThe Eagle HPC operated at NLR from 2019 through 2024. Eagle was a 2,000-node, 8-petaflop system. This dataset is a representative sample of metrics for 6 of the GPU nodes. Each GPU node contained 2 CPUs and 2 GPUs. Data provided in compressed CSV format.Ganglia and iLO Power Time Series Fields ts:  Timestampdv:  Device / Node - Rack and Unit - r103u17 == r(ack)103u(nit)17mt:  Metric (only present for Ganglia)vl:  Value - Value in watts for iLO power (instantaneous value at sampling time) or specified Ganglia metric belowGanglia MetricsMetric name -- Metric description -- Unitcpu_aidle -- Percent of time since boot idle CPU -- Percentcpu_idle -- Percent CPU idle -- Percentcpu_nice -- Percent CPU nice -- Percentcpu_speed -- Speed in MHz of CPU -- MHzcpu_user -- Percent CPU user -- Percentcpu_wio -- The percentage of CPU Wait I/O -- Percentgpu0_bar1_memory -- Used GPU bar1 memory -- MBgpu0_decoder_util -- GPU decoder utilization -- Percentgpu0_ecc_db_error -- Total ECC error counts for the GPU -- Numbergpu0_encoder_util -- GPU encoder utilization -- Percentgpu0_fan -- Fan speed -- RPMgpu0_fb_memory -- Used GPU framebuffer memory -- MBgpu0_graphics_clock_report -- Current clock speeds for the device -- MHzgpu0_mem_total -- Memory total -- MBgpu0_mem_util -- Memory utilization -- Percentgpu0_power_usage_report -- Power usage report -- Wattsgpu0_temp -- GPU 1 temperature -- Celsiusgpu1_bar1_memory -- Used GPU bar1 memory -- MBgpu1_decoder_util -- GPU decoder utilization -- Percentgpu1_ecc_db_error -- Total ECC error counts for the GPU -- Numbergpu1_encoder_util -- GPU encoder utilization -- Percentgpu1_fan -- Fan speed -- RPMgpu1_fb_memory -- Used GPU framebuffer memory -- MBgpu1_graphics_clock_report -- Current clock speeds for the GPU -- MHzgpu1_mem_total -- Memory total -- MBgpu1_mem_util -- Memory utilization -- MBgpu1_power_usage_report -- Power usage report -- Wattsgpu1_temp -- GPU 1 temperature -- Celsiusipmi_cpu1_temp -- CPU 1 temperature -- Celsiusipmi_cpu2_temp -- CPU 2 temperature -- Celsiusipmi_inlet_ambient_temp -- Temperature measured at intake -- Celsiusipmi_vr_p1_temp -- CPU 1 voltage regulator temperature -- Celsiusipmi_vr_p2_temp -- CPU 2 voltage regulator temperature -- Celsiusmem_buffers -- Amount of buffered memory -- Bytesmem_cached -- Amount of cached memory -- Bytesmem_free -- Amount of available memory -- Bytesmem_shared -- Amount of shared memory -- Bytesmem_total -- Amount of available memory -- Bytes
Complete Metadata
| @type | dcat:Dataset |
|---|---|
| accessLevel | public |
| bureauCode |
[
"019:20"
]
|
| contactPoint |
{
"fn": "Struan Clark",
"@type": "vcard:Contact",
"hasEmail": "mailto:Struan.Clark@nlr.gov"
}
|
| dataQuality |
true
|
| description | Ganglia node metrics and iLO (Integrated Lights Out) power data captured from six representative Eagle GPU nodesThe Eagle HPC operated at NLR from 2019 through 2024. Eagle was a 2,000-node, 8-petaflop system. This dataset is a representative sample of metrics for 6 of the GPU nodes. Each GPU node contained 2 CPUs and 2 GPUs. Data provided in compressed CSV format.Ganglia and iLO Power Time Series Fields ts: Timestampdv: Device / Node - Rack and Unit - r103u17 == r(ack)103u(nit)17mt: Metric (only present for Ganglia)vl: Value - Value in watts for iLO power (instantaneous value at sampling time) or specified Ganglia metric belowGanglia MetricsMetric name -- Metric description -- Unitcpu_aidle -- Percent of time since boot idle CPU -- Percentcpu_idle -- Percent CPU idle -- Percentcpu_nice -- Percent CPU nice -- Percentcpu_speed -- Speed in MHz of CPU -- MHzcpu_user -- Percent CPU user -- Percentcpu_wio -- The percentage of CPU Wait I/O -- Percentgpu0_bar1_memory -- Used GPU bar1 memory -- MBgpu0_decoder_util -- GPU decoder utilization -- Percentgpu0_ecc_db_error -- Total ECC error counts for the GPU -- Numbergpu0_encoder_util -- GPU encoder utilization -- Percentgpu0_fan -- Fan speed -- RPMgpu0_fb_memory -- Used GPU framebuffer memory -- MBgpu0_graphics_clock_report -- Current clock speeds for the device -- MHzgpu0_mem_total -- Memory total -- MBgpu0_mem_util -- Memory utilization -- Percentgpu0_power_usage_report -- Power usage report -- Wattsgpu0_temp -- GPU 1 temperature -- Celsiusgpu1_bar1_memory -- Used GPU bar1 memory -- MBgpu1_decoder_util -- GPU decoder utilization -- Percentgpu1_ecc_db_error -- Total ECC error counts for the GPU -- Numbergpu1_encoder_util -- GPU encoder utilization -- Percentgpu1_fan -- Fan speed -- RPMgpu1_fb_memory -- Used GPU framebuffer memory -- MBgpu1_graphics_clock_report -- Current clock speeds for the GPU -- MHzgpu1_mem_total -- Memory total -- MBgpu1_mem_util -- Memory utilization -- MBgpu1_power_usage_report -- Power usage report -- Wattsgpu1_temp -- GPU 1 temperature -- Celsiusipmi_cpu1_temp -- CPU 1 temperature -- Celsiusipmi_cpu2_temp -- CPU 2 temperature -- Celsiusipmi_inlet_ambient_temp -- Temperature measured at intake -- Celsiusipmi_vr_p1_temp -- CPU 1 voltage regulator temperature -- Celsiusipmi_vr_p2_temp -- CPU 2 voltage regulator temperature -- Celsiusmem_buffers -- Amount of buffered memory -- Bytesmem_cached -- Amount of cached memory -- Bytesmem_free -- Amount of available memory -- Bytesmem_shared -- Amount of shared memory -- Bytesmem_total -- Amount of available memory -- Bytes |
| distribution |
[
{
"@type": "dcat:Distribution",
"title": "Eagle Ganglia Data - Node r103u17 (Zipped CSV)",
"accessURL": "https://data.nrel.gov/system/files/301/1757117950-esif.hpc.eagle.ganglia.gpu.sixnodes_r103u17.csv.zip",
"mediaType": "application/octet-stream",
"description": "Eagle Ganglia Data - Node r103u17 (Zipped CSV)"
},
{
"@type": "dcat:Distribution",
"title": "Eagle Ganglia Data - Node r103u21 (Zipped CSV)",
"accessURL": "https://data.nrel.gov/system/files/301/1757117950-esif.hpc.eagle.ganglia.gpu.sixnodes_r103u21.csv.zip",
"mediaType": "application/octet-stream",
"description": "Eagle Ganglia Data - Node r103u21 (Zipped CSV)"
},
{
"@type": "dcat:Distribution",
"title": "Eagle Ganglia Data - Node r104u29 (Zipped CSV)",
"accessURL": "https://data.nrel.gov/system/files/301/1757117950-esif.hpc.eagle.ganglia.gpu.sixnodes_r104u29.csv.zip",
"mediaType": "application/octet-stream",
"description": "Eagle Ganglia Data - Node r104u29 (Zipped CSV)"
},
{
"@type": "dcat:Distribution",
"title": "Eagle Ganglia Data - Node r104u33 (Zipped CSV)",
"accessURL": "https://data.nrel.gov/system/files/301/1757117950-esif.hpc.eagle.ganglia.gpu.sixnodes_r104u33.csv.zip",
"mediaType": "application/octet-stream",
"description": "Eagle Ganglia Data - Node r104u33 (Zipped CSV)"
},
{
"@type": "dcat:Distribution",
"title": "Eagle Ganglia Data - Node r105u09 (Zipped CSV)",
"accessURL": "https://data.nrel.gov/system/files/301/1757117950-esif.hpc.eagle.ganglia.gpu.sixnodes_r105u09.csv.zip",
"mediaType": "application/octet-stream",
"description": "Eagle Ganglia Data - Node r105u09 (Zipped CSV)"
},
{
"@type": "dcat:Distribution",
"title": "Eagle Ganglia Data - Node r105u15 (Zipped CSV)",
"accessURL": "https://data.nrel.gov/system/files/301/1757117950-esif.hpc.eagle.ganglia.gpu.sixnodes_r105u15.csv.zip",
"mediaType": "application/octet-stream",
"description": "Eagle Ganglia Data - Node r105u15 (Zipped CSV)"
},
{
"@type": "dcat:Distribution",
"title": "Eagle iLO Power Data - Node r103u17 (Zipped CSV)",
"accessURL": "https://data.nrel.gov/system/files/301/1757117950-esif.hpc.eagle.ilo-power.gpu.sixnodes_r103u17.csv.zip",
"mediaType": "application/octet-stream",
"description": "Eagle iLO Power Data - Node r103u17 (Zipped CSV)"
},
{
"@type": "dcat:Distribution",
"title": "Eagle iLO Power Data - Node r103u21 (Zipped CSV)",
"accessURL": "https://data.nrel.gov/system/files/301/1757117950-esif.hpc.eagle.ilo-power.gpu.sixnodes_r103u21.csv.zip",
"mediaType": "application/octet-stream",
"description": "Eagle iLO Power Data - Node r103u21 (Zipped CSV)"
},
{
"@type": "dcat:Distribution",
"title": "Eagle iLO Power Data - Node r104u29 (Zipped CSV)",
"accessURL": "https://data.nrel.gov/system/files/301/1757117950-esif.hpc.eagle.ilo-power.gpu.sixnodes_r104u29.csv.zip",
"mediaType": "application/octet-stream",
"description": "Eagle iLO Power Data - Node r104u29 (Zipped CSV)"
},
{
"@type": "dcat:Distribution",
"title": "Eagle iLO Power Data - Node r104u33 (Zipped CSV)",
"accessURL": "https://data.nrel.gov/system/files/301/1757117950-esif.hpc.eagle.ilo-power.gpu.sixnodes_r104u33.csv.zip",
"mediaType": "application/octet-stream",
"description": "Eagle iLO Power Data - Node r104u33 (Zipped CSV)"
},
{
"@type": "dcat:Distribution",
"title": "Eagle iLO Power Data - Node r105u09 (Zipped CSV)",
"accessURL": "https://data.nrel.gov/system/files/301/1757117950-esif.hpc.eagle.ilo-power.gpu.sixnodes_r105u09.csv.zip",
"mediaType": "application/octet-stream",
"description": "Eagle iLO Power Data - Node r105u09 (Zipped CSV)"
},
{
"@type": "dcat:Distribution",
"title": "Eagle iLO Power Data - Node r105u15 (Zipped CSV)",
"accessURL": "https://data.nrel.gov/system/files/301/1757117950-esif.hpc.eagle.ilo-power.gpu.sixnodes_r105u15.csv.zip",
"mediaType": "application/octet-stream",
"description": "Eagle iLO Power Data - Node r105u15 (Zipped CSV)"
},
{
"@type": "dcat:Distribution",
"title": "Sample graph of node GPU temperatures created from the included Ganglia datasets. Indicates time regions where data is missing.",
"accessURL": "https://data.nrel.gov/system/files/301/1757118458-esif.hpc.eagle.ganglia.gpu-metrics.temps.jpg",
"mediaType": "application/octet-stream",
"description": "Sample graph of node GPU temperatures created from the included Ganglia datasets. Indicates time regions where data is missing."
},
{
"@type": "dcat:Distribution",
"title": "Eagle Ganglia Data
- All six nodes (Zipped CSV)
- MD5sum: bf2c397ce74dfcc82ac1be425647f2fc",
"accessURL": "https://data.nrel.gov/system/files/301/1757532691-esif.hpc.eagle.ganglia.gpu.sixnodes.csv.zip",
"mediaType": "application/octet-stream",
"description": "Eagle Ganglia Data
- All six nodes (Zipped CSV)
- MD5sum: bf2c397ce74dfcc82ac1be425647f2fc"
},
{
"@type": "dcat:Distribution",
"title": "Eagle iLO Power Data
- All six nodes (Zipped CSV)
- MD5sum: 2c4320667daca7d55a31181bc47b56d9",
"accessURL": "https://data.nrel.gov/system/files/301/1757532691-esif.hpc.eagle.ilo-power.gpu.sixnodes.csv.zip",
"mediaType": "application/octet-stream",
"description": "Eagle iLO Power Data
- All six nodes (Zipped CSV)
- MD5sum: 2c4320667daca7d55a31181bc47b56d9"
},
{
"@type": "dcat:Distribution",
"title": "Dataset description",
"accessURL": "https://data.nrel.gov/system/files/301/1769711921-README_1.md",
"mediaType": "application/octet-stream",
"description": "Dataset description"
}
]
|
| identifier | https://data.openei.org/submissions/8617 |
| issued | 2026-01-29T18:38:41Z |
| keyword |
[
"ESIF",
"GPU",
"HPC",
"node usage",
"power"
]
|
| landingPage | https://data.nrel.gov/submissions/301 |
| license | https://creativecommons.org/licenses/by/4.0/ |
| modified | 2026-01-29T18:42:54Z |
| programCode |
[
"019:000",
"019:023"
]
|
| projectNumber | DE-AC36-08GO28308 |
| projectTitle | |
| publisher |
{
"name": "National Laboratory of the Rockies",
"@type": "org:Organization"
}
|
| title | NLR HPC Eagle GPU Node Metrics |