Return to search results
Peregrine HPC Energy Research
Complete power and performance metric data for 10,000 jobs randomly selected from two years of the Peregrine system’s operation (all jobs prior to October, 2015). Complete power and performance metric data for jobs run on the Peregrine system in April, 2015. Related Publications: Bruce Bugbee, Caleb Phillips, Kenny Gruchalla, Ryan Elmore, Avi Purkayastha. Exploring HPC Application Power Usage. Conference on Data Analysis (CoDA) 2016. March 2-4, 2016. Santa Fe, New Mexico. Ryan Elmore, Kenny Gruchalla, Caleb Phillips, Avi Purkayastha, Nick Wunder. An Analysis of Application Power and Schedule Composition in a High Performance Computing Environment. National Renewable Energy Laboratory. Technical Report. NREL/TP-2C00-65392. January, 2016. http://www.nrel.gov/docs/fy16osti/65392.pdf
Complete Metadata
| @type | dcat:Dataset |
|---|---|
| accessLevel | public |
| bureauCode |
[
"019:20"
]
|
| contactPoint |
{
"fn": "Caleb Phillips",
"@type": "vcard:Contact",
"hasEmail": "mailto:Caleb.Phillips@nrel.gov"
}
|
| dataQuality |
true
|
| description | Complete power and performance metric data for 10,000 jobs randomly selected from two years of the Peregrine system’s operation (all jobs prior to October, 2015). Complete power and performance metric data for jobs run on the Peregrine system in April, 2015. Related Publications: Bruce Bugbee, Caleb Phillips, Kenny Gruchalla, Ryan Elmore, Avi Purkayastha. Exploring HPC Application Power Usage. Conference on Data Analysis (CoDA) 2016. March 2-4, 2016. Santa Fe, New Mexico. Ryan Elmore, Kenny Gruchalla, Caleb Phillips, Avi Purkayastha, Nick Wunder. An Analysis of Application Power and Schedule Composition in a High Performance Computing Environment. National Renewable Energy Laboratory. Technical Report. NREL/TP-2C00-65392. January, 2016. http://www.nrel.gov/docs/fy16osti/65392.pdf |
| distribution |
[
{
"@type": "dcat:Distribution",
"title": "Complete power and performance metric data from 10,000 jobs randomly selected from two years of the Peregrine system’s operation (all jobs prior to October, 2015).
Format:
The data is available as a compressed JSON stream — one line per JSON object. Each object is a hash containing three elements, each of which is itself a hash:
“power”
“ganglia”
“job”
The “job” object contains metadata for the job including these fields:
id unique identifier for this job
user_name user running the job — anonymized
group_name group running the job — anonymized
account allocation the job is charged to — anonymized
queue the queue the job is submitted to: https://hpc.nrel.gov/users/systems/peregrine/running-jobs/new-queue-structure
submit_time
start_time
end_time
exit_code exit code from Moab
nodelist colon separated list of node hostnames used for the job
nodes_used
processors_used
wallclock_used
torque_wallclock_used
torque_exit_code exit code according to torque
cpu_used
mem_used
vmem_used
wallclock_req
feature_req
nodes_req
processors_req feature requests
interactive whether or not the job was run interactively
node_hours
app_name application category
app_name_predicted whether or not application category was predicted using a Naïve Bayes classifier
The "power" and "ganglia" objects contain time-series data from iLo http://www8.hp.com/us/en/products/servers/ilo/ and NWPerf https://github.com/EMSL-MSC/NWPerf/.
The format of the timeseries data is a list of hashes containing, timestamp (epoch time in microseconds), metric name, host and value, e.g.,:
{
"timestamp": 1461697084441,
"host": "n1668",
"value": 228,
"metric": "power"
}
Anonymized fields have been SHA1 hashed.",
"accessURL": "https://data.nrel.gov/system/files/235/1712695887-10k.anon.json.bz2",
"mediaType": "application/octet-stream",
"description": "Complete power and performance metric data from 10,000 jobs randomly selected from two years of the Peregrine system’s operation (all jobs prior to October, 2015).
Format:
The data is available as a compressed JSON stream — one line per JSON object. Each object is a hash containing three elements, each of which is itself a hash:
“power”
“ganglia”
“job”
The “job” object contains metadata for the job including these fields:
id unique identifier for this job
user_name user running the job — anonymized
group_name group running the job — anonymized
account allocation the job is charged to — anonymized
queue the queue the job is submitted to: https://hpc.nrel.gov/users/systems/peregrine/running-jobs/new-queue-structure
submit_time
start_time
end_time
exit_code exit code from Moab
nodelist colon separated list of node hostnames used for the job
nodes_used
processors_used
wallclock_used
torque_wallclock_used
torque_exit_code exit code according to torque
cpu_used
mem_used
vmem_used
wallclock_req
feature_req
nodes_req
processors_req feature requests
interactive whether or not the job was run interactively
node_hours
app_name application category
app_name_predicted whether or not application category was predicted using a Naïve Bayes classifier
The "power" and "ganglia" objects contain time-series data from iLo http://www8.hp.com/us/en/products/servers/ilo/ and NWPerf https://github.com/EMSL-MSC/NWPerf/.
The format of the timeseries data is a list of hashes containing, timestamp (epoch time in microseconds), metric name, host and value, e.g.,:
{
"timestamp": 1461697084441,
"host": "n1668",
"value": 228,
"metric": "power"
}
Anonymized fields have been SHA1 hashed."
},
{
"@type": "dcat:Distribution",
"title": "Complete power and performance metric data for jobs run on the Peregrine system in April, 2015.
Format:
The data is available as a compressed JSON stream — one line per JSON object. Each object is a hash containing three elements, each of which is itself a hash:
“power”
“ganglia”
“job”
The “job” object contains metadata for the job including these fields:
id unique identifier for this job
user_name user running the job — anonymized
group_name group running the job — anonymized
account allocation the job is charged to — anonymized
queue the queue the job is submitted to: https://hpc.nrel.gov/users/systems/peregrine/running-jobs/new-queue-structure
submit_time
start_time
end_time
exit_code exit code from Moab
nodelist colon separated list of node hostnames used for the job
nodes_used
processors_used
wallclock_used
torque_wallclock_used
torque_exit_code exit code according to torque
cpu_used
mem_used
vmem_used
wallclock_req
feature_req
nodes_req
processors_req feature requests
interactive whether or not the job was run interactively
node_hours
app_name application category
app_name_predicted whether or not application category was predicted using a Naïve Bayes classifier.
The "power" and "ganglia" objects contain time-series data from iLo http://www8.hp.com/us/en/products/servers/ilo/ and NWPerf https://github.com/EMSL-MSC/NWPerf/.
The format of the timeseries data is a list of hashes containing, timestamp (epoch time in microseconds), metric name, host and value, e.g.,:
{
"timestamp": 1461697084441,
"host": "n1668",
"value": 228,
"metric": "power"
}
Anonymized fields have been SHA1 hashed.",
"accessURL": "https://data.nrel.gov/system/files/235/1712695887-april2015.anon.json.bz2",
"mediaType": "application/octet-stream",
"description": "Complete power and performance metric data for jobs run on the Peregrine system in April, 2015.
Format:
The data is available as a compressed JSON stream — one line per JSON object. Each object is a hash containing three elements, each of which is itself a hash:
“power”
“ganglia”
“job”
The “job” object contains metadata for the job including these fields:
id unique identifier for this job
user_name user running the job — anonymized
group_name group running the job — anonymized
account allocation the job is charged to — anonymized
queue the queue the job is submitted to: https://hpc.nrel.gov/users/systems/peregrine/running-jobs/new-queue-structure
submit_time
start_time
end_time
exit_code exit code from Moab
nodelist colon separated list of node hostnames used for the job
nodes_used
processors_used
wallclock_used
torque_wallclock_used
torque_exit_code exit code according to torque
cpu_used
mem_used
vmem_used
wallclock_req
feature_req
nodes_req
processors_req feature requests
interactive whether or not the job was run interactively
node_hours
app_name application category
app_name_predicted whether or not application category was predicted using a Naïve Bayes classifier.
The "power" and "ganglia" objects contain time-series data from iLo http://www8.hp.com/us/en/products/servers/ilo/ and NWPerf https://github.com/EMSL-MSC/NWPerf/.
The format of the timeseries data is a list of hashes containing, timestamp (epoch time in microseconds), metric name, host and value, e.g.,:
{
"timestamp": 1461697084441,
"host": "n1668",
"value": 228,
"metric": "power"
}
Anonymized fields have been SHA1 hashed."
}
]
|
| identifier | https://data.openei.org/submissions/8284 |
| issued | 2024-04-10T17:11:42Z |
| keyword |
[
"HPC",
"Peregrine",
"node usage",
"power",
"processors"
]
|
| landingPage | https://data.nrel.gov/submissions/235 |
| license | https://creativecommons.org/licenses/by/4.0/ |
| modified | 2025-01-16T18:10:23Z |
| programCode |
[
"019:000",
"019:023"
]
|
| projectNumber | |
| projectTitle | |
| publisher |
{
"name": "National Renewable Energy Laboratory",
"@type": "org:Organization"
}
|
| title | Peregrine HPC Energy Research |