Groundwater data, predictor variables, and rasters used for predicting the probability of high arsenic and high manganese in the Glacial Aquifer System, northern continental United States
This data release contains input data used in model development and TIF raster files used to predict the probability of high arsenic (As) and high manganese (Mn) in groundwater within the glacial aquifer system in the northern United States. Input data include measured As and Mn concentrations at groundwater wells, and associated predictor variable data. The probability of high As and high Mn was predicted using boosted regression tree methods using the gbm package in R version 4.0.0. The response variables for individual models were the occurrence of: (1) As >10 µg/L, and (2) Mn >300 µg/L. Water-quality data were compiled from three sources, as described in Wilson and others (2019): a compilation of data from numerous agencies and organizations at the state, regional, and local level; the U.S. Geological Survey National Water Information System; and the U.S. Environmental Protection Agency Safe Drinking Water Information System. The resultant dataset consisted of 10,001 As and 14,565 Mn measurements across the study area. A total of 108 predictor variables were originally considered for model development which included well characteristics, soil properties, aquifer properties, predicted nitrate, hydrologic position on the landscape, groundwater age, predicted pH, and predicted anoxic conditions. After model refinement, a total of 79 and 55 predictor variables were used for predicting the probability of high As and high Mn, respectively. The probability of high As and high Mn was predicted at two depths representative of public and domestic drinking water supply depths at a resolution of 1 km across the glacial aquifer.
Complete Metadata
| accessLevel | public |
|---|---|
| bureauCode |
[
"010:12"
]
|
| contactPoint |
{
"fn": "Sarah M Elliott",
"@type": "vcard:Contact",
"hasEmail": "mailto:selliott@usgs.gov"
}
|
| description | This data release contains input data used in model development and TIF raster files used to predict the probability of high arsenic (As) and high manganese (Mn) in groundwater within the glacial aquifer system in the northern United States. Input data include measured As and Mn concentrations at groundwater wells, and associated predictor variable data. The probability of high As and high Mn was predicted using boosted regression tree methods using the gbm package in R version 4.0.0. The response variables for individual models were the occurrence of: (1) As >10 µg/L, and (2) Mn >300 µg/L. Water-quality data were compiled from three sources, as described in Wilson and others (2019): a compilation of data from numerous agencies and organizations at the state, regional, and local level; the U.S. Geological Survey National Water Information System; and the U.S. Environmental Protection Agency Safe Drinking Water Information System. The resultant dataset consisted of 10,001 As and 14,565 Mn measurements across the study area. A total of 108 predictor variables were originally considered for model development which included well characteristics, soil properties, aquifer properties, predicted nitrate, hydrologic position on the landscape, groundwater age, predicted pH, and predicted anoxic conditions. After model refinement, a total of 79 and 55 predictor variables were used for predicting the probability of high As and high Mn, respectively. The probability of high As and high Mn was predicted at two depths representative of public and domestic drinking water supply depths at a resolution of 1 km across the glacial aquifer. |
| distribution |
[
{
"@type": "dcat:Distribution",
"title": "Digital Data",
"format": "XML",
"accessURL": "https://doi.org/10.5066/P94FCZJ2",
"mediaType": "application/http",
"description": "Landing page for access to the data"
},
{
"@type": "dcat:Distribution",
"title": "Original Metadata",
"format": "XML",
"mediaType": "text/xml",
"description": "The metadata original format",
"downloadURL": "https://data.usgs.gov/datacatalog/metadata/USGS.5f21cf8982cef313ed94004a.xml"
}
]
|
| identifier | http://datainventory.doi.gov/id/dataset/USGS_5f21cf8982cef313ed94004a |
| keyword |
[
"Connecticut",
"Idaho",
"Illinois",
"Indiana",
"Iowa",
"Kansas",
"Maine",
"Massachusetts",
"Michigan",
"Minnesota",
"Missouri",
"Montana",
"NAWQA",
"Nebraska",
"New Hampshire",
"New Jersey",
"New York",
"North Dakota",
"Ohio",
"Pennsylvania",
"Rhode Island",
"South Dakota",
"USGS:5f21cf8982cef313ed94004a",
"Vermont",
"Washington",
"Wisconsin",
"aquifer system",
"arsenic",
"drinking water use",
"glacial aquifer system",
"groundwater",
"hydrogeology",
"manganese",
"water quality"
]
|
| modified | 2021-04-06T00:00:00Z |
| publisher |
{
"name": "U.S. Geological Survey",
"@type": "org:Organization"
}
|
| spatial | -124.7542, 35.0921, -65.3793, 51.5222 |
| theme |
[
"Geospatial"
]
|
| title | Groundwater data, predictor variables, and rasters used for predicting the probability of high arsenic and high manganese in the Glacial Aquifer System, northern continental United States |