Catchments and Variables Used for Random Forest Classification and Regression Groundwater and Surface Water Models for Nitrate Violation classes, Violation concentrations, or Percent of Systems in Violation for Public Drinking Water Supplies
* This research used public drinking water nitrate violation data from across the conterminous United States as response variables, with predictor variables drawn from EPA's StreamCat dataset and other land use, climate, and nitrogen (N) input data sources.
* All predictor and response variables were summarized for each catchment.
* Response variables were a binary variable (in violation or not, coded 1 or 0), the mean violation concentration (mg/L) per catchment, or the percent of public water systems in violation per catchment.
This dataset is associated with the following publication:
Pennino, M., S. Leibowitz, J. Compton, R. Hill, and R. Sabo. Patterns and predictions of drinking water nitrate violations across the conterminous United States. SCIENCE OF THE TOTAL ENVIRONMENT. Elsevier BV, AMSTERDAM, NETHERLANDS, 722: 137661, (2020).
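The three catchment-level response variables described above can be illustrated with a minimal sketch. The record layout (catchment ID, nitrate concentration in mg/L, violation flag) and the function name `summarize_by_catchment` are assumptions made for illustration; the dataset's actual column names are given in its data dictionary, and the authors' own aggregation procedure may differ.

```python
from collections import defaultdict


def summarize_by_catchment(records):
    """Aggregate per-system records into three catchment-level responses.

    `records` is an iterable of (catchment_id, nitrate_mg_per_l, in_violation)
    tuples. This layout is hypothetical, not the dataset's real schema.
    """
    grouped = defaultdict(list)
    for catchment_id, conc, violated in records:
        grouped[catchment_id].append((conc, violated))

    summary = {}
    for catchment_id, systems in grouped.items():
        # Concentrations of only those systems flagged as in violation.
        viol_concs = [conc for conc, violated in systems if violated]
        summary[catchment_id] = {
            # Binary response: 1 if any system in the catchment is in violation.
            "violation": 1 if viol_concs else 0,
            # Mean violation concentration (mg/L); undefined without violations.
            "mean_violation_conc": (
                sum(viol_concs) / len(viol_concs) if viol_concs else None
            ),
            # Percent of public water systems in violation.
            "percent_in_violation": 100.0 * len(viol_concs) / len(systems),
        }
    return summary


# Made-up example values, not taken from the dataset.
records = [
    ("A", 12.0, True),
    ("A", 14.0, True),
    ("A", 3.0, False),
    ("A", 5.0, False),
    ("B", 2.0, False),
]
summary = summarize_by_catchment(records)
```

Here catchment "A" has two of four systems in violation, so its binary response is 1 and its percent in violation is 50, while catchment "B" has no violations and therefore no mean violation concentration.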
Complete Metadata
| accessLevel | public |
|---|---|
| bureauCode | ["020:00"] |
| contactPoint | {"fn": "Michael Pennino", "hasEmail": "mailto:pennino.michael@epa.gov"} |
| describedBy | https://pasteur.epa.gov/uploads/10.23719/1503834/documents/Data_Dictionary_Pennino_NO3_Model_2019.xlsx |
| describedByType | application/vnd.openxmlformats-officedocument.spreadsheetml.sheet |
| description | * This research used public drinking water nitrate violation data from across the conterminous United States as response variables, with predictor variables drawn from EPA's StreamCat dataset and other land use, climate, and nitrogen (N) input data sources. * All predictor and response variables were summarized for each catchment. * Response variables were a binary variable (in violation or not, coded 1 or 0), the mean violation concentration (mg/L) per catchment, or the percent of public water systems in violation per catchment. This dataset is associated with the following publication: Pennino, M., S. Leibowitz, J. Compton, R. Hill, and R. Sabo. Patterns and predictions of drinking water nitrate violations across the conterminous United States. SCIENCE OF THE TOTAL ENVIRONMENT. Elsevier BV, AMSTERDAM, NETHERLANDS, 722: 137661, (2020). |
| distribution |
[
{
"title": "Metadata_Pennino_NO3_Model_2019.xlsx",
"mediaType": "application/vnd.openxmlformats-officedocument.spreadsheetml.sheet",
"downloadURL": "https://pasteur.epa.gov/uploads/10.23719/1503834/Metadata_Pennino_NO3_Model_2019.xlsx"
},
{
"title": "Observed_Violations.zip",
"mediaType": "application/x-zip-compressed",
"downloadURL": "https://pasteur.epa.gov/uploads/10.23719/1503834/Observed_Violations.zip"
},
{
"title": "RFC_GW_Predictions.zip",
"mediaType": "application/x-zip-compressed",
"downloadURL": "https://pasteur.epa.gov/uploads/10.23719/1503834/RFC_GW_Predictions.zip"
},
{
"title": "RFC_SW_Predictions.zip",
"mediaType": "application/x-zip-compressed",
"downloadURL": "https://pasteur.epa.gov/uploads/10.23719/1503834/RFC_SW_Predictions.zip"
},
{
"title": "RFR_GW_Predictions.zip",
"mediaType": "application/x-zip-compressed",
"downloadURL": "https://pasteur.epa.gov/uploads/10.23719/1503834/RFR_GW_Predictions.zip"
},
{
"title": "RFR_SW_Predictions.zip",
"mediaType": "application/x-zip-compressed",
"downloadURL": "https://pasteur.epa.gov/uploads/10.23719/1503834/RFR_SW_Predictions.zip"
},
{
"title": "RF_Catchments_Conc_All_Variables_GW.csv",
"mediaType": "application/vnd.ms-excel",
"downloadURL": "https://pasteur.epa.gov/uploads/10.23719/1503834/RF_Catchments_Conc_All_Variables_GW.csv"
},
{
"title": "RF_Catchments_Conc_All_Variables_SW.csv",
"mediaType": "application/vnd.ms-excel",
"downloadURL": "https://pasteur.epa.gov/uploads/10.23719/1503834/RF_Catchments_Conc_All_Variables_SW.csv"
},
{
"title": "RF_Catchments_Viol_Freq_All_Variables_GW.csv",
"mediaType": "application/vnd.ms-excel",
"downloadURL": "https://pasteur.epa.gov/uploads/10.23719/1503834/RF_Catchments_Viol_Freq_All_Variables_GW.csv"
},
{
"title": "RF_Catchments_Viol_Freq_All_Variables_SW.csv",
"mediaType": "application/vnd.ms-excel",
"downloadURL": "https://pasteur.epa.gov/uploads/10.23719/1503834/RF_Catchments_Viol_Freq_All_Variables_SW.csv"
},
{
"title": "RF_Catchments_Viol_Perc_All_Variables_GW.csv",
"mediaType": "application/vnd.ms-excel",
"downloadURL": "https://pasteur.epa.gov/uploads/10.23719/1503834/RF_Catchments_Viol_Perc_All_Variables_GW.csv"
},
{
"title": "RF_Catchments_Viol_Perc_All_Variables_SW.csv",
"mediaType": "application/vnd.ms-excel",
"downloadURL": "https://pasteur.epa.gov/uploads/10.23719/1503834/RF_Catchments_Viol_Perc_All_Variables_SW.csv"
}
]
|
| identifier | https://doi.org/10.23719/1503834 |
| keyword | ["contiguous United States", "drinking water", "groundwater", "machine learning", "nitrate", "random forest", "surface water"] |
| license | https://pasteur.epa.gov/license/sciencehub-license.html |
| modified | 2019-05-10 |
| programCode | ["020:096"] |
| publisher | {"name": "U.S. EPA Office of Research and Development (ORD)", "subOrganizationOf": {"name": "U.S. Environmental Protection Agency", "subOrganizationOf": {"name": "U.S. Government"}}} |
| references | ["https://doi.org/10.1016/j.scitotenv.2020.137661"] |
| rights | null |
| title | Catchments and Variables Used for Random Forest Classification and Regression Groundwater and Surface Water Models for Nitrate Violation classes, Violation concentrations, or Percent of Systems in Violation for Public Drinking Water Supplies |
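The `distribution` entries in the metadata above can be grouped by water source before downloading. The sketch below assumes that a `GW` or `SW` token in a file title marks groundwater or surface water respectively, an interpretation inferred from the dataset title rather than stated in the metadata. The helper names `water_source` and `group_by_source` are hypothetical, and only a subset of the entries is reproduced.

```python
import json

# Subset of the "distribution" array from the metadata record above.
distribution_json = """
[
  {"title": "Observed_Violations.zip",
   "downloadURL": "https://pasteur.epa.gov/uploads/10.23719/1503834/Observed_Violations.zip"},
  {"title": "RFC_GW_Predictions.zip",
   "downloadURL": "https://pasteur.epa.gov/uploads/10.23719/1503834/RFC_GW_Predictions.zip"},
  {"title": "RFR_SW_Predictions.zip",
   "downloadURL": "https://pasteur.epa.gov/uploads/10.23719/1503834/RFR_SW_Predictions.zip"},
  {"title": "RF_Catchments_Conc_All_Variables_GW.csv",
   "downloadURL": "https://pasteur.epa.gov/uploads/10.23719/1503834/RF_Catchments_Conc_All_Variables_GW.csv"}
]
"""


def water_source(title):
    """Classify a file title by its GW/SW token (an assumed naming convention)."""
    stem = title.rsplit(".", 1)[0]  # drop the file extension
    tokens = stem.split("_")
    if "GW" in tokens:
        return "groundwater"
    if "SW" in tokens:
        return "surface water"
    return "unspecified"


def group_by_source(entries):
    """Map each water source label to the download URLs of its files."""
    groups = {}
    for entry in entries:
        label = water_source(entry["title"])
        groups.setdefault(label, []).append(entry["downloadURL"])
    return groups


entries = json.loads(distribution_json)
groups = group_by_source(entries)
for label, urls in groups.items():
    print(label, len(urls))
```

Matching on whole underscore-separated tokens, rather than on substrings, avoids misclassifying a title that merely contains the letters "GW" or "SW" inside a longer word.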