Data to support Leveraging machine learning to automate regression model evaluations for large multi-site water-quality trend studies
This data release contains one dataset and one model archive in support of the journal article "Leveraging machine learning to automate regression model evaluations for large multi-site water-quality trend studies" by Jennifer C. Murphy and Jeffrey G. Chanat. The model archive contains scripts (run in R) to reproduce the four machine learning models (logistic regression, linear and quadratic discriminant analysis, and k-nearest neighbors) trained and tested as part of the journal article. The dataset contains the estimated probabilities for each of these models when applied to a training and test dataset.
Complete Metadata
| bureauCode |
[ "010:12" ] |
|---|---|
| identifier | http://datainventory.doi.gov/id/dataset/USGS_647a3349d34eac007b521f2d |
| spatial | -127.7930, 24.0465, -64.6875, 49.8380 |
| theme |
[ "Geospatial" ] |