Synthetic Healthcare Database for Research (SyH-DR)
The Agency for Healthcare Research and Quality (AHRQ) created SyH-DR from eligibility and claims files for Medicare, Medicaid, and commercial insurance plans in calendar year 2016. SyH-DR contains data from a nationally representative sample of insured individuals for the 2016 calendar year. SyH-DR uses synthetic data elements at the claim level to resemble the marginal distribution of the original data elements. SyH-DR person-level data elements are not synthetic, but identifying information is aggregated or masked.
Complete Metadata
| @type | dcat:Dataset |
|---|---|
| accessLevel | restricted public |
| bureauCode |
[
"009:00"
]
|
| contactPoint |
{
"fn": "SyH-DR help desk",
"@type": "vcard:Contact",
"hasEmail": "mailto:SyH-DR@ahrq.hhs.gov"
}
|
| description | The Agency for Healthcare Research and Quality (AHRQ) created SyH-DR from eligibility and claims files for Medicare, Medicaid, and commercial insurance plans in calendar year 2016. SyH-DR contains data from a nationally representative sample of insured individuals for the 2016 calendar year. SyH-DR uses synthetic data elements at the claim level to resemble the marginal distribution of the original data elements. SyH-DR person-level data elements are not synthetic, but identifying information is aggregated or masked. |
| distribution |
[
{
"@type": "dcat:Distribution",
"title": " Synthetic Healthcare Database for Research (SyH-DR)",
"mediaType": "text/html",
"description": "The Synthetic Healthcare Database for Research (SyH-DR) is an all-payer, nationally representative claims database. The database consists of a sample of inpatient, outpatient, and prescription drug claims, including utilization, payment, and enrollment data, for people insured by Medicare, Medicaid, or commercial health insurance in 2016. AHRQ created SyH-DR, in part, as a resource to facilitate improvements to price and quality transparency in healthcare.
SyH-DR is a synthetic database that replicates the structure and statistical properties of the original claims data while protecting privacy and confidentiality of people and institutions. Synthetic data are created by statistically modeling or changing original data so that new values or data elements are generated while maintaining the original data's statistical properties. Additional steps, such as masking, are taken to reduce the risk of identifying people and institutions so that the data may be made publicly available to a broad community of researchers. AHRQ approval is required for access to SyH-DR. User must submit an application and data use agreement. ",
"downloadURL": "https://www.ahrq.gov/data/innovations/syh-dr.html"
}
]
|
| identifier | https://healthdata.gov/api/views/88gj-w5in |
| issued | 2023-09-15 |
| keyword |
[
"all payer claims",
"all payer claims database",
"apcd",
"linked commercial files",
"linked medicaid files",
"linked medicare files",
"synthetic data"
]
|
| landingPage | https://www.ahrq.gov/data/innovations/syh-dr.html |
| modified | 2023-09-15 |
| programCode |
[
"009:074"
]
|
| publisher |
{
"name": "Agency for Healthcare Research and Quality",
"@type": "org:Organization"
}
|
| rights | AHRQ approval is required for access to SyH-DR. To request access to SyH-DR, follow the steps included in the Getting Started Guide and submit the required application form and data use agreement. Completed applications will be reviewed by AHRQ. |
| spatial | United States |
| theme |
[
"Health"
]
|
| title | Synthetic Healthcare Database for Research (SyH-DR) |