Data from: Impacts of gene duplication in the evolution of symbiotic root nodule symbiosis
The emerging consensus regarding the origin of root nodule symbiosis (RNS), based on modeling of trait gain and loss across approximately 13,000 species within the “nitrogen-fixing clade” in the rosid group, is that the trait has arisen multiple times, probably semi-independently, and has also been lost repeatedly. Evolution of a new organ and functions involves many thousands of genes; but the evolutionary histories for many of these genes may be uninformative regarding RNS evolution. A portion of the genes, however, are likely to be derived from prior gene duplications and to have acquired new functions or to have come under new regulatory patterns. Whole genome duplications (WGDs) could conceivably enable the necessary neo- or sub-functionalization for new roles in the nodule. All species that exhibit RNS share a history of several ancient WGDs; but the last such common WGD for these species was the “gamma” paleohexaploidy that occurred early in the core eudicot lineage, ~120 million years ago (Mya). This presents a puzzle: if RNS didn’t originate until ~60-80 Mya, within the respective families exhibiting RNS, what explains the long quiescent period (~40-60 million years) and the many eudicot lineages without RNS? This study focuses on a collection of gene families with additional independent WGDs that appear to have occurred in the interim period, after the gamma triplication and prior to the evolution of RNS, identifying several that are both essential for RNS and that show evidence of critical roles of both ancient WGDs and more recent local duplications. The data in this repository includes gene families for the legumes and allied species (some with RNS, some without), that has been used in associated manuscript to trace the origin of a collection of genes involved in RNS.
Complete Metadata
| @type | dcat:Dataset |
|---|---|
| accessLevel | public |
| bureauCode |
[
"005:18"
]
|
| contactPoint |
{
"fn": "Cannon, Steven B.",
"hasEmail": "mailto:steven.cannon@usda.gov"
}
|
| description | <p dir="ltr">The emerging consensus regarding the origin of root nodule symbiosis (RNS), based on modeling of trait gain and loss across approximately 13,000 species within the “nitrogen-fixing clade” in the rosid group, is that the trait has arisen multiple times, probably semi-independently, and has also been lost repeatedly. Evolution of a new organ and functions involves many thousands of genes; but the evolutionary histories for many of these genes may be uninformative regarding RNS evolution. A portion of the genes, however, are likely to be derived from prior gene duplications and to have acquired new functions or to have come under new regulatory patterns. Whole genome duplications (WGDs) could conceivably enable the necessary neo- or sub-functionalization for new roles in the nodule. All species that exhibit RNS share a history of several ancient WGDs; but the last such common WGD for these species was the “gamma” paleohexaploidy that occurred early in the core eudicot lineage, ~120 million years ago (Mya). This presents a puzzle: if RNS didn’t originate until ~60-80 Mya, within the respective families exhibiting RNS, what explains the long quiescent period (~40-60 million years) and the many eudicot lineages without RNS? This study focuses on a collection of gene families with additional independent WGDs that appear to have occurred in the interim period, after the gamma triplication and prior to the evolution of RNS, identifying several that are both essential for RNS and that show evidence of critical roles of both ancient WGDs and more recent local duplications. The data in this repository includes gene families for the legumes and allied species (some with RNS, some without), that has been used in associated manuscript to trace the origin of a collection of genes involved in RNS.</p> |
| distribution |
[
{
"@type": "dcat:Distribution",
"title": "LegSF.fam3.W6TK.sup1A_fam3_correspondence.hsh.tsv.gz",
"format": "gz",
"mediaType": "application/gzip",
"downloadURL": "https://ndownloader.figshare.com/files/58019530"
},
{
"@type": "dcat:Distribution",
"title": "README.LegSF.fam3.W6TK.yml",
"format": "yml",
"mediaType": "text/plain",
"downloadURL": "https://ndownloader.figshare.com/files/58019533"
},
{
"@type": "dcat:Distribution",
"title": "LegSF.fam3.W6TK.sup1A_stats.txt",
"format": "txt",
"mediaType": "text/plain",
"downloadURL": "https://ndownloader.figshare.com/files/58019536"
},
{
"@type": "dcat:Distribution",
"title": "LegSF.fam3.W6TK.sup1A_counts.tsv.gz",
"format": "gz",
"mediaType": "application/gzip",
"downloadURL": "https://ndownloader.figshare.com/files/58019539"
},
{
"@type": "dcat:Distribution",
"title": "LegSF.fam3.W6TK.sup1B_trees.tar.gz",
"format": "gz",
"mediaType": "application/x-gzip",
"downloadURL": "https://ndownloader.figshare.com/files/58019542"
},
{
"@type": "dcat:Distribution",
"title": "LegSF.fam3.W6TK.sup1A_clust.tsv.gz",
"format": "gz",
"mediaType": "application/gzip",
"downloadURL": "https://ndownloader.figshare.com/files/58019545"
},
{
"@type": "dcat:Distribution",
"title": "MANIFEST.LegSF.fam3.W6TK.descriptions.yml",
"format": "yml",
"mediaType": "text/plain",
"downloadURL": "https://ndownloader.figshare.com/files/58019548"
},
{
"@type": "dcat:Distribution",
"title": "LegSF.fam3.W6TK.sup1A_fam3_correspondence.clust.tsv.gz",
"format": "gz",
"mediaType": "application/gzip",
"downloadURL": "https://ndownloader.figshare.com/files/58019551"
},
{
"@type": "dcat:Distribution",
"title": "CHECKSUM.LegSF.fam3.W6TK.md5",
"format": "md5",
"mediaType": "text/plain",
"downloadURL": "https://ndownloader.figshare.com/files/58019554"
},
{
"@type": "dcat:Distribution",
"title": "MANIFEST.LegSF.fam3.W6TK.correspondence.yml",
"format": "yml",
"mediaType": "text/plain",
"downloadURL": "https://ndownloader.figshare.com/files/58019557"
},
{
"@type": "dcat:Distribution",
"title": "LegSF.fam3.W6TK.sup1B_hmmalign_trim.tar.gz",
"format": "gz",
"mediaType": "application/x-gzip",
"downloadURL": "https://ndownloader.figshare.com/files/58019560"
},
{
"@type": "dcat:Distribution",
"title": "LegSF.fam3.W6TK.sup1A_table.tsv.gz",
"format": "gz",
"mediaType": "application/gzip",
"downloadURL": "https://ndownloader.figshare.com/files/58019563"
},
{
"@type": "dcat:Distribution",
"title": "LegSF.fam3.W6TK.sup1A_hsh.tsv.gz",
"format": "gz",
"mediaType": "application/gzip",
"downloadURL": "https://ndownloader.figshare.com/files/58019566"
},
{
"@type": "dcat:Distribution",
"title": "LegSF.fam3.W6TK.sup1B_hmmalign.tar.gz",
"format": "gz",
"mediaType": "application/x-gzip",
"downloadURL": "https://ndownloader.figshare.com/files/58019569"
},
{
"@type": "dcat:Distribution",
"title": "LegSF.fam3.W6TK.sup1B_proteomes.tar.gz",
"format": "gz",
"mediaType": "application/x-gzip",
"downloadURL": "https://ndownloader.figshare.com/files/58019572"
},
{
"@type": "dcat:Distribution",
"title": "LegSF.fam3.W6TK.sup1B_hmm.tar.gz",
"format": "gz",
"mediaType": "application/x-gzip",
"downloadURL": "https://ndownloader.figshare.com/files/58019575"
},
{
"@type": "dcat:Distribution",
"title": "Relative_heatmaps_PDF.tar.gz",
"format": "gz",
"mediaType": "application/x-gzip",
"downloadURL": "https://ndownloader.figshare.com/files/58019872"
},
{
"@type": "dcat:Distribution",
"title": "Relative_heatmaps_PNG.tar.gz",
"format": "gz",
"mediaType": "application/x-gzip",
"downloadURL": "https://ndownloader.figshare.com/files/58019875"
},
{
"@type": "dcat:Distribution",
"title": "species_tree.pdf",
"format": "pdf",
"mediaType": "application/pdf",
"downloadURL": "https://ndownloader.figshare.com/files/58020268"
},
{
"@type": "dcat:Distribution",
"title": "species_tree.tre",
"format": "tre",
"mediaType": "text/plain",
"downloadURL": "https://ndownloader.figshare.com/files/58020304"
},
{
"@type": "dcat:Distribution",
"title": "methods_RNS_superfam_bash.txt",
"format": "txt",
"mediaType": "text/plain",
"downloadURL": "https://ndownloader.figshare.com/files/58020379"
},
{
"@type": "dcat:Distribution",
"title": "rns_families_counts_2025-09-19.xlsx",
"format": "xlsx",
"mediaType": "application/vnd.openxmlformats-officedocument.spreadsheetml.sheet",
"downloadURL": "https://ndownloader.figshare.com/files/58215010"
},
{
"@type": "dcat:Distribution",
"title": "rns_local_dups_2025-09-19.xlsx",
"format": "xlsx",
"mediaType": "application/vnd.openxmlformats-officedocument.spreadsheetml.sheet",
"downloadURL": "https://ndownloader.figshare.com/files/58215013"
},
{
"@type": "dcat:Distribution",
"title": "rns_pairwise_bias_results_09-19.xlsx",
"format": "xlsx",
"mediaType": "application/vnd.openxmlformats-officedocument.spreadsheetml.sheet",
"downloadURL": "https://ndownloader.figshare.com/files/58215016"
}
]
|
| identifier | 10.15482/USDA.ADC/30142387.v1 |
| keyword |
[
"Evolution",
"Gene expression",
"Gene families",
"Root Nodule Symbiosis",
"symbiotic nitrogen fixation (SNF)"
]
|
| license | https://creativecommons.org/publicdomain/zero/1.0/ |
| modified | 2025-09-29 |
| programCode |
[
"005:040"
]
|
| publisher |
{
"name": "Agricultural Research Service",
"@type": "org:Organization"
}
|
| temporal | 2025-09-23/2025-09-23 |
| title | Data from: Impacts of gene duplication in the evolution of symbiotic root nodule symbiosis |