Skip to main content
U.S. flag

An official website of the United States government

This site is currently in beta, and your feedback is helping shape its ongoing development.

Data from: Impacts of gene duplication in the evolution of symbiotic root nodule symbiosis

Published by Agricultural Research Service | Department of Agriculture | Metadata Last Checked: January 27, 2026 | Last Modified: 2025-09-29
The emerging consensus regarding the origin of root nodule symbiosis (RNS), based on modeling of trait gain and loss across approximately 13,000 species within the “nitrogen-fixing clade” in the rosid group, is that the trait has arisen multiple times, probably semi-independently, and has also been lost repeatedly. Evolution of a new organ and functions involves many thousands of genes; but the evolutionary histories for many of these genes may be uninformative regarding RNS evolution. A portion of the genes, however, are likely to be derived from prior gene duplications and to have acquired new functions or to have come under new regulatory patterns. Whole genome duplications (WGDs) could conceivably enable the necessary neo- or sub-functionalization for new roles in the nodule. All species that exhibit RNS share a history of several ancient WGDs; but the last such common WGD for these species was the “gamma” paleohexaploidy that occurred early in the core eudicot lineage, ~120 million years ago (Mya). This presents a puzzle: if RNS didn’t originate until ~60-80 Mya, within the respective families exhibiting RNS, what explains the long quiescent period (~40-60 million years) and the many eudicot lineages without RNS? This study focuses on a collection of gene families with additional independent WGDs that appear to have occurred in the interim period, after the gamma triplication and prior to the evolution of RNS, identifying several that are both essential for RNS and that show evidence of critical roles of both ancient WGDs and more recent local duplications. The data in this repository includes gene families for the legumes and allied species (some with RNS, some without), that has been used in associated manuscript to trace the origin of a collection of genes involved in RNS.

Find Related Datasets

Click any tag below to search for similar datasets

data.gov

An official website of the GSA's Technology Transformation Services

Looking for U.S. government information and services?
Visit USA.gov