In-house annotated gene set for the pecan weevil, <i>Curculio caryae</i>
This in-house annotated gene set was created using the following methods. RNA was isolated from the head and thorax segments of one adult male and one adult female pecan weevil using the NucleoMag RNA Kit (Macherey-Nagel, Düren, Germany, 744350.1) according to kit protocols. Isolated RNA was processed into PacBio Kinnex sequencing libraries using the Iso-Seq express 2.0 kit (Pacific Biosciences, Menlo Park, CA, USA, 103-071-500) and the Kinnex full-length RNA kit (Pacific Biosciences, Menlo Park, CA, USA, 103-072-000). The prepared library was bound and sequenced at the USDA-ARS Veterinary Pest Genetics Research Unit in Kerrville, Texas, on two Pacific Biosciences SMRT cell trays with a Revio system (Pacific Biosciences, Menlo Park, CA, USA, 102-202-200), beginning with a 2-h pre-extension followed by a 30-h movie collection time. After sequencing, circular consensus sequences were obtained from the PacBio Sequel Revio subreads using the SMRT Link v13.0 software. Reads were then mapped to the repeat-masked genome assembly using minimap2 with the preset for spliced nucleotide sequences (-ax splice:hq) to generate SAM mapping files. These were converted to compressed BAM files using samtools view -bS and used as input for gene model prediction with BRAKER version 3.0.8 (https://github.com/Gaius-Augustus/BRAKER), generating 72,879 gene models. These gene models and their predicted amino acid sequences were further curated and annotated with gene ontologies and protein domains using InterProScan-5.73-104.0 with the PANTHER-19.0 and Pfam-37.2 databases (https://github.com/ebi-pf-team/interproscan), resulting in 19,508 InterProScan results.
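The read-mapping and SAM-to-BAM steps described above can be sketched as follows. This is a minimal illustration and not the authors' actual script: the file names (`genome.masked.fa`, `ccs_reads.fastq`, `mapped`) and the helper names are hypothetical, and only the options stated in the description (`-ax splice:hq` and `view -bS`) are taken from the source. The samtools `-o` output option is an added assumption used to name the BAM file. The BRAKER and InterProScan steps are left out because the description does not give their command-line options.

```python
import shlex
import subprocess


def build_mapping_commands(genome_fasta, reads, out_prefix):
    """Build the minimap2 and samtools command lines as argument lists.

    All file names are placeholders supplied by the caller.
    """
    sam_path = f"{out_prefix}.sam"
    bam_path = f"{out_prefix}.bam"
    # Spliced alignment of high-quality long reads, SAM output on stdout.
    minimap2_cmd = ["minimap2", "-ax", "splice:hq", genome_fasta, reads]
    # Convert the SAM file to BAM; -o (assumed here) names the output file.
    samtools_cmd = ["samtools", "view", "-bS", "-o", bam_path, sam_path]
    return {
        "sam": sam_path,
        "bam": bam_path,
        "minimap2": minimap2_cmd,
        "samtools": samtools_cmd,
    }


def run_mapping(commands):
    """Run both steps; requires minimap2 and samtools on the PATH."""
    with open(commands["sam"], "w") as sam_out:
        subprocess.run(commands["minimap2"], stdout=sam_out, check=True)
    subprocess.run(commands["samtools"], check=True)


if __name__ == "__main__":
    # Hypothetical inputs; this only prints the commands, it does not run them.
    cmds = build_mapping_commands("genome.masked.fa", "ccs_reads.fastq", "mapped")
    print(shlex.join(cmds["minimap2"]), ">", cmds["sam"])
    print(shlex.join(cmds["samtools"]))
```

Calling `run_mapping(cmds)` would execute both tools in order; the resulting BAM file is the kind of input the description says was passed to BRAKER.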
Complete Metadata
| @type | dcat:Dataset |
|---|---|
| accessLevel | public |
| bureauCode | ["005:18"] |
| contactPoint | {"fn": "Perkin, Lindsey, C.", "hasEmail": "mailto:lindsey.perkin@usda.gov"} |
| description | <p dir="ltr">This in-house annotated gene set was created using the following methods. </p><p dir="ltr">RNA was isolated from the head and thorax segments of one adult male and one adult female pecan weevil using the NucleoMag RNA Kit (Macherey-Nagel, Düren, Germany, 744350.1) according to kit protocols. Isolated RNA was processed into PacBio Kinnex sequencing libraries using the Iso-Seq express 2.0 kit (Pacific Biosciences, Menlo Park, CA, USA, 103-071-500) and the Kinnex full-length RNA kit (Pacific Biosciences, Menlo Park, CA, USA, 103-072-000). The prepared library was bound and sequenced at the USDA-ARS Veterinary Pest Genetics Research Unit in Kerrville, Texas, on two Pacific Biosciences SMRT cell trays with a Revio system (Pacific Biosciences, Menlo Park, CA, USA, 102-202-200), beginning with a 2-h pre-extension followed by a 30-h movie collection time. After sequencing, circular consensus sequences were obtained from the PacBio Sequel Revio subreads using the SMRT Link v13.0 software. Reads were then mapped to the repeat-masked genome assembly using minimap2 with the preset for spliced nucleotide sequences (<i>-ax splice:hq</i>) to generate SAM mapping files. These were converted to compressed BAM files using samtools view -bS and used as input for gene model prediction with BRAKER version 3.0.8 (<a href="https://github.com/Gaius-Augustus/BRAKER" target="_blank">https://github.com/Gaius-Augustus/BRAKER</a>), generating 72,879 gene models. These gene models and their predicted amino acid sequences were further curated and annotated with gene ontologies and protein domains using InterProScan-5.73-104.0 with the PANTHER-19.0 and Pfam-37.2 databases (<a href="https://github.com/ebi-pf-team/interproscan" target="_blank">https://github.com/ebi-pf-team/interproscan</a>), resulting in 19,508 InterProScan results.</p> |
| distribution | [{"@type": "dcat:Distribution", "title": "curated_genes.fasta", "format": "fasta", "mediaType": "text/plain", "downloadURL": "https://ndownloader.figshare.com/files/58362277"}] |
| identifier | 10.15482/USDA.ADC/30234490.v1 |
| keyword | ["gene annotation", "pecan weevil"] |
| license | https://creativecommons.org/publicdomain/zero/1.0/ |
| modified | 2025-12-04 |
| programCode | ["005:040"] |
| publisher | {"name": "Agricultural Research Service", "@type": "org:Organization"} |
| temporal | 2023-09-19/2025-09-29 |
| title | In-house annotated gene set for the pecan weevil, <i>Curculio caryae</i> |
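
The distribution listed in the metadata is a single FASTA file, `curated_genes.fasta`. As a rough sketch of how such a file could be inspected after download, the snippet below counts records and total sequence length using only the Python standard library. The function names are hypothetical, the two example records are made up for illustration, and no assumption is made about whether the real file holds nucleotide or protein sequences.

```python
import io
from typing import Iterator, TextIO, Tuple


def read_fasta(handle: TextIO) -> Iterator[Tuple[str, str]]:
    """Yield (header, sequence) pairs from a FASTA-formatted text handle."""
    header = None
    chunks = []
    for line in handle:
        line = line.strip()
        if not line:
            continue  # skip blank lines
        if line.startswith(">"):
            # A new header closes the previous record, if there was one.
            if header is not None:
                yield header, "".join(chunks)
            header = line[1:]
            chunks = []
        elif header is not None:
            chunks.append(line)
    if header is not None:
        yield header, "".join(chunks)


def summarize_fasta(handle: TextIO) -> Tuple[int, int]:
    """Return (number of records, total sequence length)."""
    count = 0
    total_length = 0
    for _, sequence in read_fasta(handle):
        count += 1
        total_length += len(sequence)
    return count, total_length


if __name__ == "__main__":
    # Made-up records for illustration only; not taken from the dataset.
    example = io.StringIO(">gene1\nACGTAC\nGT\n>gene2\nGGCC\n")
    records, length = summarize_fasta(example)
    print(f"{records} records, {length} residues in total")
```

For the downloaded file, the same function could be applied with `with open("curated_genes.fasta") as fh: summarize_fasta(fh)`.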