- Data Note
- Open access
- Published:
De Novo transcriptome assembly and annotation of the wild service tree Sorbus torminalis (L.) Crantz
BMC Research Notes volume 18, Article number: 139 (2025)
Abstract
Objectives
The wild service tree, Sorbus torminalis (L.) Crantz, is a sub-Mediterranean forest tree species native to Europe, Asia Minor, Caucasus and north-western Africa. Despite its ecological and societal importance, publicly available molecular resources for this species are scarce, limited to the chloroplast genome. Within the European Project FORGENIUS (Improving access to FORest GENetic resources Information and services for end-Users, Grant Agreement ID: 862221) we assembled the transcriptome of the wild service tree with the aim to use it as reference to develop new genetic markers and perform a large-scale population genetics study in this species.
Data description
An RNA-seq library was generated from pooled RNA extracted from four different tissues (young leaves, mature leaves, stems and roots) of a single S. torminalis sapling and sequenced on the Illumina HiSeq 2500 platform. This dataset was used to produce and annotate the first transcriptome assembly for S. torminalis, paving the way for future genomic research and applications in this species. For example, this study will facilitate the development of molecular markers such as SSRs and SNPs, useful for genetic studies, breeding programs, and conservation efforts and will serve as a reference for future differential expression studies, providing a deeper understanding of the species’ biology.
Objective
Rare and scattered forest tree species are important components of forest biodiversity and represent a key option for diversifying simplified forest ecosystems and eventually increasing their resilience. It is often difficult to monitor the adaptive potential of these species because, contrary to more widespread, ecologically and economically relevant forest tree species, the resources available for their genomic characterization are scarce. This is the case for the wild service tree Sorbus torminalis (L.) Crantz (Rosaceae), a diploid (2n = 34), insect-pollinated and animal-dispersed species native to Europe, Asia Minor, Caucasus and north-western Africa. S. torminalis has historically been used in various applications in the fields of medicine and nutrition and produces durable wood of high economic value [1, 2]. To date, most of the genetic studies on this species are based on a limited number of microsatellite markers [3,4,5] and, more recently, the complete chloroplast genome sequence of the species was released [6]. In the framework of the European Project FORGENIUS (Improving access to FORest GENetic resources Information and services for end-Users, Grant Agreement ID: 862221), we assembled and annotated for the first time the transcriptome of the wild service tree. Transcriptome assembly will serve as a reference for developing a new genotyping tool for S. torminalis using Single Primer Enrichment Technology (SPET), a recently developed targeted genotyping-by-sequencing technology [7]. This genotyping tool will enable to explore the distribution of genetic diversity at a genome-wide scale in this species, allowing forest geneticists to further investigate the adaptive responses of the species to environmental changes.
Data description
The RNA sequencing data used to construct the transcriptome assembly of Sorbus torminalis were generated from total RNA extracted from four different tissues (young leaves, mature leaves, stems and roots) of a single S. torminalis sapling provided by the public nursery of Veneto Agricoltura (Montecchio Precalcino, VI, Italy). The RNA from the different tissues was pooled in equimolar amounts and used to prepare the RNA-Seq libraries that have been sequenced on the Illumina HiSeq 2500 platform as 250 bp paired-end reads. The raw Illumina RNA-seq data were deposited in the NCBI Sequence Read Archive (SRA) database with ID SRP489964 (Table 1, Data file 1). The methodological details of the RNA sequencing and bioinformatics analysis are reported in Data File 2 (Table 1). The quality of the RNA-seq data was evaluated using FastQC v0.11.3 before and after the trimming step (Cutadapt v1.16 [8] and ERNE_FILTER v1.4.6 [9]). The quality control metrics of the trimmed reads were aggregated using MultiQC v1.6 [10]. Data File 3 (Table 1) shows the distribution of the mean quality scores at each read position and the mean read counts per quality score for both first-in-pair and second-in-pair reads. All positions in first-in-pair reads and almost all positions in second-in-pair reads were characterized by an average Phred score of at least 33. A total of 244,607,284 clean read pairs (approximately 91% of the raw reads) were retained after quality trimming and filtering.
The filtered reads were assembled into 71,309 unigenes with an average length of 543.1 bp and N50 of 658 bp using the de novo assembler Trinity v2.15.1 [11]. The transcriptome assembly has been deposited in the Transcriptome Shotgun Assembly database of NCBI under accession number GKRU00000000 (Data file 4, Table 1). Using TransDecoder v5.0.1 (https://github.com/TransDecoder/TransDecoder), a total of 28,662 open reading frames (ORFs) were predicted, of which 6,737 had both start and stop codons, while the remaining ORFs contained only partial coding regions (Data files 5 and 6, Table 1). Functional annotation of the unigenes was performed using BLAST2GO version V.6.0.3 [12] on BLASTx [13] results against the Viridiplantae section of the GenBank non-redundant protein database. Approximately 73% of the unigenes had a BLASTx hit to proteins included in the database. Most of them matched proteins belonging to the Malus or Pyrus genera (Data file 7, Table 1). Using Blast2GO, we assigned a total of 162,340 gene ontology (GO) terms to 43,240 of the 52,129 unigenes for which we obtained a BLASTx match. Most of the assignments (69,523; 42.8%) belonged to the ‘Molecular Function’ category, followed by those belonging to the ‘Biological Process’ (52,926; 32.6%) and ‘Cellular Component’ (39,891; 24.6%) categories. A total of 78,905 Enzyme Codes (EC) were assigned to 21,518 different unigenes. The distribution of the most frequent gene ontology (GO) terms in the three different GO categories and of the most represented EC is shown in Data files 8 and 9 (Table 1) respectively, while the complete annotation is available in Data file 10 (Table 1).
Limitations
The transcriptome assembly was obtained from RNA extracted from four tissues at a single time point in a single individual, and therefore it does not comprise an exhaustive set of all transcripts of this species but only those constitutively expressed under such specific conditions.
Data availability
The data described in this Data Note can be freely and openly accessed in the NCBI and figshare databases. The raw reads have been deposited in the SRA database of NCBI under accession number SRP489964. The assembled transcriptome has been deposited in the Transcriptome Shotgun Assembly database of NCBI under accession number GKRU00000000. The other data files generated in the current study are available in the figshare database. Please see Table 1 and references [14-23] for details and links to the data.
Abbreviations
- NCBI:
-
National Center for Biotechnology Information
- SRA:
-
Sequence Read Archive
- GO:
-
Gene Ontology
- EC:
-
Enzyme Codes
References
Ak G, Tüfekci EF, Mustafa AM, Caprioli G, Altunoglu YC, Baloglu MC, et al. Exploring Sorbus torminalis leaves: unveiling a promising natural resource for diverse chemical and biological applications. Chem Biodivers. 2024;21:e202301596.
Eriksson G. Conservation of noble hardwoods in Europe. https://doiorg.publicaciones.saludcastillayleon.es/101139/x00-134. 2011;31:577–87.
Oddou-Muratorio S, Houot ML, Demesure-Musch B, Austerlitz F. Pollen flow in the wildservice tree, Sorbus torminalis (L.) Crantz. I. Evaluating the paternity analysis procedure in continuous populations. Mol Ecol. 2003;12:3427–39.
Jankowska-Wroblewska S, Meyza K, Sztupecka E, Kubera L, Burczyk J. Clonal structure and high genetic diversity at peripheral populations of Sorbus torminalis (L.) Crantz. iForest - Biogeosciences for. 2016;9:892.
Kavaliauskas D, Šeho M, Baier R, Fussi B. Genetic variability to assist in the delineation of provenance regions and selection of seed stands and gene conservation units of wild service tree (Sorbus torminalis (L.) Crantz) in Southern Germany. Eur J Res. 2021;140:551–65.
Ulaszewski B, Sandurska E, Sztupecka E, Burczyk J. The complete Chloroplast genome sequence of wild service tree Sorbus torminalis (L.) Crantz. Conserv Genet Resour. 2017;9:419–22.
Scaglione D, Pinosio S, Marroni F, Centa E, Di, Fornasiero A, Magris G, et al. Single primer enrichment technology as a tool for massive genotyping: a benchmark on black Poplar and maize. Ann Bot. 2019;124:543–51.
Martin M. Cutadapt removes adapter sequences from high-throughput sequencing reads. EMBnet J. 2011;17:10–2.
Del Fabbro C, Scalabrin S, Morgante M, Giorgi FM. An extensive evaluation of read trimming effects on illumina NGS data analysis. PLoS ONE. 2013;8:e85024.
Ewels P, Magnusson M, Lundin S, Käller M. MultiQC: summarize analysis results for multiple tools and samples in a single report. Bioinformatics. 2016;32:3047–8.
Grabherr MG, Haas BJ, Yassour M, Levin JZ, Thompson DA, Amit I, et al. Trinity: reconstructing a full-length transcriptome without a genome from RNA-Seq data HHS public access author manuscript. Nat Biotechnol. 2013;29:644–52.
Gö Tz S, García-Gó Mez JM, Terol J, Williams TD, Nagaraj SH, Nueda MJ, et al. High-throughput functional annotation and data mining with the Blast2GO suite. Nucleic Acids Res. 2008;36:3420–35.
Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ. Basic local alignment search tool. J Mol Biol. 1990;215:403–10.
Pinosio S, Avanzi C, Bagnoli F, Spanu I, Vendramin GG, Piotti A. De Novo transcriptome assembly and annotation of the wild service tree Sorbus torminalis (L.) Crantz. NCBI; 2024. https://identifiers.org/ncbi/insdc.sra:SRP489964.
Pinosio S, Avanzi C, Bagnoli F, Spanu I, Vendramin GG, Piotti A. De Novo transcriptome assembly and annotation of the wild service tree Sorbus torminalis (L.) Crantz. Figshare. 2024. https://doiorg.publicaciones.saludcastillayleon.es/10.6084/m9.figshare.26021323
Pinosio S, Avanzi C, Bagnoli F, Spanu I, Vendramin GG, Piotti A. De Novo transcriptome assembly and annotation of the wild service tree Sorbus torminalis (L.) Crantz. Figshare. 2024. https://doiorg.publicaciones.saludcastillayleon.es/10.6084/m9.figshare.26021386
Pinosio S, Avanzi C, Bagnoli F, Spanu I, Vendramin GG, Piotti A. De Novo transcriptome assembly and annotation of the wild service tree Sorbus torminalis (L.) Crantz. NCBI. 2024 https://identifiers.org/ncbi/insdc:GKRU00000000.1
Pinosio S, Avanzi C, Bagnoli F, Spanu I, Vendramin GG, Piotti A. De Novo transcriptome assembly and annotation of the wild service tree Sorbus torminalis (L.) Crantz. Figshare. 2024. https://doiorg.publicaciones.saludcastillayleon.es/10.6084/m9.figshare.26021413
Pinosio S, Avanzi C, Bagnoli F, Spanu I, Vendramin GG, Piotti A. De Novo transcriptome assembly and annotation of the wild service tree Sorbus torminalis (L.) Crantz. Figshare. 2024. https://doiorg.publicaciones.saludcastillayleon.es/10.6084/m9.figshare.26021464
Pinosio S, Avanzi C, Bagnoli F, Spanu I, Vendramin GG, Piotti A. De Novo transcriptome assembly and annotation of the wild service tree Sorbus torminalis (L.) Crantz. Figshare. 2024. https://doiorg.publicaciones.saludcastillayleon.es/10.6084/m9.figshare.26021485
Pinosio S, Avanzi C, Bagnoli F, Spanu I, Vendramin GG, Piotti A. De Novo transcriptome assembly and annotation of the wild service tree Sorbus torminalis (L.) Crantz. Figshare. 2024. https://doiorg.publicaciones.saludcastillayleon.es/10.6084/m9.figshare.26021503
Pinosio S, Avanzi C, Bagnoli F, Spanu I, Vendramin GG, Piotti A. De Novo transcriptome assembly and annotation of the wild service tree Sorbus torminalis (L.) Crantz Figshare. 2024. https://doiorg.publicaciones.saludcastillayleon.es/10.6084/m9.figshare.26021515
Pinosio S, Avanzi C, Bagnoli F, Spanu I, Vendramin GG, Piotti A. De Novo transcriptome assembly and annotation of the wild service tree Sorbus torminalis (L.) Crantz. Figshare. 2024. https://doiorg.publicaciones.saludcastillayleon.es/10.6084/m9.figshare.26021575
Acknowledgements
We acknowledge Roberto Fiorentin and Francesco Pernigotto (Veneto Agricoltura, https://www.venetoagricoltura.org/2007/01/sedi/centro-vivaistico-e-per-le-attivita-fuori-foresta-montecchio-precalcino-vi/) for providing the Sorbs torminalis sapling used for RNA extraction.u
Funding
The study was funded by the EU-H2020 project FORGENIUS (Grant Agreement 862221).
Author information
Authors and Affiliations
Contributions
S.P.: Conceptualization, Methodology, Data curation, Writing - Original Draft. C.A.: Investigation, Writing - review & editing. F.B.: Investigation, Writing - review & editing., I.S.: Investigation, Writing - review & editing. G.G.V.: Funding acquisition, Supervision, Writing - review & editing. A.P.: Conceptualization, Investigation, Supervision, Writing - Original Draft.
Corresponding author
Ethics declarations
Ethics approval and consent to participate
Plant material was provided by the public nursery of Veneto Agricoltura (Montecchio Precalcino, VI, Italy; https://www.venetoagricoltura.org/). Veneto Agricoltura is formally involved in the FORGENIUS project as a stakeholder, and all the material is produced in accordance with local legislation.
Consent for publication
Not applicable.
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.
About this article
Cite this article
Pinosio, S., Avanzi, C., Bagnoli, F. et al. De Novo transcriptome assembly and annotation of the wild service tree Sorbus torminalis (L.) Crantz. BMC Res Notes 18, 139 (2025). https://doiorg.publicaciones.saludcastillayleon.es/10.1186/s13104-025-07163-w
Received:
Accepted:
Published:
DOI: https://doiorg.publicaciones.saludcastillayleon.es/10.1186/s13104-025-07163-w