Skip to main content
  • Research Note
  • Open access
  • Published:

Faecal sample storage without ethanol for up to 24 h followed by freezing performs better than storage with ethanol for shotgun metagenomic microbiome analysis in patients with inflammatory and non-inflammatory intestinal diseases and healthy controls

Abstract

Objective

The influence of different faecal collection methods on metagenomic analyses remains under discussion, and there is no general agreement on which collection method is preferable for gut microbiome research. We compared faecal samples collected in tubes without preservatives with those containing 10 mL of 96% ethanol for gut microbiome research when the timeframe from defecation to freezing at – 80 °C was up to 24 h. We aimed to compare the collection methods on faeces from participants with inflammatory and non-inflammatory gastrointestinal disorders and healthy controls to investigate the most suitable method when considering data yield, human fraction of sequencing reads, and ease of use. We also examined the faecal sample homogeneity.

Results

Faeces collected in tubes without preservatives resulted in more sequencing reads compared to faeces collected in tubes with 96% ethanol and were also easier to handle. The human fraction of total reads in faeces collected in ethanol from participants with inflammatory bowel disease was higher than all other samples. DNA extraction and sequencing from two different locations in the same faecal sample gave similar results and showed sample homogeneity.

Peer Review reports

Introduction

Multiple association studies have linked the gut microbiome composition to human health and various diseases including the gastrointestinal (GI) disorders irritable bowel syndrome (IBS) and Crohn’s disease (CD) [1,2,3,4]. With the advancement of shotgun metagenomics sequencing, mapping of the gut microbiome has become more accessible [5]. However, data on how different faecal collection methods influence metagenomic data and analyses is ambiguous [1, 2].

When conducting clinical trials that explore the association between GI diseases and the human gut microbiome, knowing the most effective approach is crucial. Methodological research plays an important role in ensuring the reliability and accuracy of such studies [1, 6]. Furthermore, a standardised faecal sample collection method should be feasible in clinical practice [7, 8]. Faecal sample collection without any preservatives and rapidly freezing at -80°C is considered the gold standard [1, 6, 7]. However, in most human clinical trials, immediate freezing or analysis of a faecal sample is not possible, and thus a need for both preservation and long-term storage may be required [7].

Ethanol is widely used as a preservative of biomaterial because of its ability to fixate DNA by inhibiting DNA-degrading enzymes [9, 10] and its low cost. This study aimed to determine the optimal faecal collection method (no preservative versus ethanol) in clinical gut microbiome research involving participants with various GI conditions where samples are stored for up to 24 h before freezing at – 80 °C. The ease of use for sample collection at home and laboratory handling was considered. A secondary aim was to test the homogeneity of the faecal samples as the macroscopic composition of the samples can vary greatly.

We focused on patients with IBS and CD. Knowing that CD is an inflammatory bowel disease (IBD), we sought to test the collecting methods on patients with a high level of faecal calprotectin, a sensitive marker for GI inflammation, to investigate how the two methods affect the amount of human DNA reads.

Methods

Study participants

Four patients with IBS, four patients with CD with an abnormal level of faecal calprotectin (above 50 mg/kg), and four healthy volunteers delivered faecal samples for this study. All participants with IBS and CD were adult patients originally recruited for clinical research projects at Copenhagen University Hospital Hvidovre. A family of four, without any known GI diseases, volunteered to participate as healthy controls.

The study setup is illustrated in Fig. 1.

Fig. 1
figure 1

Illustration of the study setup. Faecal samples were continuously collected, frozen, and analysed

Faecal specimen collection

Each study participant delivered a set of faecal samples consisting of one tube without preservatives and one tube added 10 mL of 96% ethanol. Both samples were collected from the same stool. The consistency of the samples varied from loose to solid. Participants were asked to collect the samples on the same day as they would be delivered to the research investigator to ensure that samples would be frozen to – 80 °C within 24 h. If participants knew that a toilet visit in the morning was not guaranteed, they were allowed to collect the faecal samples the evening before and store the samples in the refrigerator (5 °C) until departure from home. Regardless, the 24-h time interval from defecation to freezing was maintained.

DNA extraction and sequencing

DNA extractions were performed manually using the DNeasy PowerSoil Pro Kit (250) (cat. nr. 47016, Qiagen, Hilden, Germany) following the manufacturer’s instructions. A 10 µL inoculation loop (cat. nr. 86.1562.959, Sarstedt, Nümbrecht, Germany) was used for sampling from the original non-homogenised stool sample. The sample was transferred to a tube containing beads, and lysis-buffer (CD1), and vortexed briefly to mix. Next, the tube was placed vertically in a styrofoam sample holder on a vortex mixer (Edison, New Jersey, USA) and mixed at maximal power for 10 min. Library preparation was performed automatically on a Biomek 4000 robot (Bechman Coulter, Indianapolis, USA) with 24 samples processed at a time. The library preparation was performed using the Nextera XT DNA Library Prep (cat. nr. FC-131-1096, Illumina, San Diego, California, USA) with bead-based normalisation and Nextera XT Index kit v2 (cat. nr. FC-131-2001/FC-131-2004, Illumina, San Diego, California, USA). The library preparation was performed as recommended by the kit manufacturer, except for an adjusted volume of AMPure XP beads during the final library purification following indexing (25 μL rather than 30 μL).

Sequencing of samples was performed on an Illumina MiSeq with Illumina MiSeq® Reagent Kit v2 (300 Cycles) (cat.nr. 15,033,418, Illumina, San Diego, California, USA), generating 150 base pair paired-end reads.

Test of sample homogeneity

The faecal samples from participants with CD were each DNA extracted and sequenced twice to investigate the homogeneity of the sample as well as the stability of the process. In the process of DNA extraction, samples were deliberately taken from two different locations in the stool. The procedure was performed by the same laboratory personnel each time. When evaluating data yield, only the first set of CD samples was included.

Bioinformatic analysis

Raw FASTQ read files were trimmed using fastp v. 0.20.1 [11] with –qualified_quality_phred of 20 and minimum read length of 50. FastQC v. 0.11.8 [12] was used to evaluate the quality of the reads before and after trimming. Depletion of human sequences was performed by aligning the trimmed reads to the human genome (hg38, University of California, Santa Cruz) using bowtie2 v. 2.3.4.1 [13] with end-to-end alignment and maximum fragment length for valid paired-end alignments (-X) of 2000. Clade-based microbial profiling of the human-depleted reads was performed with MetaPhlAn v. 4.0.6 [14] (database version mpa_vOct22_CHOCOPhlAnSGB_202212) with addition of the parameters –ignore_eukaryotes, –ignore_usgbs, and -t rel_ab_w_read_stats.

The taxonomic data was processed with R v. 4.3.0 [15] in RStudio v. 2023.06.0 [16] and ggplot2 v. 3.5.0 [17] was used for visualisations. Beta diversity analysis was performed using QIIME2 v. 2023.09 [18] using the species level estimated read counts generated by MetaPhlAn4. Aitchison distance was used as beta diversity metric to account for the compositionality of the data [19], and was calculated using the QIIME2 diversity plugin adding a pseudocount of 1. Principal-coordinate analysis was calculated using the ecodist R-package v. 2.0.9 [20].

Shannon diversity was calculated using the microbiome package [21]. Differences in Shannon diversity between preservation methods were tested with the Wilcoxon signed rank test.

Results

Study participants

The 12 participants delivered 24 faecal samples (Supplementary Table 1). Baseline data with a distribution of the age and sex of the study participants can be found in Supplementary Table 2. The participants with CD had an abnormal level of faecal calprotectin at the time of sample delivery, ranging from 226 to 914 mg/kg, indicating inflammatory activity in the intestine.

Sequencing reads

Following DNA extraction, the DNA yield was high in all samples, but lower in the samples collected with ethanol compared to those without (mean 144 ng/µL, range and 180 ng/µL respectively). During library preparation, the DNA input was normalised to similar concentrations.

Sequencing of the included samples resulted in a median of 1,094,598 quality trimmed sequencing reads (range 666,880-1,726,076) for the samples preserved in ethanol, and a median of 1,371,445 quality trimmed reads for the samples without preservatives (range 844,750-1,854,598) (Fig. 2A). As a result of more total sequencing reads, analyses of faeces without preservatives also resulted in more bacterial reads and a higher species count compared to faeces preserved in ethanol, while a similar bacterial fraction of total reads was observed (Fig. 2A-C).

Fig. 2
figure 2

Total reads (A), species count (B), bacterial fraction of total reads (%) (C), nonhuman fraction of total reads (%) (D), human fraction of total reads (%) (E), and Shannon diversity (F) divided into groups of participants: healthy controls (HC), irritable bowel syndrome (IBS), and Crohn’s disease (CD) with no preservative (−) versus 96% ethanol ( +)

Microbial composition

A total of 197 different bacterial species were identified across the 32 sequenced samples. The identified species belonged to 92 different genera, 37 families, and 10 phyla. Among these, 90 different bacterial species were found with a relative abundance above 1%. Figure 3A provides an overview of the relative abundance of these species with a comparison of (1) the gut microbiome among the different participants, (2) the faecal collection methods ethanol vs. no preservatives, and (3) the first and second time a faecal sample was DNA extracted and sequenced. In general, only small differences were observed between samples collected in tubes with and without the preservative. The variations of the microbiome profiles clustered by each participant and not by groups of participants (i.e., healthy controls, IBS, and CD) or by the type of preservation. For participants number 6 and 7, the variation was slightly more visible (Fig. 3B).

Fig. 3
figure 3

Relative abundance of the bacterial species (A) and the beta diversity (B) detected in healthy controls (HC) (participant 1–4), irritable bowel syndrome (IBS) (participant 5–8), and Crohn’s disease (CD) (participant 9–12) without a preservative (−) and with 96% ethanol ( +). The two times a CD sample was DNA extracted and sequenced are marked with A and B for the first and second analyses, respectively. Species detected at a relative abundance < 1% are grouped as Other

There were no significant differences in Shannon diversity between all samples with or without ethanol (p = 0.3) or when divided into groups of participants (healthy controls p = 1.0, IBS p = 0.9, CD p = 0.3).

Intestinal inflammation and human versus nonhuman reads

We obtained more data from participants with CD than from participants with IBS and healthy controls (median 1,853,583 from CD, 1,341,691 from IBS, and 1,183,483 from healthy controls) (Fig. 2A). However, the fraction of nonhuman reads was similar for all groups except for CD samples collected in tubes with ethanol. As presented in Fig. 2D, the other groups had a nonhuman fraction of total reads above 99.9%, and the human fraction of total reads was correspondingly lower in these groups compared to the CD samples with ethanol (Fig. 2E).

Sample homogeneity

To assess sample homogeneity, two separate sub-samples were collected for DNA extraction and sequencing from each of the CD samples (Supplementary Table 1). The results showed no major differences between the first and the second time a CD sample was analysed, which demonstrates both the sample homogeneity and the stability of the process from DNA extraction to sequencing (Fig. 3A, B).

Discussion

In this study, we compared samples collected without preservatives with samples collected in 10 mL of 96% ethanol from patients with GI conditions and healthy controls. To our knowledge, similar GI patient-specific studies with a comparison of collection methods have not previously been performed.

The collection methods in this study differ from the gold standard by samples not being frozen immediately after collection and—as for the ethanol samples—also the addition of a preservative. The time interval of up to 24 h from defecation to freezing at – 80 °C imitates what is practically feasible in most human clinical trials. This particular issue was addressed by Carroll et al. [22] who compared storage temperatures in different time intervals for participants with IBS and healthy controls. The gut microbial composition and diversity of both faecal samples stored at room temperature for 24 h and those preserved at – 80 °C for 6 months exhibited a stronger resemblance to the individual they were collected from than to any other sample [22]. Other studies have shown that the greatest impact on the gut microbiome occurs beyond the 24-h interval [2, 23].

Our main aim was to investigate the differences between the two faecal collection methods regarding suitability for gut microbiome analyses and ease of use. Gut bacterial DNA was successfully analysed in samples both with and without ethanol. However, a slightly higher number of sequencing reads and thus different bacterial species were observed in samples without ethanol. In a systematic review from 2023, Li et al. [2] reported that among eight studies studying the differences between the gold standard and preservation methods for 24-h storage, none of the studies found statistical differences in alpha diversity for any of the different preservation methods compared to the gold standard [2].

We assessed the methods on faeces from both participants with intestinal inflammation (CD + high levels of f-calprotectin), non-inflammatory GI disorder (IBS), and healthy controls. Our hypothesis was, that more human DNA would be detected in samples from participants with CD due to immunological reactions in the intestine. We generally obtained more sequencing reads from CD samples than from IBS and healthy controls, which naturally resulted in more human and non-human data. However, when comparing the human/non-human fraction of total reads, only the CD samples preserved in ethanol (CD +) differed notably from the remaining samples and contained slightly more human DNA. The higher human/lower non-human fraction in these samples does not favour the use of ethanol in this particular study setup.

When considering user-friendliness, we observed a few disadvantages with the ethanol samples. A standardised faecal sample collection method should be simple, intuitive and easy to work with in the laboratory. The risk of spilling and the uncertainty of how well the stool and liquid should be mixed argue against using ethanol. In addition, our laboratory personnel preferred to work with the samples without preservatives, as they were easier to handle due to the sample consistency.

A secondary aim of this study was to evaluate faecal sample homogeneity, which was tested on the CD samples. When studying the microbiome composition, we observed that DNA extraction and sequencing from two different locations in the same stool sample resulted in comparable results. Thus, we also suggest that DNA extraction and sequencing is a stable process.

Our study suggests that faecal sample collection in tubes without ethanol is the better choice when samples are stored for up to 24 h after defecation until freezing. Also, we find that when faeces are collected from patients with IBD, samples without ethanol perform well in the gut microbiome analyses.

Limitations

The study is limited by the small sample size, as only four participants were included in each group. For validation of our findings, we recommend a larger cohort.

Availability of data and material

The datasets generated and/or analysed during the current study are not publicly available because they contain identifiable human genome sequences but are available from principal investigator Andreas Munk Petersen on reasonable request. E-mail: andreas.munk.petersen@regionh.dk.

Abbreviations

GI:

Gastrointestinal

IBS:

Irritable bowel syndrome

CD:

Crohn’s disease

HC:

Healthy controls

IBD:

Inflammatory bowel disease

References

  1. Guan H, Pu Y, Liu C, Lou T, Tan S, Kong M, et al. Comparison of fecal collection methods on variation in gut metagenomics and untargeted metabolomics. mSphere. 2021. https://doiorg.publicaciones.saludcastillayleon.es/10.1128/mSphere.00636-21.

    Article  PubMed  PubMed Central  Google Scholar 

  2. Li X, Shi X, Yao Y, Shen Y, Wu X, Cai T, et al. Effects of stool sample preservation methods on gut microbiota biodiversity: new original data and systematic review with meta-analysis. Microbiol Spectr. 2023. https://doiorg.publicaciones.saludcastillayleon.es/10.1128/spectrum.04297-22.

    Article  PubMed  PubMed Central  Google Scholar 

  3. Vijay A, Valdes AM. Role of the gut microbiome in chronic diseases: a narrative review. Eur J Clin Nutr. 2022;76:489–501.

    Article  CAS  PubMed  Google Scholar 

  4. Zhang YZ, Li YY. Inflammatory bowel disease: pathogenesis. World J Gastroenterol. 2014;20:91–9.

    Article  PubMed  PubMed Central  Google Scholar 

  5. Durazzi F, Sala C, Castellani G, Manfreda G, Remondini D, De Cesare A. Comparison between 16S rRNA and shotgun sequencing data for the taxonomic characterization of the gut microbiota. Sci Rep. 2021. https://doiorg.publicaciones.saludcastillayleon.es/10.1038/s41598-021-82726-y.

    Article  PubMed  PubMed Central  Google Scholar 

  6. Loftfield E, Vogtmann E, Sampson JN, Moore SC, Nelson H, Knight R, et al. Comparison of collection methods for fecal samples for discovery metabolomics in epidemiologic studies. Cancer Epidemiol Biomark Prev. 2016;25:1483–90.

    Article  Google Scholar 

  7. Sinha R, Chen J, Amir A, Vogtmann E, Shi J, Inman KS, et al. Collecting fecal samples for microbiome analyses in epidemiology studies. Cancer Epidemiol Biomark Prev. 2016;25:407–16.

    Article  Google Scholar 

  8. Dominianni C, Wu J, Hayes RB, Ahn J. Comparison of methods for fecal microbiome biospecimen collection. BMC Microbiol. 2014. https://doiorg.publicaciones.saludcastillayleon.es/10.1186/1471-2180-14-103.

    Article  PubMed  PubMed Central  Google Scholar 

  9. Marquina D, Buczek M, Ronquist F, Lukasik P. The effect of ethanol concentration on the morphological and molecular preservation of insects for biodiversity studies. PeerJ. 2021. https://doiorg.publicaciones.saludcastillayleon.es/10.7717/peerj.10799.

    Article  PubMed  PubMed Central  Google Scholar 

  10. Horng KR, Ganz HH, Eisen JA, Marks SL. Effects of preservation method on canine (Canis lupus familiaris) fecal microbiota. PeerJ. 2018;2018: e4827.

    Article  Google Scholar 

  11. Chen S, Zhou Y, Chen Y, Gu J. Fastp: an ultra-fast all-in-one FASTQ preprocessor. In: Bioinformatics. Oxford: Oxford University Press; 2018. p. i884–90.

    Google Scholar 

  12. Andrews S. FastQC—A quality control tool for high throughput sequence data. Cambridge: Babraham Bioinformatics; 2010.

    Google Scholar 

  13. Langmead B, Salzberg SL. Fast gapped-read alignment with Bowtie 2. Nat Methods. 2012;9:357–9.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  14. Blanco-Míguez A, Beghini F, Cumbo F, McIver LJ, Thompson KN, Zolfo M, et al. Extending and improving metagenomic taxonomic profiling with uncharacterized species using MetaPhlAn 4. Nat Biotechnol. 2023;41:1633–44.

    Article  PubMed  PubMed Central  Google Scholar 

  15. R Core Team. R: a language and environment for statistical computing. Vienna: R Foundation for Statistical Computing; 2021.

    Google Scholar 

  16. Posit team. RStudio: integrated development environment for R. Boston: Posit Software PBC; 2023.

    Google Scholar 

  17. Wickham H. ggplot2: elegant graphics for data analysis. New York: Springer- Verlag; 2016.

    Book  Google Scholar 

  18. Bolyen E, Rideout JR, Dillon MR, Bokulich NA, Abnet CC, Al-Ghalith GA, et al. Reproducible, interactive, scalable and extensible microbiome data science using QIIME 2. Nat Biotechnol. 2019;37:852–7.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  19. Gloor GB, Macklaim JM, Pawlowsky-Glahn V, Egozcue JJ. Microbiome datasets are compositional: and this is not optional. Front Microbiol. 2017. https://doiorg.publicaciones.saludcastillayleon.es/10.3389/fmicb.2017.02224.

    Article  PubMed  PubMed Central  Google Scholar 

  20. Goslee SC, Urban DL. Journal of Statistical Software The ecodist Package for Dissimilarity-based Analysis of Ecological Data. 2007.

  21. Lahti L, Shetty S. (2012–2019) “microbiome R package”. https://doiorg.publicaciones.saludcastillayleon.es/10.18129/B9.bioc.microbiome.

  22. Carroll IM, Ringel-Kulka T, Siddle JP, Klaenhammer TR, Ringel Y. Characterization of the fecal microbiota using high-throughput sequencing reveals a stable microbial community during storage. PLoS ONE. 2012;7: e46953.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  23. Cardona S, Eck A, Cassellas M, Gallart M, Alastrue C, Dore J, et al. Storage conditions of intestinal microbiota matter in metagenomic analysis. BMC Microbiol. 2012. https://doiorg.publicaciones.saludcastillayleon.es/10.1186/1471-2180-12-158.

    Article  PubMed  PubMed Central  Google Scholar 

Download references

Acknowledgements

The authors would like to acknowledge medical laboratory technologist Louise Gylling Olsen for her laboratory assistance.

Funding

Open access funding provided by Copenhagen University. Not applicable.

Author information

Authors and Affiliations

Authors

Contributions

IG, SM, SH, SP, MP, HW, and AP conceived and designed the study and contributed to the results' interpretation and conclusions. IG and SH collected the data. IG prepared Fig. 1 and supplementary tables and wrote the manuscript. SM performed the analyses, prepared Figs. 2 and 3, and wrote the section on the bioinformatic analyses. SP wrote the section on DNA extraction and sequencing. All authors (IG, SM, SH, SP, MP, HW, and AP) reviewed and approved the final manuscript.

Corresponding author

Correspondence to Ida Marie Bruun Grønbæk.

Ethics declarations

Ethics approval and consent to participate

The study was performed in accordance with the Revised Declaration of Helsinki. The study was registered at www.clinicaltrials.gov as NCT04808271 and NCT04842149. All participants provided written informed consent to participate after verbal and written information was given.

Consent for publication

Not applicable.

Competing interests

The authors declare no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Grønbæk, I.M.B., Mollerup, S., Halkjær, S.I. et al. Faecal sample storage without ethanol for up to 24 h followed by freezing performs better than storage with ethanol for shotgun metagenomic microbiome analysis in patients with inflammatory and non-inflammatory intestinal diseases and healthy controls. BMC Res Notes 17, 340 (2024). https://doiorg.publicaciones.saludcastillayleon.es/10.1186/s13104-024-06999-y

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doiorg.publicaciones.saludcastillayleon.es/10.1186/s13104-024-06999-y

Keywords