Skip to main content

Enrichment of colibactin-associated mutational signatures in unexplained colorectal polyposis patients

Abstract

Background

Colibactin, a genotoxin produced by polyketide synthase harboring (pks+) bacteria, induces double-strand breaks and chromosome aberrations. Consequently, enrichment of pks+Escherichia coli in colorectal cancer and polyposis suggests a possible carcinogenic effect in the large intestine. Additionally, specific colibactin-associated mutational signatures; SBS88 and ID18 in the Catalogue of Somatic Mutations in Cancer database, are detected in colorectal carcinomas. Previous research showed that a recurrent APC splice variant perfectly fits SBS88.

Methods

In this study, we explore the presence of colibactin-associated signatures and fecal pks in an unexplained polyposis cohort. Somatic targeted Next-Generation Sequencing (NGS) was performed for 379 patients. Additionally, for a subset of 29 patients, metagenomics was performed on feces and mutational signature analyses using Whole-Genome Sequencing (WGS) on Formalin-Fixed Paraffin Embedded (FFPE) colorectal tissue blocks.

Results

NGS showed somatic APC variants fitting SBS88 or ID18 in at least one colorectal adenoma or carcinoma in 29% of patients. Fecal metagenomic analyses revealed enriched presence of pks genes in patients with somatic variants fitting colibactin-associated signatures compared to patients without variants fitting colibactin-associated signatures. Also, mutational signature analyses showed enrichment of SBS88 and ID18 in patients with variants fitting these signatures in NGS compared to patients without.

Conclusions

These findings further support colibactins ability to mutagenize colorectal mucosa and contribute to the development of colorectal adenomas and carcinomas explaining a relevant part of patients with unexplained polyposis.

Peer Review reports

Background

An enrichment of polyketide synthase (pks) encoding Escherichia coli in patients with colorectal cancer [1, 2] and polyposis [3] implies a potential carcinogenic effect in the large intestine. These E. coli bacteria harbor the pks gene island which encodes the necessary equipment to produce the genotoxin colibactin [4]. Colibactin induces double-strand breaks and chromosome aberrations leading to a specific mutational signature that has been observed in colorectal adenocarcinomas and oral squamous cell carcinomas [5, 6]. This mutational signature is characterized by T > N mutations with an adenine 3 base pairs to the 5’ side and single thymine deletions located in T homopolymers with 2 to 4 adenines to the 5’ side depending on the length of the T homopolymer. These signatures are documented in the Catalogue of Somatic Mutations in Cancer (COSMIC) database as single base substitution signature SBS88 and indel signature ID18.

E. coli is not the only bacterium able to harbor the pks gene island. Other bacteria mostly belonging to the Enterobacteriaceae family, such as Klebsiella pneumoniae, Enterobacter aerogenes and Citrobacter koseri, have also been shown to harbor pks [7]. Moreover, pks harboring bacteria are found in other organisms like bacteria in the honey bee gut or a marine sponge [8].

We previously showed that a common APC splice variant c.835-8 A > G and several other pathogenic APC variants perfectly fit the colibactin-associated mutational signatures [9]. This finding furthermore implies a possible association between colibactin and the development of colorectal neoplasms. Since a large proportion of our unexplained polyposis patient cohort showed a colibactin-associated APC variant in multiple adenomas, further research into the presence and impact of colibactin and its mutational signature was warranted. Therefore, for a subset of polyposis patients, metagenomics was performed on feces and Whole Genome Sequencing (WGS) with subsequent mutational signature analyses was conducted on Formalin Fixed Paraffin Embedded (FFPE) colorectal tissue blocks. Results were compared between those with and without colibactin-associated variants.

Materials and methods

APC mosaicism testing

In total, 379 patients with multiple colorectal adenomas or carcinomas were tested for APC mosaicism. The majority of these patients (80.5%), falling within the scope of the Dutch germline testing criteria, were tested before for germline pathogenic variants in APC, MUTYH, POLE, POLD1 and NTHL in regular diagnostic testing. In short, DNA was isolated from Formalin Fixed Paraffin Embedded (FFPE) tissue blocks of on average 4 colorectal adenomas or carcinomas using the automated Tissue Preparation System (Siemens). Ampliseq Next Generation Sequencing (NGS) libraries (ThermoFisher Scientific) of a custom-made panel containing 20 colorectal cancer and polyposis associated genes (APC, BMPR1A, BRCA1, BRCA2, ENG, MLH1, MSH2, MSH3, MSH6, MUTYH, NTHL1, PALB2, PMS2, POLD1, POLE, PTEN, RNF43, SMAD4, STK11, TP53) were prepared according to manufacturer’s instructions. In a proportion of patients (55%) the colorectal lesions were furthermore tested for hotspots in BRAF, CTNNB1, KRAS, NRAS and PIK3CA. Sequencing was performed in an Ion GeneStudio S5 Series sequencer (ThermoFisher Scientific). The raw, unaligned sequencing reads were mapped against human reference genome (hg19) using TMAP software and Torrent Variant Caller was used for variant calling. The detected variants were categorized by pathogenicity and were, when needed, visualized using Integrative Genomic Viewer [10] or interpreted using the Alamut Visual software (Sophia Genetics).

Detected variants and colibactin signature

To determine whether the somatic variants fit into the mutational signatures SBS88 and ID18, all detected T > N and delT variants and their sequencing context were visualized using IGV. As previously described [5, 6], T > N variants with the following sequencing context were labelled as fitting SBS88: 5’ A-(N)-(T/A)-T-(T/A/G) 3’. DelT variants were labelled as fitting ID18 whenever a 2 to 4 adenine homopolymer was flanking the 5’ side of a thymine homopolymer with a total of 5–6 base pairs. In total, 12 somatic variants in genes other than APC were determined to fit SBS88, listed in supplemental Table 1. These were found in 13 lesions in 12 patients, 6 of these patients had at least one APC variant fitting SBS88. As somatic variant data of the other genes will be limited in APC diagnostics and APC is most commonly affected gene in colorectal neoplasms, we decided to divide the cohort based on the APC variants. As illustrated in Fig. 1, in 269 patients no somatic variant fitting SBS88 or ID18 was found and therefore served as the control group.

Fig. 1
figure 1

Study design and patient selection. In total, 379 patients were tested using targeted NGS. The case group are patients with at least one APC variant fitting colibactin-associated mutational signature. Twenty cases are selected for additional fecal metagenomics and WGS. Patients without APC variant fitting colibactin-associated signatures serve as controls. Nine controls were selected for fecal metagenomics and WGS. Four patients could not be included for fecal metagenomics since they did not respond to sample request (N = 3) or passed away (N = 1). Two cases were excluded for WGS due to insufficient amount of DNA

Case and control group selection

A random selection of twenty-nine patients were included for fecal metagenomics and/or Whole-Genome Sequencing, as depicted in Fig. 1. Twenty of these patients have adenomas or carcinomas with an APC variant suiting SBS88 or ID18 and nine control patients do not have such a colibactin-associated APC variant. The patient characteristics are summarized in Table 1 and somatic APC variants per lesion in supplemental Table 2. Furthermore, the sequencing context of the APC variants are included in supplemental Table 3.

Table 1 Phenotypic characteristics and NGS results of patients included for fecal metagenomics and WGS

Fecal metagenomics

Feces samples of 25 out of 29 patients were collected for deep fecal shotgun metagenomic sequencing (Fig. 1; Table 1). Four patients could not be included since they did not respond (N = 3) or passed away (N = 1). Fecal metagenomic sequencing was performed as previously described [11]. In short, stool samples were stored at -80 °C, DNA was extracted and libraries were prepared according to manufacturer’s protocol.

Sequencing was performed on the Novaseq6000 platform (Illumina, San Diego, CA, USA). Raw metagenomic sequences were processed, analyzed and compared to the pks gene island partly comparable to the method description by Nooij et al. [12] Reads mapping to the human genome (GRCh38) were removed using bowtie2 (version 2.4.2 [13]) and SAMtools (version 1.11 [14]) and filtered reads were quality-trimmed using fastp (version 0.20.1 [15]). The pre-processing workflow is available at (https://git.lumc.nl/snooij/metagenomics-preprocessing). The quality-trimmed reads were screened for the presence of the pks island by mapping to the colibactin gene cluster (accession ID AM229678) using BWA-MEM (version 0.7.17 [16]). Mapped reads were deduplicated using Picard MarkDuplicates (version 2.23.3 [17]) to remove technical artifacts and improve quantification. The pks screening workflow is available at (https://git.lumc.nl/snooij/screen_pks_in_polyposis_fecal_metagenomes). As previously outlined, fecal samples positive for at least one pks gene were considered pks-positive. [12] To quantify reads per kilobase per million (RPKM) of the individual genes in the pks island present in the stool samples, RPKM values were calculated using the following formula: (N of mapped reads/N of base pairs of the coding sequence of the respective gene)*1,000 divided by N of trimmed and filtered reads*1,000,000. For mean RPKM of the entire pks island, RPKMs of all individual genes were summed and divided by the total number of 19 clb genes.

Whole-genome sequencing (WGS)

DNA from adenomas and carcinomas of 27 out of 29 patients was included for WGS (Fig. 1; Table 1). Two patients were excluded due to an insufficient amount of DNA extracted.

DNA was isolated from FFPE tissue blocks using the NucleoSpin DNA FFPE XS kit (BIOKE, Leiden, the Netherlands) according to manufacturer’s instructions. WGS was performed on the BGIseq500 platform (BGI, Hong Kong, China) for 4 out of 27 patients (ID 8, 10, 12 and 13). Sequencing for the remaining 23 patients was performed on the NovaSeq6000 platform (Illumina, San Diego, USA). The raw sequencing reads were aligned to a reference genome (GRCh38). The alignment, variant calling and filtering were performed as described before [6, 18]. The mutational signature assignment using reference mutational signatures was performed using mSigAct::sparseAssignSignatures followed by mSigAct signature presence test, which provides a p-value for the null-hypothesis that a signature is not needed to explain an observed somatic mutation profile compared with the alternative hypothesis that the signature is needed, as previously described [6].

BMI and lifestyle data

Body Mass Index (BMI) and information about lifestyle was collected using patient medical records and for some patients using a questionnaire (n = 65). BMI was categorized in 4 groups: ≤18.5 ‘underweight’, 18.5–24.9 ‘healthy weight’, 25.0-29.9 ‘overweight’ and ≥ 30.0 ‘obese’. Both tobacco and alcohol consumption were categorized as ‘never’, ‘former’ and ‘current’. Packyears (PY) was determined as the number of packs of cigarettes smoked per day multiplied by the number of years the patient has smoked.

Statistical analysis

Statistical analyses were performed using IBM SPSS statistics 25 (Armonk, NY, USA) and a p-value of < 0.05 was considered statistically significant. Independent T tests, Chi-square tests and Fisher’s exact tests were used to assess the differences between the patients with and without colibactin variants based on the targeted NGS and patients with pks in feces with and without contribution of SBS88 and/or ID18 in the WGS data.

Results

In total, 379 unexplained polyposis patients were tested for somatic APC mosaicism using targeted NGS. In 110 patients, at least one colorectal adenoma or carcinoma harbored an APC variant that fits with one of the colibactin-associated mutational signatures. Phenotypic characteristics, like adenoma count, age at first adenoma and personal history of colorectal carcinoma did not significantly differ between the patients with (cases) and without (controls) APC variants fitting colibactin mutational signatures. Similarly, lifestyle factors like BMI and smoking status were not significantly different between cases and controls (supplemental Table 4). The control group consisted of significantly more former alcohol consumers compared to the cases.

Fecal metagenomics

Fecal samples from seventeen patients with APC variants fitting SBS88 or ID18 (cases) and eight patients without APC variants fitting SBS88 or ID18 (controls) were used for metagenomic analysis to detect pks genes. As shown in Tables 2 and 59% (10 out of 17) of the cases were pks positive compared to 25% (2 out of 8) of controls (p-value = 0.124).

Table 2 Results of fecal pks using metagenomics and mutational signatures SBS88 and ID18 using WGS

In addition, fecal metagenomics was used to quantify pks using RPKM values. However, no significant correlation between number of adenomas/carcinomas with APC variants fitting SBS88 or ID18 and the pks RPKM values was observed (Pearson: R = 0.16, p-value = 0.45).

Also, no significant difference in phenotype was observed between the 10 cases with pks genes in feces and 7 cases without. When comparing lifestyle factors, a trend was observed towards a higher BMI in the group with pks in their feces (supplemental Table 5).

None of the bacteria previously associated with colorectal cancer, like Fusobacterium nucleatum, Bacteroides fragilis, Campylobacter jejuni and Clostridioides difficile, or capable of producing colibactin, like K. pneumoniae, E. aerogenes and C. koseri, were detected in any of the stool samples (data not shown).

Mutational signature analysis

For WGS, fifty-seven colorectal adenomas or carcinomas and six normal colon mucosa samples were analyzed from eighteen patients with APC variants fitting SBS88 or SBS18 (cases) and nine patients without these variants (controls).

As summarized in Table 2, mutational signature analysis identified SBS88 in 8 adenomas or carcinomas derived from 6 cases and ID18 in 2 lesions of 2 cases. Overall, colibactin-associated mutagenesis was detected in 38.9% (7 out of 18) cases. One adenoma of nine controls (11.1%) also showed colibactin associated mutagenesis (SBS88).

Combining fecal metagenomics and mutational signature analyses

Fifteen cases and eight controls were analyzed both using fecal metagenomics and WGS to compute mutational signature analyses. In 10 cases pks was found in their feces samples of which 5 patients also showed a contribution of SBS88 or ID18. Of the 5 cases without pks in their feces, three showed SBS88 or ID18 contribution. In 2 controls, pks genes were detected in feces and in one of them SBS88 was determined in colorectal lesions. Therefore, 86.7% (13 out of 15) of cases and 25% (2 out of 8) of controls showed hints of pks or its carcinogenic effects (p-value = 0.006).

No significant differences were detected in lifestyle factors between fecal pks + and SBS88/ID18 + cases and fecal pks + and SBS88/ID18- cases (supplemental Table 6).

Discussion

Using targeted NGS, 379 patients with unexplained colorectal polyposis were tested for APC mosaicism. At least one somatic APC variant fitting one of the colibactin associated mutational signatures (SBS88 or ID18) was found in 29% (n = 110) patients. Except for the distribution of former alcohol consumption, no significant differences were observed in phenotypic characteristics or lifestyle factors between patients with and without these APC variants. Although further research is warranted, the significant difference in former alcohol consumption observed between the groups is likely attributable to the small number of patients with a former alcohol consumption status.

Fecal metagenomics revealed 59% (10 out of 17) of cases with one or more pks genes. This proportion is comparable to pks+E. coli bacteria found in colon mucosa of individuals with Familial Adenomatous Polyposis (68%) and sporadic CRC patients (55%) [1, 3]. In contrast, only 25% (2 out of 8) of controls showed pks genes. Although numbers are small, this is comparable to the previously reported incidence of healthy individuals with pks genes in feces (27–29%) [12, 19, 20] and with pks+E. coli bacteria in colon mucosa (19–22%) [1, 3].

The current study found no significant differences in phenotypic characteristics and tobacco and alcohol consumption between patients with and without pks in feces. Further research is required to draw a conclusion about the correlation between BMI and pks+E. coli. Although not directly linked to BMI, Arima et al. [21] found that the association between the Western diet and colorectal cancer patients was only significant in patients with pks+E. coli in their tumor, suggesting a potential interactive carcinogenic effect between diet and pks+E. coli.

WGS with subsequent mutational signature analysis showed a contribution of SBS88 or ID18 in 39% (7/18) of cases, compared to 11.1% (1/9) of controls. In only one case all analyzed samples showed a contribution of SBS88. This might be explained by the variable distribution of colonic crypts with the signature within one patient [22]. Moreover, as summarized in supplemental Table 2, the majority of adenomas and carcinomas (n = 25) selected for WGS from cases did not harbor APC variants fitting SBS88 or ID18. Eighteen of these adenomas and carcinomas were located in the right colon and right sided carcinomas were less likely to have colibactin-associated signatures [23].

Combining both fecal metagenomics and mutational signature analyses, 86.7% (13/15) of cases showed a significant enrichment towards colibactin influence compared to 25% (2/8) of controls in which both analyses were performed.

This significant enrichment of fecal pks and colibactin-associated mutational signatures in cases compared to controls, supports the proposition of a recent preprint that the APC splice variant c.835-8 A > G might be used as a biomarker for pks+E. coli influence in the development of the adenoma or carcinoma [23].

Despite the enrichment, no clear correlation between pks in feces and colibactin-associated mutational signatures in colorectal lesions was observed in individual cases. Multiple hypotheses might explain (part of) this finding, comprising both biological and technical issues:

It was previously described that colibactin has a short-term effect, affecting the colon early in life [22, 24, 25]. Colonic mucosa of patients with a contribution of SBS88 and ID18 might therefore be affected by colibactin, but the pks-encoding bacteria may have been eradicated from the intestinal tract at time of feces sampling.

The other way around, in patients with pks detected in feces but no SBS88 or ID18 in WGS, enrichment of pks+ bacteria after the development of adenomas but before feces sampling seems unlikely as pks+E. coli is detected in feces of newborns and therefore proposed to be transmitted during birth [25, 26]. These patients might, however, have some kind of mechanism inhibiting colibactin from entering the host cell or whenever inside the cell protects against the specific DNA damage. The protein ATG16L1 for example is described to be associated with preventing colorectal tumorigenesis in presence of pks+E. coli in cell lines and mouse models [27]. Also, colibactin production is in a recent preprint suggested to be inhibited by oxygen [28]. On the other hand, inflammation seems to promote the expansion of the colibactin-encoding E. coli and creates an opportunity to adhere to colon mucosa [2]. Moreover, co-localization with B. fragilis seems to increase DNA damage with faster tumor onset in mice [3]. These hypotheses might also play a role in whether presence of pks+E. coli in the intestinal tract actually leads to DNA damage.

Technically, the small sample set and use of shotgun metagenomics and FFPE tissue blocks are limitations of this study. Especially WGS performed on FFPE samples affects the variant and signature calling and interpretation due to fragmentation and deamination artefacts [29,30,31]. Moreover, shotgun fecal metagenomics is a broad analyses but a more sensitive qPCR approach performed at multiple timepoints and at time of adenoma diagnosis could give more insight into the association with adenoma development.

To conclude, in 29% of our cohort with unexplained polyposis patients a colibactin influence was suggested based on targeted NGS data. A subset of cases was included for additional analyses and showed further evidence of colibactin in fecal metagenomics and mutational signature analyses compared to controls. Further research, circumventing the complications of WGS on FFPE tissue and validating the feces analyses, should be performed to draw conclusions for individual cases. Still, these findings provide evidence that colibactin affects the colonic mucosa and plays a pivotal role in unexplained polyposis patients.

Data availability

The datasets used and/or analysed during the current study are available from the corresponding author on reasonable request.

References

  1. Buc E, Dubois D, Sauvanet P, et al. High prevalence of mucosa-associated E. Coli producing cyclomodulin and genotoxin in colon cancer. PLoS ONE. 2013;8:e56964.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  2. Arthur JC, Perez-Chanona E, Muhlbauer M, et al. Intestinal inflammation targets cancer-inducing activity of the microbiota. Science. 2012;338:120–3.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  3. Dejea CM, Fathi P, Craig JM, et al. Patients with familial adenomatous polyposis harbor colonic biofilms containing tumorigenic bacteria. Science. 2018;359:592–7.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  4. Nougayrede JP, Homburg S, Taieb F, et al. Escherichia coli induces DNA double-strand breaks in eukaryotic cells. Science. 2006;313:848–51.

    Article  CAS  PubMed  Google Scholar 

  5. Pleguezuelos-Manzano C, Puschhof J, Huber AR, et al. Mutational signature in colorectal cancer caused by genotoxic pks(+)E. coli. Nature. 2020;580:269–.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  6. Boot A, Ng AWT, Chong FT, et al. Characterization of colibactin-associated mutational signature in an Asian oral squamous cell carcinoma and in other mucosal tumor types. Genome Res. 2020;30:803–13.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  7. Wami H, Wallenstein A, Sauer D et al. Insights into evolution and coexistence of the colibactin- and yersiniabactin secondary metabolite determinants in enterobacterial populations. Microb Genom 2021;7.

  8. Silpe JE, Wong JWH, Owen SV, et al. The bacterial toxin colibactin triggers prophage induction. Nature. 2022;603:315–20.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  9. Terlouw D, Suerink M, Boot A, et al. Recurrent APC splice variant c.835-8A > G in patients with unexplained colorectal polyposis fulfilling the Colibactin mutational signature. Gastroenterology. 2020;159:1612–1614e5.

    Article  CAS  PubMed  Google Scholar 

  10. Thorvaldsdottir H, Robinson JT, Mesirov JP. Integrative genomics viewer (IGV): high-performance genomics data visualization and exploration. Brief Bioinform. 2013;14:178–92.

    Article  CAS  PubMed  Google Scholar 

  11. Ducarmon QR, Terveer EM, Nooij S, et al. Microbiota-associated risk factors for asymptomatic gut colonisation with multi-drug-resistant organisms in a Dutch nursing home. Genome Med. 2021;13:54.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  12. Nooij S, Ducarmon QR, Laros JFJ et al. Faecal microbiota transplantation influences procarcinogenic Escherichia coli in recipient recurrent Clostridioides difficile patients. Gastroenterology. 2021.

  13. Langmead B, Salzberg SL. Fast gapped-read alignment with Bowtie 2. Nat Methods. 2012;9:357–U54.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  14. Li H, Handsaker B, Wysoker A, et al. The sequence Alignment/Map format and SAMtools. Bioinformatics. 2009;25:2078–9.

    Article  PubMed  PubMed Central  Google Scholar 

  15. Chen SF, Zhou YQ, Chen YR, et al. Fastp: an ultra-fast all-in-one FASTQ preprocessor. Bioinformatics. 2018;34:884–90.

    Article  Google Scholar 

  16. Li H. Aligning sequence reads, clone sequences and assembly contigs with BWA-MEM. arXiv: Genomics; 2013.

    Google Scholar 

  17. https://broadinstitute.github.io/picard/index.html.

  18. Boot A, Huang MN, Ng AWT, et al. In-depth characterization of the cisplatin mutational signature in human cell lines and in esophageal and liver tumors. Genome Res. 2018;28:654–65.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  19. Dubinsky V, Dotan I, Gophna U. Carriage of colibactin-producing Bacteria and colorectal Cancer risk. Trends Microbiol. 2020;28:874–6.

    Article  CAS  PubMed  Google Scholar 

  20. Watanabe D, Murakami H, Ohno H, et al. Stool pattern is associated with not only the prevalence of tumorigenic bacteria isolated from fecal matter but also plasma and fecal fatty acids in healthy Japanese adults. BMC Microbiol. 2021;21:196.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  21. Arima K, Zhong R, Ugai T, et al. Western-style Diet, Pks Island-carrying Escherichia coli, and Colorectal Cancer: analyses from two large prospective cohort studies. Gastroenterology. 2022;163:862–74.

    Article  CAS  PubMed  Google Scholar 

  22. Lee-Six H, Olafsson S, Ellis P, et al. The landscape of somatic mutation in normal colorectal epithelial cells. Nature. 2019;574:532–7.

    Article  CAS  PubMed  Google Scholar 

  23. Georgeson P, Steinfelder RS, Harrison TA et al. Genotoxic colibactin mutational signature in colorectal cancer is associated with clinicopathological features, specific genomic alterations and better survival. This article is a preprint and has not been certified by peer review. MedRxiv. https://doi.org/10.1101/2023.03.10.23287127.

  24. Iftekhar A, Berger H, Bouznad N, et al. Genomic aberrations after short-term exposure to colibactin-producing E. Coli transform primary colon epithelial cells. Nat Commun. 2021;12:1003.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  25. Payros D, Secher T, Boury M, et al. Maternally acquired genotoxic Escherichia coli alters offspring’s intestinal homeostasis. Gut Microbes. 2014;5:313–25.

    Article  PubMed  PubMed Central  Google Scholar 

  26. Tsunematsu Y, Hosomi K, Kunisawa J, et al. Mother-to-infant transmission of the carcinogenic colibactin-producing bacteria. BMC Microbiol. 2021;21:235.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  27. Lucas C, Salesse L, Hoang MHT, et al. Autophagy of intestinal epithelial cells inhibits colorectal Carcinogenesis Induced by Colibactin-Producing Escherichia coli in Apc(Min/+) mice. Gastroenterology. 2020;158:1373–88.

    Article  CAS  PubMed  Google Scholar 

  28. Nadège Bossuet-Greif N, Guyonnet C, Chagneau CV et al. Oxygen concentration modulates colibactin production. This article is a preprint and has not been certified by peer review. BioRxiv. https://doi.org/10.1101/2022.06.20.496773.

  29. de Schaetzen L, Larmuseau M, Van der Eecken K, et al. Comparative analysis of somatic variant calling on matched FF and FFPE WGS samples. BMC Med Genomics. 2020;13:94.

    Article  Google Scholar 

  30. Robbe P, Popitsch N, Knight SJL, et al. Clinical whole-genome sequencing from routine formalin-fixed, paraffin-embedded specimens: pilot study for the 100,000 genomes Project. Genet Med. 2018;20:1196–205.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  31. Do H, Dobrovic A. Sequence artifacts in DNA from formalin-fixed tissues: causes and strategies for minimization. Clin Chem. 2015;61:64–71.

    Article  CAS  PubMed  Google Scholar 

Download references

Acknowledgements

Not applicable.

Funding

This study is supported by the Dutch Cancer Society (Project number: 11292).

Author information

Authors and Affiliations

Authors

Contributions

D.T.: acquistion of data, analysis and interpretation of data, drafting the manuscript. A.B.: performed mutational signature analysis, critical revision of manuscript. Q.R.D., S.N.: performed fecal metagenomics analysis, critical revision of manuscript. M.S.: acquisition of data, critical revision of manuscript. M.Ev.L.: acquisition of patients, critical revision of manuscript. Dv.E.: performed laboratory work NGS and WGS, critical revision of manuscript. C.M.T.: acquisition of data, critical revision of manuscript. R.D.: bioinformatics and critical revision of manuscript. R.Z.: concept fecal metagenomics, critical revision of manuscript. A.M.J.L.: acquisition of patients, interpretation of data, critical revision of manuscript. M.N.: acquisition of patients. Tv.W., H.M., M.N.: study concept and design, interpretation of data, obtained funding, critical revision of manuscript.

Corresponding author

Correspondence to Hans Morreau.

Ethics declarations

Ethics approval and consent to participate

This study was approved by a the ethics review board (B18.042) of the Leiden University Medical Center (LUMC) and all subjects provided written informed consent. All methods were carried out in accordance with relevant guidelines and regulations.

Consent for publication

Not applicable.

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary Material 1:

Somatic variants detected using targeted NGS fitting SBS88 in other genes than APC

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Terlouw, D., Boot, A., Ducarmon, Q.R. et al. Enrichment of colibactin-associated mutational signatures in unexplained colorectal polyposis patients. BMC Cancer 24, 104 (2024). https://doi.org/10.1186/s12885-024-11849-y

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1186/s12885-024-11849-y

Keywords