- Research article
- Open Access
Microbiome composition indicate dysbiosis and lower richness in tumor breast tissues compared to healthy adjacent paired tissue, within the same women
BMC Cancer volume 22, Article number: 30 (2022)
Breast cancer (BC) is the most common malignancy in women, in whom it reaches 20% of the total neoplasia incidence. Most BCs are considered sporadic and a number of factors, including familiarity, age, hormonal cycles and diet, have been reported to be BC risk factors. Also the gut microbiota plays a role in breast cancer development. In fact, its imbalance has been associated to various human diseases including cancer although a consequential cause-effect phenomenon has never been proven.
The aim of this work was to characterize the breast tissue microbiome in 34 women affected by BC using an NGS-based method, and analyzing the tumoral and the adjacent non-tumoral tissue of each patient.
The healthy and tumor tissues differed in bacterial composition and richness: the number of Amplicon Sequence Variants (ASVs) was higher in healthy tissues than in tumor tissues (p = 0.001). Moreover, our analyses, able to investigate from phylum down to species taxa for each sample, revealed major differences in the two richest phyla, namely, Proteobacteria and Actinobacteria. Notably, the levels of Actinobacteria and Proteobacteria were, respectively, higher and lower in healthy with respect to tumor tissues.
Our study provides information about the breast tissue microbial composition, as compared with very closely adjacent healthy tissue (paired samples within the same woman); the differences found are such to have possible diagnostic and therapeutic implications; further studies are necessary to clarify if the differences found in the breast tissue microbiome are simply an association or a concausative pathogenetic effect in BC. A comparison of different results on similar studies seems not to assess a universal microbiome signature, but single ones depending on the environmental cohorts’ locations.
Breast cancer (BC) is the most common form of cancer among women and, after ovarian cancer, is the second cause of death due to a neoplastic disease worldwide [1, 2]. Familial forms of BC represent up to 20% of all BCs: among these more than 25% are due to predisposing mutations in the BRCA1/2 genes [3,4,5,6,7,8,9] while another percentage concerns mutations in high, moderate and low susceptibility genes . Despite this genetic component, the etiology of up to 80-85% of tumors remains unknown and thus they are considered sporadic. In this context, environmental and lifestyle factors might also modify cancer risk in both familial and sporadic BCs. Nevertheless, most of the factors contributing to BC are still not completely understood thereby limiting BC prevention and treatment measures [11, 12].
The human microbiome plays an important role in promoting health and preventing disease, which suggests that microbial dysbiosis could contribute to increasing the risk of cancer [13,14,15,16,17,18,19]. In this regard, in recent years attention has focused on the relationship between the human microbiome and carcinogenesis to assess its role in BC onset and/or development [20,21,22,23,24]. Therefore, in this scenario, we analyzed (in paired samples from the same subject) the microbiome of tumor breast tissue and the adjacent normal one of women affected by BC in the attempt to get a closer view which may shed light on the potential involvement of microbial dysbiosis in breast cancer. To this aim, we used next-generation-sequencing (NGS)-based methodology to analyze the 16 s ribosomal RNA of the microbiome tissue populations.
Patients’ samples and ethics
Biological samples and clinical data were obtained from a total of 34 women attending the Breast Unit of the “Istituto Nazionale dei Tumori - Fondazione G. Pascale” of Naples starting in 2014 lasting 5 years (Table 1). All patients gave their written informed consent to the study that was carried out according to the tenets of the Helsinki Declaration and approved by the Istituto Nazionale Tumori - Fondazione G. Pascale Ethics Committee (protocol number 3 of 03/25/2009). All patients were previously screened for BRCA1/2 mutations using the protocol and the selection criteria reported by D’Argenio et al. 2015 .
Tumor tissues and healthy tissues, singly paired from the same woman, and surgically removed at the same time (within the same sequencing run, see below), were analyzed for a total of 68 samples, from which total DNA was isolated. Only fresh frozen tissues were used. The tissues were frozen immediately after removal directly in the surgery room to avoid environmental contamination.
To precisely ensure the histology of tissues, all were analyzed in the pathology laboratory (see Table 1).
Genomic DNA extraction from breast tissue
DNA was extracted from tissues using the QIAamp DNeasy Blood & Tissue kits (Qiagen, Hilden, Germany), according to the manufacturer’s instructions. DNA was quantified using the NanoDrop 2000c Spectrophotometer (Thermo Fisher Scientific, Waltham, MA, USA) and the Qubit dsDNA BR and HS assay kit (Life Technologies, CA, USA).
Preparation of the 16 S Metagenomic Sequencing Library
Amplification of the V4-V6 regions of the 16 S rRNA bacterial genes was assessed in two PCR steps: a template of 5 ng/µl of DNA for each sample was used for the first PCR, which was performed using the V4-V6 region specific primers with overhang adapters attached. The primer sequences used in this study are listed in Table 2; the primers were designed and synthetized in our core facility.
Subsequently, 1 µl of the PCR product was analyzed on a Bioanalyzer DNA 1000 chip (Agilent, Santa Clara, CA, USA) to verify its size (~550 bp). Next, Agencourt AMPure XP beads (Beckman Coulter, Brea, CA, USA) were used to purify the 16 S V4-V6 amplicons away from free primers and primer dimer species. Purification products underwent further quality and quantity controls by Bioanalyzer DNA 1000 analysis (Agilent, Santa Clara, CA, USA). The second PCR, performed as per the Nextera XT protocol (Illumina, San Diego, CA), allowed the addition of the Illumina sequencing adapters and the dual-index primers, which barcoded each sample. The V4-V6 amplified regions of each patient were purified through Agencourt AMPure XP Beads (Beckman Coulter, Brea, CA, USA), quantified using the Qubit HS assay kit (Life Technologies, CA, USA) and quality-assessed using a High Sensitivity Chip on the 2100 Bioanalyzer Instruments (Agilent, Santa Clara, CA, USA). However, up to 68 libraries were pooled together for sequencing. Therefore, 8 pM of denatured libraries were combined to 25% of 8 pM PhiX control and loaded into the MiSeq v3 reagent cartridge. Sequencing reactions were per-formed through the Illumina MiSeq System (PE 300 × 2), by obtaining an average read length of about 300 bp. The raw sequencing data are available in the SRA repository under the BioProject PRJNA759366.
Bioinformatic Analysis and Statistics
The Illumina MiSeq paired-end (PE) reads were denoised using a procedure relying on the inference of the Amplicon Sequence Variants (ASVs) (i.e. an estimation of the actual amplicons). The PE reads were treated with cut adapt to remove Illumina adaptors . The trimmed reads were merged using PEAR . The resulting merged reads were denoised by applying the DADA2 workflow . This procedure included the chimera- (i.e. PCR artifacts) and PhiX- (i.e. the PhiX phage is used during Illumina library preparation to increase nucleotide variability) removal [29,30,31]. ASV sequences were mapped against the human genome (release hg19) using bowtie2 to remove nonspecific amplification products (i.e. 16 S rRNA mitochondrial gene) .
The ASVs obtained were taxonomically annotated in BioMaS using the Ribosomal (RDP) database (release 11.5) and the NCBI taxonomy as 16 S rRNA reference collection and taxonomy, respectively [33,34,35,36]. In particular, the query sequences were aligned to the reference collection using bowtie2, and the resulting alignments were filtered according to query coverage (≥ 70%) and identity percentage (≥ 90%). A phylogenetic tree was inferred using the QIIME2 align-to-tree-mafft-fasttree plugin: a multiple sequence alignment of ASV sequences was obtained by using MAFFT and the phylogenetic tree was inferred by applying the maximum-likelihood procedure implemented in Fasttree 2 [37,38,39].
The taxonomic classification was performed using TANGO . In particular, for ASV sequences obtaining matches with an identity percentage equal or higher than 97% the classification at species level was accepted, otherwise ASVs were classified at higher taxonomic ranks . The ASV table was normalized by using rarefaction for diversity analysis . The Shannon and the Faith Phylogenetic indices [43, 44] were inferred as alpha diversity measure by applying the phyloseq R-package, and statistically relevant differences between groups were evaluated by applying the Wilcoxon test . The principal coordinates analysis (PCoA) that describes the diversity between the samples (i.e. Beta-diversity) based on the weighted and unweighted UNIFRAC metrics, were inferred by using the vegan R package and evaluated by PERMANOVA [46, 47].
The statistical comparison between the healthy and tumor samples was performed by using DESeq2 . To measure differences between tissues in the different conditions, the data were normalized by taking into account inter-sample variability. The p-values obtained were adjusted for multiple comparisons with the Benjamin-Hochberg method. Finally, a supervised model for sample classification was built using the Random Forest (RF) Machine Learning (ML) methods and the R package caret . In particular, the DESeq2 ASVs normalized counts were scaled and centered. Then the dataset was randomly divided into the training set and the test data set that including 54 and 14 samples, respectively. The tuning of RF hyperparameter mtry was performed by repeating cross-validation (10 cross-validation with 10 repeats) on the training dataset and the best mtry value was selected according to ROC metric. Lastly, the accuracy of the RF model was assessed on the test dataset.
The comparison between the breast tissue microbiota in tumor and that in paired normal adjacent tissues from 34 women affected by breast cancer enabled us to investigate the distribution of microbial communities of each sample. Each sample obtained more than 90% of reads thereby passing quality filtering with an average quality value of 30 (Q30) >80%. The analyzed data were produced by performing an Illumina MiSeq sequencing run, and we obtained a variable number of Paired End (PE) reads per sample (mean 130,820, sd 384,926.925, median 69,920, min 13,417, max 3,215,914). About 96% of input sequences passed the trimming of adaptors and the PCR primer step.
The overall quality of reverse reads was lower than that of forward reads for all the sequenced samples and, in this specific case, did not pass the quality filter in dada2 . To overcome this issue, we applied an approach based on PE reads merging before denoising [51, 52]. About 70% of input reads were successfully merged. The denoising step enabled us to infer the Amplicon Sequence Variants (ASV) sequences and their absolute counts. The ASV sequences were checked to remove chimeras and human contaminants. In order to achieve an adequate compromise between the microbiome sampling and the number of retained samples, the ASV table was rarefied using an equal sequencing depth of 15,000 (Additional File 1: Figure S1), 27 and 16 tumoral and non-tumoral samples were retained, respectively.
The alpha diversity was measured using the Shannon Index and plotted as a box-plot (Fig. 1a). No statistically significant differences were observed between the tested conditions according to the Shannon index (p-value = 0.1649). Conversely, the distribution of the Faith index differed significantly (p-value ≤ 0.05) between healthy and tumor tissue samples (Fig. 1b).
Although no clear clustering was observed in the PCoA plot based on weighted UniFrac analysis (Fig. 1c) between healthy (H) and tumor (T) tissue samples, the PERMANOVA suggested that about 7% of the observed variability is explained by the conditions (p-value = 0.007). Conversely, neither the PCoA plots nor the PERMANOVA based on unweighted UniFrac (data not reported) resulted in any significant difference between the two conditions (p-value = 0.103).
All the ASVs were taxonomically annotated at least at kingdom level. Generally, 13 phyla, 25 classes, 59 orders, 105 families, 199 genera and 514 species were identified across all samples. The distribution of phyla is shown in Fig. 2. The most predominant phyla are Actinobacteria and Proteobacteria (about 31% and 55.4% on average, respectively). Gammaproteobacteria (40.22%), Actinobacteria (25.09%), Bacilli (7.83%) and Alphaproteobacteria (5.57%) are the most abundant classes among all tumor and normal samples.
The most prevalent families are Propionibacteriaceae (23.57%), Moraxellaceae (17.83%) and Pseudomonadaceae (15.19%). The genera Propionibacterium (22.59%), Acinetobacter (15.43%) and Pseudomonas (15.10%) are the most abundant. The results of statistical comparisons are reported in Table 3. The box-plot of each statistically different taxon between healthy and tumor samples, are shown in Fig. 3 (A-F) and Fig. 4 (A-C), and in Additional File 1: Figure S2. Overall, in non-tumoral paired samples a higher abundance of taxa belonging the Actinobacteria phylum was found. In particular, the order Propionibacteriales, the family Propionibacteriaceae, the genus Propionibacterium and species Propionibacterium sp. enrichment culture clone MRHull-FeSM-11R and Propionibacterium acnes are more abundant in non-tumoral tissues (Fig. 3 A-F and Fig. 4 C). Conversely, Firmicutes and Alpha-proteobacteria are significantly overrepresented in tumoral tissues.
In order to identify the ASVs able to discern among tumoral and non-tumoral tissues by using a robust and reliable method, a supervised classification machine learning model was built using Random Forest (RF). To avoid overfitting and to properly train the model, the dataset was divided into a training and a test dataset, accounting for 54 and 14 samples, respectively. The overall accuracy of the test dataset was about 89%, with two misclassifications for healthy samples. The ASVs that most contribute to the model accuracy were selected and used to plot a heatmap (Fig. 5).
As shown in Fig. 5, two main clusters may be identified, the first one is constituted mainly by healthy tissues and the second one by tumor tissues. The first ASV was assigned to Propionibacterium acnes and was principally observed in healthy tissues. This result agrees with those obtained by comparing taxa abundances in DESeq2.
Regarding BRCA mutational status, there were only three BRCA-positive patients in our population and in particular one carrying a mutation in the BRCA1 gene and two in BRCA2 gene. Consequently, the data were not enough to carry a reliable statistical analysis. Similarly, the same issue was observed for other confounding factors, i.e. smoking status and contraception usage.
Studies of the entire microbial communities and their relationships with the host have been conducted to evaluate how their imbalance could be involved in health maintaining and diseases [20, 54,55,56,57,58,59,60,61]. In particular, several studies have linked the microbiome to the initiation and progression of different types of cancer, including breast cancer [58, 59]. Moreover, the cooperation of microbial communities’ imbalance with diet, obesity, estrogens and immune modulation has been considered an important promoter of breast cancer [12, 62]. Notably, the majority of authors [16,17,18,19, 24, 36, 37, 61,62,63,64] note that their findings are hypothesis-generators and support further investigations to identify a microbial risk signature for breast cancer and potential microbial-based prevention and/or therapies.
In this scenario, we studied the resident breast microbiota in tumor and paired normal breast tissue from 34 BC patients. The aim of our study was to evaluate the microbial composition of breast tumor tissues and healthy tissues in the attempt to shed light on the link between dysbiosis and breast cancer which, in turn, may indicate that a change in bacterial species could contribute to the modulation of cancer development. A comparison between paired healthy and tumor tissues revealed differences of bacterial community and composition. The number of ASVs detected between paired normal and tumor tissue showed significant differences in richness between the sampled communities. Proteobacteria and Actinobacteria showed differences between two groups: healthy tissues showed an increase of Actinobacteria and a decrease of Proteobacteria; the opposite appeared in tumor tissues. Conversely, in healthy tissues, appear to be more prevalent Propionibacterium and Pseudomonas.
In particular, we observed an overall decrease of microbial alpha diversity in tumoral tissues compared to healthy ones. We also found a significant depletion of Propionibacterium acnes in tumor tissues versus normal breast tissues, which is a novel finding. Propionibacterium acnes (currently denominated Cutibacterium acnes) is a component of the human microbiome found in several body districts. Its over-representation in normal tissue was observed by comparing abundances (DESeq2) and also by machine learning (Random Forest), which indicates that these results are robust. This gram-positive species is considered an opportunist pathogen because potentially pathogenic genes were found in the genomes available (5 phylotypes). However, the role of Propionibacterium remains to be established. For example, Talib et al. 2015  described a potential antitumoral action of Propionibacterium acnes in breast cancer, and Portillo et al. 2013  suggested that it plays a role in implant-associated infections.
Our study supports the presence of microbial DNA in breast tissues that could probably influence the local tissue microenvironment. In the attempt to minimize any external variations (including sample preparation and sequencing) between healthy and tumor tissues, we compared healthy tissues to the paired tumor breast tissues taken from each woman at the same time and in the same conditions. All 68 samples were amplified, purified and sequenced together in a single sequencing run in order to minimize any analytical variation. Although, our results are at variance from those reported by others [16,17,18,19, 24, 61, 62], it is important to highlight that differences in both experimental procedures (i.e., primer design and the use of bioinformatic pipelines to filter and to analyze data) and different cohort enrolled can affect results and, therefore, their comparison. Survey results about the breast cancer tissue microbiome, are reported for a more comprehensive comparison (Additional File 1: Table S1) and it is important to note how several factors, such as ethnicity, dietary habits, geographical origin, lactation status, pharmaco-therapeutic before surgery, the method of sample collection [66, 67] can affect the composition of microbial tissues . For instance, Fusobacterium nucleatum has been described as a key player in several pathological conditions, and particularly in colon rectal cancer. However, earlier work was principally based on a comparison between healthy and unhealthy samples [16,17,18,19, 24], not including paired tissues analysis.
Accordingly, another key difference is that the primer pairs we used differed from those used in other studies. In their review of the association between the gut/breast microbiota and breast cancer, Laborda-Illanes et al. 2020  highlighted the differences among studies in terms of data results. We counted 6 different combinations of the 16 S hypervariable region in 10 papers (i.e., V4=3, V6=2, V3-V4=1, V3-V5=2, V1-V2=1 and V3=1). Consequently, it may be misleading to compare surveys conducted using different marker regions, also considering the different efficiency in target amplification and in the resolution of taxonomic assignment.
Therefore, it is difficult and also controversial to define a precise signature of the breast cancer microbiome. Thus, our effort was not to define a universal bacterial signature in breast cancer tissues, but to reinforce the concept that it is an altered balance that characterizes tumor tissues versus healthy tissues in the same woman, also at the very close proximity regions, which per se increases significance of the microbial presence at the level of breast tissue cell transformation. Indeed, we also found that microbial alpha diversity was overall lower in tumor tissues than in healthy tissues.
Larger studies, conducted in diverse geographic regions, are needed to define - if existing - a precise bacterial signature for each type of tissue neoplasia and thus to determine the role played by the microbiome in breast cancer onset and development. Furthermore, it is difficult to use general approaches in different cohorts particularly those living in different geographic regions. Rather, it may be more effective to study patients, cohort-by-cohort or groups of subjects living in the same region and under similar environmental conditions. It is also necessary to understand, using in vitro systems as human tumoroids and mouse models, how different pre-surgery antibiotic regimens can induce disturbances in the breast microbiota and how these disturbances affect BC progression. Indeed, the lack of this information may represent a limitation.
It is now necessary to understand the effect that the metabolites produced from resident bacteria have on the development and progression of the breast. However, it is necessary not only to study the association among microbiota, tumor development and progression and/or anti-tumor immune responses using metagenomic sequencing technologies, but also to demonstrate microbiota functionality using transcriptional and/or metabolic profiling [68, 69], thereby paving the way to the application of further precision medicine in BC patients.
This study reveals a highly significant difference in the abundance of the various taxa of the microbiome in breast tumor tissues versus their healthy tumor-adjacent counterparts in women after surgery. These alterations reflect qualitative and quantitative differences of taxa, thus indicating their relevance in the comprehension of microbiome content and their role in tumor tissues.
Finally, assessing the different microbial composition in relation to BC onset and progression could be a goal to achieve in future studies on more numerous cohorts of patients.
Availability of data and materials
The raw sequencing data are available in the Sequence Read Archive (SRA) repository under the BioProject PRJNA759366.
Amplicon Sequence Variants
next generation sequencing
Coughlin SS. Epidemiology of Breast Cancer in Women. Adv Exp Med Biol. 2019;1152:9–29.
Van Cott C. Cancer Genetics. Surg Clin North Am. 2020;100:483–98.
Armstrong N, Ryder S, Forbes C, Ross J, Quek RGW. A systematic review of the international prevalence of BRCA mutation in breast cancer. Clin Epidemiol. 2019;11:543–61.
Stenehjem DD, Telford C, Unni SK, Bauer H, Sainski A, Deka R, et al. BRCA testing and outcomes in women with breast cancer. Breast Cancer Res Treat. 2021;186:839–850.
Esposito MV, Nunziato M, Starnone F, Telese A, Calabrese A, D’Aiuto G, et al. A novel pathogenic BRCA1 splicing variant produces partial intron retention in the mature messenger RNA. Int J Mol Sci. 2016;17:2145.
Nunziato M, Starnone F, Lombardo B, Pensabene M, Condello C, Verdesca F, et al. Fast detection of a BRCA2 large genomic duplication by next generation sequencing as a single procedure: A case report. Int J Mol Sci. 2017;18:2487.
Esposito MV, Minopoli G, Esposito L, D’Argenio V, Di Maggio F, Sasso E, et al. A functional analysis of the unclassified Pro2767Ser BRCA2 variant reveals its potential pathogenicity that acts by hampering DNA binding and homology-mediated DNA repair. Cancers (Basel). 2019;11:1454.
Hu C, Hart SN, Gnanaolivu R, Huang H, Lee KY, Na J, et al. A Population-Based Study of Genes Previously Implicated in Breast Cancer. N Engl J Med. 2021;384:440–51.
Dorling L, Carvalho S, Allen J, González-Neira A, Luccarini C, Wahlström C, et al. Breast Cancer Risk Genes — Association Analysis in More than 113,000 Women. N Engl J Med. 2021;384:428–39.
Fanale D, Incorvaia L, Filorizzo C, Bono M, Fiorino A, Calò V, et al. Detection of germline mutations in a cohort of 139 patients with bilateral breast cancer by multi-gene panel testing: Impact of pathogenic variants in other genes beyond brca1/2. Cancers (Basel). 2020;12:2415.
Nasir A, Bullo MMH, Ahmed Z, Imtiaz A, Yaqoob E, Jadoon M, et al. Nutrigenomics: Epigenetics and cancer prevention: A comprehensive review. Crit Rev Food Sci Nutr. 2020;60:1375–87.
Bodai BI, Nakata TE. Breast Cancer: Lifestyle, the Human Gut Microbiota/Microbiome, and Survivorship. Perm J. 2020;24:129.
Alizadehmohajer N, Shojaeifar S, Nedaeinia R, Esparvarinha M, Mohammadi F, Ferns GA, et al. Association between the microbiota and women’s cancers – Cause or consequences? Biomed Pharmacother. 2020;127:11020.
Yu Q, Jobin C, Thomas RM. Implications of the microbiome in the development and treatment of pancreatic cancer: Thinking outside of the box by looking inside the gut. Neoplasia (United States). 2021;23:246–56.
Komorowski AS, Pezo RC. Untapped “-omics”: the microbial metagenome, estrobolome, and their influence on the development of breast cancer and response to treatment. Breast Cancer Res Treat. 2020;179:287–300.
Smith A, Pierre JF, Makowski L, Tolley E, Lyn-Cook B, Lu L, et al. Distinct microbial communities that differ by race, stage, or breast-tumor subtype in breast tissues of non-Hispanic Black and non-Hispanic White women. Sci Rep. 2019;9:1–10.
Hieken TJ, Chen J, Hoskin TL, Walther-Antonio M, Johnson S, Ramaker S, et al. The microbiome of aseptically collected human breast tissue in benign and malignant disease. Sci Rep. 2016;6:30751.
Urbaniak C, Gloor GB, Brackstone M, Scott L, Tangney M, Reida G. The microbiota of breast tissue and its association with breast cancer. Appl Environ Microbiol. 2016;82:5039–48.
Wang H, Altemus J, Niazi F, Green H, Calhoun BC, Sturgis C, et al. Breast tissue, oral and urinary microbiomes in breast cancer. Oncotarget. 2017;8:88122–38.
Laborda-Illanes A, Sanchez-Alcoholado L, Dominguez-Recio ME, Jimenez-Rodriguez B, Lavado R, Comino-Méndez I, et al. Breast and gut microbiota action mechanisms in breast cancer pathogenesis and treatment. Cancers (Basel). 2020;12:2465.
Gubert C, Kong G, Renoir T, Hannan AJ. Exercise, diet and stress as modulators of gut microbiota: Implications for neurodegenerative diseases. Neurobiol Dis. 2020;134:104621.
Song M, Chan AT, Sun J. Influence of the Gut Microbiome, Diet, and Environment on Risk of Colorectal Cancer. Gastroenterology. 2020;158:322–40.
Zhang X, Pan Z. Influence of microbiota on immunity and immunotherapy for gastric and esophageal cancers. Gastroenterol Rep. 2020;8:206–21.
Meng S, Chen B, Yang J, Wang J, Zhu D, Meng Q, et al. Study of microbiomes in aseptically collected samples of human breast tissue using needle biopsy and the potential role of in situ tissue microbiomes for promoting malignancy. Front Oncol. 2018;8:318.
D’Argenio V, Esposito MV, Telese A, Precone V, Starnone F, Nunziato M, et al. The molecular analysis of BRCA1 and BRCA2: Next-generation sequencing supersedes conventional approaches. Clin Chim Acta. 2015;446:221–5.
Martin M. Cutadapt removes adapter sequences from high-throughput sequencing reads. EMBnet.journal. 2011;17:10–2.
Zhang J, Kobert K, Flouri T, Stamatakis A. PEAR: A fast and accurate Illumina Paired-End reAd mergeR. Bioinformatics. 2014;30:614–20.
Callahan BJ, McMurdie PJ, Rosen MJ, Han AW, Johnson AJA, Holmes SP. DADA2: High-resolution sample inference from Illumina amplicon data. Nat Methods. 2016;13:581–3.
Nagamine CM, Chan K, Lau YF. A PCR artifact: generation of heteroduplexes. Am J Hum Genet. 1989;45:337–339.
Kanagawa T. Bias and Artifacts in Multitemplate Polymerase Chain Reactions (PCR). J Biosci Bioeng. 2003;96:317–23.
Mukherjee S, Huntemann M, Ivanova N, Kyrpides NC, Pati A. Large-scale contamination of microbial isolate genomes by illumina Phix control. Stand Genomic Sci. 2015;10:18.
Langmead B, Salzberg SL. Fast gapped-read alignment with Bowtie 2. Nat Methods. 2012;9:357–9.
Santamaria M, Fosso B, Consiglio A, De caro G, Grillo G, Licciulli F, et al. Reference databases for taxonomic assignment in metagenomics. Brief Bioinform. 2012;13:682–95.
Fosso B, Santamaria M, Marzano M, Alonso-Alemany D, Valiente G, Donvito G, et al. BioMaS: A modular pipeline for Bioinformatic analysis of Metagenomic AmpliconS. BMC Bioinformatics. 2015;16:203.
Cole JR, Wang Q, Cardenas E, Fish J, Chai B, Farris RJ, et al. The Ribosomal Database Project: Improved alignments and new tools for rRNA analysis. Nucleic Acids Res. 2009;37:141–5.
Cole JR, Wang Q, Fish JA, Chai B, McGarrell DM, Sun Y, et al. Ribosomal Database Project: Data and tools for high throughput rRNA analysis. Nucleic Acids Res. 2014;42:633–42.
Bolyen E, Rideout JR, Dillon MR, Bokulich NA, Abnet CC, Al-Ghalith GA, et al. Reproducible, interactive, scalable and extensible microbiome data science using QIIME 2. Nat Biotechnol. 2019;37:852–7.
Katoh K, Standley DM. MAFFT multiple sequence alignment software version 7: Improvements in performance and usability. Mol Biol Evol. 2013;30:772–80.
Price MN, Dehal PS, Arkin AP. FastTree 2 - Approximately maximum-likelihood trees for large alignments. PLoS One. 2010;5:e9490.
Alonso-Alemany D, Barré A, Beretta S, Bonizzoni P, Nikolski M, Valiente G. Further steps in TANGO: Improved taxonomic assignment in metagenomics. Bioinformatics. 2014;30:17–23.
Mende DR, Sunagawa S, Zeller G, Bork P. Accurate and universal delineation of prokaryotic species. Nat Methods. 2013;10:881–4.
Weiss S, Xu ZZ, Peddada S, Amir A, Bittinger K, Gonzalez A, et al. Normalization and microbial differential abundance strategies depend upon data characteristics. Microbiome. 2017;5:27.
Pielou EC. Shannon’s Formula as a Measure of Specific Diversity: Its Use and Misuse. Am Nat. 1966;100:463–5.
Faith DP. Conservation evaluation and phylogenetic diversity. Biol Conserv. 1992;61:1–10.
McMurdie PJ, Holmes S. Phyloseq: An R Package for Reproducible Interactive Analysis and Graphics of Microbiome Census Data. PLoS One. 2013;8:e61217.
Chang Q, Luan Y, Sun F. Variance adjusted weighted UniFrac: a powerful beta diversity measure for comparing communities based on phylogeny. BMC Bioinformatics. 2011;12:118.
Oksanen J, Blanchet FG, Kindt R, Legendre P, Minchin PR, O’Hara RB, et al. Package vegan: Community Ecology Package. R Packag version 23-1. 2013.
Love MI, Huber W, Anders S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol. 2014;15:550.
Kuhn M. Building predictive models in R using the caret package. J Stat Softw. 2008;28:1–26.
Kwon S, Park S, Lee B, Yoon S. In-depth analysis of interrelation between quality scores and real errors in illumina reads. Proc Annu Int Conf IEEE Eng Med Biol Soc EMBS. 2013;2013:635–8.
Barone M, Mendozzi L, D’Amico F, Saresella M, Rampelli S, et al. Influence of a High-Impact Multidimensional Rehabilitation Program on the Gut Microbiota of Patients with Multiple Sclerosis. Int J Mol Sci. 2021;22:7173.
Bazzocchi G, Turroni S, Bulzamini MC, D’Amico F, Bava A, et al. Changes in gut microbiota in the acute phase after spinal cord injury correlate with severity of the lesion. Sci Rep. 2021;11:12743.
Murtagh F, Legendre P. Ward’s Hierarchical Agglomerative Clustering Method: Which Algorithms Implement Ward’s Criterion? Journal of Classification. 2014; 31:274–295.
Kordy K, Gaufin T, Mwangi M, Li F, Cerini C, Lee DJ, et al. Contributions to human breast milk microbiome and enteromammary transfer of Bifidobacterium breve. PLoS One. 2020;28;15:e0219633.
Mostafaei S, Kazemnejad A, Norooznezhad AH, Mahaki B, Moghoofei M. Simultaneous effects of viral factors of human papilloma virus and epstein-barr virus on progression of breast and thyroid cancers: Application of structural equation modeling. Asian Pacific J Cancer Prev. 2020;21:1431–9.
Dong TS, Jacobs JP, Hussain SK. Microbial Profiles of Cirrhosis in the Human Small Intestine. Current Gastroenterology Reports. 2019;21:50.
Acharya C, Bajaj JS. Gut Microbiota and Complications of Liver Disease. Gastroenterol Clin North Am. 2017;46:155–69.
Giuffrè M, Campigotto M, Campisciano G, Comar M, Crocè LS. A story of liver and gut microbes: How does the intestinal flora affect liver disease? A review of the literature. Am J Physiol - Gastrointest Liver Physiol. 2020;318:G889–906.
Fan Y, Pedersen O. Gut microbiota in human metabolic health and disease. Nat Rev Microbiol. 2021;19:55–71.
Ghosn B, Benisi-Kohansal S, Ebrahimpour-Koujan S, Azadbakht L, Esmaillzadeh A. Association between healthy lifestyle score and breast cancer. Nutr J. 2020;19:4.
Veiga P, Pons N, Agrawal A, Oozeer R, Guyonnet D, Brazeilles R, et al. Changes of the human gut microbiome induced by a fermented milk product. Sci Rep. 2014;4:6328.
Fernández L, Pannaraj PS, Rautava S, Rodríguez JM. The Microbiota of the Human Mammary Ecosystem. Front Cell Infect Microbiol. 2020;10:586667.
Xuan C, Shamonki JM, Chung A, DiNome ML, Chung M, Sieling PA, et al. Microbial dysbiosis is associated with human breast cancer. PLoS One. 2014;9:e83744.
Costantini L, Magno S, Albanese D, Donati C, Molinari R, Filippone A, et al. Characterization of human breast tissue microbiota from core needle biopsies through the analysis of multi hypervariable 16S-rRNA gene regions. Sci Rep. 2018;8:4–12.
Talib WH, Saleh S. Propionibacterium acnes augments antitumor, anti-angiogenesis and immunomodulatory effects of melatonin on breast cancer implanted in mice. PLoS One. 2015;10:1–13.
Portillo ME, Corvec S, Borens O, Trampuz A. Propionibacterium acnes: An underestimated pathogen in implant-associated infections. Biomed Res Int. 2013;2013:804391.
Thyagarajan S, Zhang Y, Thapa S, Allen MS, Phillips N, Chaudhary P, et al. Comparative analysis of racial differences in breast tumor microbiome. Sci Rep. 2020;10:14116.
Chen RY, Mostafa I, Hibberd MC, Das S, Mahfuz M, Naila NN, et al. A Microbiota-Directed Food Intervention for Undernourished Children. N Engl J Med. 2021;384:1517–28.
De Palma FDE, Del Monaco V, Pol JG, Kremer M, D’Argenio V, Stoll G, et al. The abundance of the long intergenic non-coding RNA 01087 differentiates between luminal and triple-negative breast cancers and predicts patient outcome. Pharmacol Res. 2020;161:105249.
We thank Drs Ilaria Granata and Mario Guarracino for their helpful suggestions and discussions during bioinformatic data analysis. We thank Jean Ann Gilder for text editing, with special reference to English language.
This work, in particular the wet experimental phase, was supported by grants PON03PE_00060_2 and PON03PE_00060_7 (Campania - Bioscience) from the Italian Ministry of University and Research (to F.S.), and CIRO and SATIN grants (to F.S.) from Regional (Campania Region, Italy) funds, including 2017, 2020 and 2021 Campania Region contribution.
Ethics approval and consent to participate
The study was conducted according to the guidelines of the Declaration of Helsinki, and approved by the Istituto Nazionale Tumori - Fondazione G. Pascale Ethics Committee (protocol number 3 of 03/25/2009). Informed Consent Statement: Written Informed consent was obtained from all subjects involved in the study.
Consent for publication
The authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Figure S1. Rarefaction curves used to define the rarefaction threshold. Figure S1. shows box-plot of statistically different taxa between healthy and tumor samples. Table S1. lists the differences among data obtained in different cohorts of patients in several studies by different Authors for results comparison.
About this article
Cite this article
Esposito, M.V., Fosso, B., Nunziato, M. et al. Microbiome composition indicate dysbiosis and lower richness in tumor breast tissues compared to healthy adjacent paired tissue, within the same women. BMC Cancer 22, 30 (2022). https://doi.org/10.1186/s12885-021-09074-y
- Breast cancer microbiome
- Microbial dysbiosis
- Breast cancer tissues
- Next generation sequencing
- Breast healthy tissues
- Microbiome composition
- cancer/healthy paired samples
- 16S rRNA