Profiling the B/T cell receptor repertoire of lymphocyte derived cell lines
BMC Cancer volume 18, Article number: 940 (2018)
Clonal VDJ rearrangement of B/T cell receptors (B/TCRs) occurring during B/T lymphocyte development has been used as a marker to track the clonality of B/T cell populations.
We systematically profiled the B/T cell receptor repertoire of 936 cancer cell lines across a variety of cancer types as well as 462 Epstein-Barr Virus (EBV) transformed normal B lymphocyte lines using RNA sequencing data.
Rearranged B/TCRs were readily detected in cell lines derived from lymphocytes, and subclonality or potential biclonality was found in a number of blood cancer cell lines. Clonal BCR/TCR rearrangements were detected in several blast phase CML lines and, unexpectedly, in one gastric cancer cell line (KE-97), reflecting a lymphoid origin of these cells. Notably, clonality was highly prevalent in EBV transformed B lymphocytes, suggesting either that transformation occurred in only a few B cells or that those with a growth advantage dominated the transformed population through clonal evolution.
Our analysis reveals the complexity and heterogeneity of the BCR/TCR rearrangement repertoire and provides a unique insight into the clonality of lymphocyte derived cell lines.
Clonal V(D)J [variable (V), diversity (D) and joining (J)] rearrangement, which occurs during development of B/T lymphocytes, has been used as a marker to track the clonality of B/T cell populations [1, 2]. This approach is feasible because lymphoid neoplasm/lymphoproliferative cells originate and expand from a single cell, and the progeny cells share the same VDJ rearrangement. A pattern of a monoclonal/oligoclonal population (manifested as the over-representation of either one or a few uniquely rearranged sequences) suggests the presence of a lymphoid neoplasm [or a non-malignant clonal lymphoproliferative disorder, such as monoclonal B cell lymphocytosis or monoclonal gammopathy of undetermined significance (MGUS)].
In this study, we systematically profiled the B/T cell receptor repertoire of 936 cancer cell lines across a variety of cancer types, as well as 462 Epstein-Barr Virus (EBV) transformed normal B lymphocyte lines, using RNA sequencing data from the Cancer Cell Line Encyclopedia (CCLE) and the Geuvadis RNA sequencing project of 1000 Genomes samples. This study cohort contains cell lines from a variety of solid tumors and 164 blood cancer cell lines (annotated as haematopoietic and lymphoid tissue in CCLE), as well as immortalized “normal” B-lymphocyte cell lines. Cancer cell lines are typically deemed to be “pure”, owing to the lack of normal stromal cells and infiltrating T/B cells that are frequently present in primary tumor samples; thus, this cell line collection provides a unique opportunity to profile faithfully and comprehensively the immunoglobulin/TCR gene rearrangement events in different types of blood cancers.
Transcriptome sequencing data were downloaded from the CCLE and Geuvadis RNA sequencing databases, and the B/T cell receptor repertoire of each cell line was analyzed using MiXCR. The 936 CCLE cancer cell lines were authenticated before uniformly processed RNA sequencing (paired-end 100 × 2 bp). The growth and uniformly processed RNA sequencing (paired-end 75 × 2 bp) of the 462 Epstein-Barr Virus (EBV) transformed normal B lymphocyte lines have been described previously. The B cell and T cell gene expression signatures and the expression of lineage specific markers (e.g., CD4/CD8 for T cells and CD19/CD20 for B cells) were analyzed from the cell line microarray expression data. The relative clonal CPM value (counts per million RNA sequencing reads) was calculated by dividing the clonal read count by the total RNA sequencing read count and scaling to one million reads. Each clonotype was defined by a unique nucleotide sequence of the VDJ junction (which codes for the CDR3 region) supported by ≥30 sequencing reads. Two clonotypes that have the same rearrangement pattern but differ by ≥1 bp in the nucleotide sequence of their VDJ junction were considered different clonotypes. As we cannot exclude the possibility of expression of biallelic rearrangements (where the second allele is usually non-productive or has markedly decreased expression), we refer to bi/oligoclonality as the observation of more than two/four clonotypes within the same type of rearrangement within a cell line (≥3 IGH, ≥3 IGL or ≥3 TRAV, or >4 IGKV or >4 TRBV). For example, a cell line with three different IGH rearrangements (e.g., 60% IGHV6–1-IGHD1–20-IGHJ4, 20% IGHV3–23-IGHD2–8-IGHJ6 and 18% IGHV3–20-IGHD2–8-IGHJ5) will be regarded as potentially bi/oligoclonal. A nonproductive rearrangement was defined as an out of frame rearrangement (denoted with “_” in the CDR3 sequence) or a rearrangement with a stop codon inside the CDR3 region (denoted with “*” in the CDR3 sequence).
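The definitions above can be sketched in a few lines of Python. This is an illustrative sketch only, not the pipeline used in the study: the record layout `(chain, junction_nt, reads)`, the function names and the junction sequences are hypothetical, and the mapping of the TRAV/IGKV/TRBV thresholds onto the locus labels `TRA`, `IGK` and `TRB` is an assumption made for brevity. It does not reproduce the MiXCR output format.

```python
from collections import Counter

MIN_READS = 30  # read-support threshold for a clonotype, as stated in the text

# Largest number of clonotypes per rearrangement type still treated as
# compatible with a single clone (thresholds taken from the text; the
# locus labels used as keys are an assumption of this sketch).
MAX_SINGLE_CLONE = {"IGH": 2, "IGL": 2, "TRA": 2, "IGK": 4, "TRB": 4}


def cpm(clonal_reads, total_reads):
    """Clonal read count scaled to one million total RNA sequencing reads."""
    return clonal_reads * 1_000_000 / total_reads


def is_nonproductive(cdr3):
    """True if the CDR3 is out of frame ('_') or contains a stop codon ('*')."""
    return "_" in cdr3 or "*" in cdr3


def count_clonotypes(records):
    """Count clonotypes per chain from (chain, junction_nt, reads) tuples.

    Reads are pooled per unique junction nucleotide sequence, so junctions
    that differ by even 1 bp remain separate clonotypes.
    """
    reads_per_junction = Counter()
    for chain, junction_nt, reads in records:
        reads_per_junction[(chain, junction_nt)] += reads
    per_chain = Counter()
    for (chain, _junction), reads in reads_per_junction.items():
        if reads >= MIN_READS:
            per_chain[chain] += 1
    return per_chain


def potentially_oligoclonal(per_chain):
    """True if any listed chain exceeds its single-clone clonotype limit."""
    return any(
        chain in MAX_SINGLE_CLONE and n > MAX_SINGLE_CLONE[chain]
        for chain, n in per_chain.items()
    )


# Made-up junction sequences and read counts, for illustration only.
example = [
    ("IGH", "TGTGCGAGAGAT", 600),
    ("IGH", "TGTGCGAGAGAC", 200),  # 1 bp from the first: separate clonotype
    ("IGH", "TGTGCAAAAGGG", 180),
    ("IGH", "TGTGCGCGTTTT", 12),   # below the 30-read threshold, ignored
    ("IGK", "TGTCAACAGTAT", 900),
]
counts = count_clonotypes(example)
print(dict(counts), potentially_oligoclonal(counts))
```

In this toy input, three IGH clonotypes pass the read threshold, so the line would be flagged as potentially bi/oligoclonal, whereas up to four IGK clonotypes alone would not trigger the flag.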
BCR repertoire of cancer cell lines derived from B lymphocytes
Cancer cell lines derived/transformed from mature B cells that have undergone BCR selection include multiple myeloma (n = 25), mantle cell lymphoma (n = 4), Burkitt lymphoma (n = 10) and chronic lymphocytic leukemia-small lymphocytic lymphoma (CLL, n = 4), among others. High expression of both IGH (heavy chain) and IGK/L (light chain) rearrangements was detected in all of the mantle cell lymphoma, Burkitt lymphoma and CLL-small lymphocytic lymphoma cell lines (Fig. 1). In multiple myeloma (MM), except for cell line KMS-12-BM [in which only a productive IGH rearrangement (IGHV3–7-IGHD4–23-IGHJ6, 18,169 reads) is dominantly expressed], many of the MM cell lines (Additional file 1: Tables S1 and S2) express a single dominant IGK or IGL rearrangement accompanied by either an extremely low (< 30 RNA sequencing reads) or a completely undetectable heavy chain (IGH) rearrangement (Fig. 2). This suggests that these cell lines (NCI-H929, KARPAS-620, L363, SK-MM-2, JJN3, RPMI8226, KMS-28BM, KMS-11, KMS34, OPM-2, KMS-26) likely belong to light chain multiple myeloma (LC-MM), a poor prognosis MM subtype constituting 15% of myelomas, which is characterized by either deletion or silencing of the IGH loci after VDJ rearrangement. While the majority of multiple myeloma cell lines express either a single dominant IGK or IGL rearrangement, expression of biallelic rearrangements (or potential biclonality) was found in a few cell lines (Additional file 1: Figure S2). For example, two productive IGK rearrangements are detected in cell line KMS-27 (625,058 reads of IGKV2–40-IGKJ4 and 382,112 reads of IGKV1–39-IGKJ1).
Potential subclonality was found in several Burkitt lymphoma cell lines (e.g., NAMALWA, EB1 and CA46, Fig. 3). Subclonal rearrangements in these cell lines share the same rearrangement pattern and similar CDR3 sequences. For example, three similar IGH rearrangements were detected in cell line CA46 (Additional file 1: Table S1). These IGH rearrangements share an identical rearrangement pattern (IGHV5–51-IGHD5–12-IGHJ4) but differ slightly in their CDR3 sequences (CARFNRGGDYW, CARFDRGGDYW, CARARFDRGGDYW, Fig. 3b), suggesting that the subclonal rearrangements may have derived from the dominant rearrangement through somatic hypermutation. Similarly, four IGLV4–60-IGLJ3 rearrangements were detected in the cell line NAMALWA. Three of them were barely expressed (208 reads, 120 reads and 116 reads; two of these are non-productive) and their sequences are highly similar to the dominant rearrangement (67,629 reads, Additional file 1: Table S1). These three minor subclonal rearrangements also appear to be derived from the dominant clone through somatic hypermutation.
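One way to put a number on "slightly different" CDR3 sequences is an edit distance between them. The sketch below applies a standard Levenshtein distance to the three CA46 CDR3 amino acid sequences quoted above. It is offered only as an illustration: the study does not state that this metric was used, the function name is hypothetical, and clonotypes in the study are defined at the nucleotide rather than the amino acid level.

```python
def edit_distance(a, b):
    """Levenshtein distance: minimum number of single-character
    substitutions, insertions or deletions turning a into b."""
    prev = list(range(len(b) + 1))  # distances from the empty prefix of a
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            curr.append(min(
                prev[j] + 1,               # delete ca
                curr[j - 1] + 1,           # insert cb
                prev[j - 1] + (ca != cb),  # match or substitute
            ))
        prev = curr
    return prev[-1]


# CDR3 sequences reported for the CA46 IGH rearrangements.
ca46 = ["CARFNRGGDYW", "CARFDRGGDYW", "CARARFDRGGDYW"]
for i in range(len(ca46)):
    for j in range(i + 1, len(ca46)):
        print(ca46[i], ca46[j], edit_distance(ca46[i], ca46[j]))
```

Small pairwise distances between sequences that share the same V, D and J segments are what one would expect if the minor variants arose from the dominant rearrangement rather than from independent recombination events.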
Potential subclonality was also found in diffuse large B cell lymphoma (DLBCL) cell lines Pfeiffer and Toledo (Additional file 1: Table S2). All DLBCL cell lines (n = 17), except two (A3/KAW, CPM = 0.11, and U-937, CPM = 0.21), express high levels of clonal rearrangement of both IGH and IGL/K (Additional file 1: Table S1) [9, 10]. Indeed, U-937 is well recognized as an AML cell line rather than a DLBCL line, despite the fact that this cell line was originally established from the pleural effusion of a patient with histiocytic lymphoma and is still categorized as a histiocytic lymphoma by both ATCC (https://www.atcc.org/Products/All/CRL-1593.2.aspx) and CCLE. The analysis of the expression of major lineage genes also suggests that these two cell lines should not be grouped together with other DLBCL cell lines, as neither of them expresses typical B cell genes such as CD19, CD20 or CD79. Notably, cell line A4/Fuk also displays an abnormal expression pattern of major lineage specific genes. Conclusions drawn from experiments using this cell line for DLBCL research may also need to be interpreted with caution.
Except for BCP-1 and CI-1, cell lines derived from unspecified B cell lymphoma (n = 4) also express high levels of heavy and light chain BCR rearrangements. Cell line BCP-1 shows an inconsistent gene expression pattern as compared with other B cell lymphoma lines (e.g., no expression of CD19, CD20), suggesting that it might need to be reclassified into other blood cancer types. In contrast to the finding of subclonality in Burkitt lymphoma or DLBCL, cell lines NU-DUL-1 and JM1 contain more than 3 different IGLV rearrangements (Fig. 3c; e.g., in cell line NU-DUL-1, 10,266 reads were detected for the rearrangement IGLV1–40-IGLJ2; 3738 reads for IGLV1–44-IGLJ3; 3005 reads for IGLV4–60-IGLJ3, 2974 reads for IGLVI-70-IGLJ3, 1180 reads for IGLV1–44-IGLJ3 (encoding a different CDR3) and 507 reads for IGLV9–49-IGLJ3), suggesting potential bi/oligoclonality of these cell lines and that the subclonal rearrangements had likely evolved independently.
Hodgkin lymphoma cell lines (n = 8) were derived from mature B cells (either germinal center (GC) or post GC B cells), but are also known to have a global down-regulation of B cell gene expression and a general loss of B cell phenotype. Consistent with this notion, analysis of these cell lines confirmed down-regulation of B cell genes such as CD19, CD20, EBF1, etc. (except for the cell line HS.611 T, which still expresses high levels of CD19/CD20 and CD79A/B). Only two of 8 Hodgkin lymphoma cell lines express low levels of BCR rearrangements: cell line HS.611 T (IGKV1–39-IGKJ1, 2517 reads, CPM = 14.1; this cell line is EBV positive and has a high level of EBV viral gene expression) and KM-H2 (IGKV4–1-IGKJ4, 878 reads, CPM = 6.38). Notably, multiple subclonal rearrangements were detected in the cell line HS611.T, and all of them share the same IGH or IGKV rearrangement as well as highly similar CDR3 junction sequences; these subclonal rearrangements were likely derived from the major clonotypes through somatic hypermutation (Fig. 3b, d).
In contrast to the aforementioned cell lines arising from mature B cells, most of the B-ALL cell lines (n = 15) express low levels of IGH rearrangement as their dominant clonotypes. Consistent with the notion that most ALL cells had arrested in early stages of B/T cell development, RAG1/2 (the enzymes mediating the VDJ rearrangement process) are highly expressed in most ALL cell lines (both B and T cell ALL, Additional file 1: Figure S1). Three of the ALL cell lines had either extremely low or undetectable clonal BCR [MHH-CALL-2 (CPM = 0.06), SEM (no dominant BCR clone), REH (CPM = 0.66)]. This may suggest a lack of complete IGH and IGK/L rearrangements in immature lymphoid neoplasms, where transformation likely occurred prior to BCR rearrangement (before the pre-B stage). Transcription of biallelic rearrangements of IGH or IGK/L or potential biclonality was found in 5 of 15 B-ALL cell lines (33.3%, RCH-ACV, 697, KOPN-8, RS4;11 and NALM-6) (Additional file 1: Figure S2 and Table S2). Intriguingly, in B-ALL cell line MHH-CALL-4, 214 reads of nonproductive TCR (TRAV8–2-TRAJ8) and 163 reads of nonproductive BCR (IGHV2–26-IGHD2–21-IGHJ5) rearrangements were detected. This cell line expressed typical B cell lineage genes such as CD19 and CD79 instead of T cell lineage genes (such as CD3), suggesting cross lineage rearrangement [14, 15] or a potential mixed immunophenotype of this cell line (Additional file 1: Table S2).
In general, rearrangements of IGK and IGL were highly expressed as dominant rearrangements in most B cell malignancies, highlighting the importance of examining the IGK and IGL loci in future BCR repertoire and clonality analysis. The usage of IGKV/IGLV gene segments with the highest expressed rearrangement in the B-ALL, T-ALL, Burkitt lymphoma, DLBCL and multiple myeloma cohorts is summarized in Fig. 2. Most lymphocyte derived blood cancer cell lines express high levels of a productive rearrangement, except for ALL. Almost half of all B-ALL cell lines (7 out of 15) carry a non-productive BCR in either both alleles or in the only expressed allele (Table 1). This observation is in agreement with our recent finding that ~41% of primary pediatric ALL samples (91 out of 219 cases) lack productive BCR expression. B cells carrying a nonproductive BCR may bypass the BCR checkpoint through acquisition of driver mutation(s) which mimic BCR signaling (e.g., BCR-ABL1) [13, 17,18,19,20,21,22,23,24]. Alternatively, as noted recently, the pre BCR may function as a tumor suppressor in the majority of precursor B-ALL.
TCR repertoire of cell lines derived from T lymphocytes
T lymphocyte cancer cell lines include T-ALL and T cell lymphoma. Remarkably, all five cell lines established from patients with anaplastic large cell lymphoma carry the same TCR alpha chain rearrangement (TRAV40-TRAJ4) with an identical CDR3 sequence (CLLGSISLGILSQ, 170–250 reads, Additional file 1: Table S1). The identical rearrangement/CDR3 sequence has also been detected in DEL, a cell line established from malignant histiocytosis but which has also been recognized as an ALK-positive anaplastic large-cell lymphoma cell line.
Mycosis fungoides-Sezary syndrome belongs to cutaneous T-cell lymphoma (cell lines = 3). This disease is a neoplasia of T lymphocytes often possessing a helper/inducer cell surface phenotype. Expression of productive alpha and beta TCR rearrangements (200–5000 reads) occurs in all three cell lines in the cohort. Expression of productive alpha and beta TCR rearrangements was also found in a peripheral T cell lymphoma cell line (SUP-T11) and an adult T cell lymphoma-leukemia cell line (HH) (Additional file 1: Table S2).
With the exception of the cell line ALL-SIL, which has a TCR gamma rearrangement (TRGV7-TRGJ1), the other 14 T-ALL cell lines express TCR beta rearrangements as their dominant clonotypes (TRBV-TRBD-TRBJ or TRBV-TRBJ). Most of the T-ALL cell lines express biallelic rearrangements [28,29,30] or potential biclonality, containing more than one alpha or beta TCR rearrangement (Additional file 1: Table S2, Additional file 1: Figure S3). In line with allelic exclusion, the second TCR allele often encodes a non-productive CDR3 or a poorly expressed transcript (for example, in cell line MOLT-16, 15,457 reads were detected for the productive dominant rearrangement TRBV20–1-TRBD1-TRBJ2–3, while only 837 reads were detected for the putative second allele, TRBV27-TRBD1-TRBJ1–1, which encodes an out of frame CDR3: CASTDPDR_EWTEAFF). Similar to B-ALL, a number of T-ALL cell lines carry non-productive rearrangements in either both alleles or in the only expressed allele, resulting in a complete lack of expression of functional TCR. For example, cell line ALL-SIL expresses two TCR beta rearrangements and two gamma rearrangements, and all of these rearrangements code for an out of frame CDR3 (Table 1, Additional file 1: Table S2). The prevalence of non-functional TCR/BCR in ALL (both B and T ALL) again supports the recent hypothesis that the TCR/BCR might play a tumor suppressive role in most precursor ALL.
Myeloid derived cancer cell lines and solid tumor cell lines
Cell lines derived from solid tumors and myeloid cells do not undergo B/TCR rearrangements. In almost all of the solid tumor cell lines (except for the few discussed below) and myeloid derived cancer cell lines (AML, n = 31; blast phase CML, n = 12; CML, n = 2; essential thrombocythaemia, n = 1), fewer than 50–100 BCR/TCR reads were detected in the dominant rearrangement (CPM < 0.5). A few rearrangements appear to be commonly detected in a number of solid cancer and AML cell lines with very low read counts. For example, the rearrangement TRAV8–7 (non-functional segment)-TRAJ19 (non-functional segment) was commonly found in many cell lines with an extremely low read count (< 30). These barely expressed rearrangements may be caused by non-specific sequencing noise or possibly by trace amounts of cross contamination during sample preparation and sequencing [31,32,33].
Of the 12 blast crisis CML cell lines, a few (CML-T1, NALM-1, BV-173) carry clonal rearrangements (Additional file 1: Table S2), indicating that these cell lines were established from “lymphoid” blast crisis. For example, rearrangements of the BCR heavy chain (IGHV3–9-IGHD2–21-IGHJ6, 5346 reads) were detected in NALM-1. Indeed, similar to ALL, NALM-1 expresses high levels of RAG1/2 and CD19/20/CD79A/B as well as antigens specific to ALL.
Remarkably, one gastric cancer cell line, KE-97, expresses high levels of IGL and IGH rearrangements (IGLV3–21-IGLJ1, 19,374 reads; IGHV1–46-IGHD3–10-IGHJ4, 2129 reads). This cell line was derived from a mucinous gastric adenocarcinoma from a 52-year-old Japanese male. Considering that 1–4% of gastrointestinal malignancies are gastric lymphomas, this cell line was likely derived from a gastric lymphoma instead of a gastric carcinoma. Alternatively, as suggested by a previous single-nucleotide polymorphism (SNP) array analysis, the SNP pattern of this cell line was highly similar to that of KMS-18, a multiple myeloma cell line established in Japan a few years after KE-97. This could represent a potential mix-up between these two cell lines.
EBV transformed normal B lymphocyte cell lines (cohort 462 lines)
The same bioinformatics pipeline was used to analyze 462 EBV transformed normal B lymphocyte cell lines which were immortalized from healthy donors' B cells. Rearrangements of both IGH and IGK/L were readily detected in almost all of the lymphocyte cell lines (Fig. 4a). IGH rearrangements were found in 459 cell lines, while IGK and IGL rearrangements were detected in 426 and 421 cell lines, respectively, supporting the notion that most of the EBV immortalized cells were derived from mature B cells. In contrast to the highly diversified repertoire of the normal circulating B cell population, clonality occurred in the majority of EBV transformed B lymphocyte cell lines, reflecting a loss of diversity and clonal evolution during the establishment, growth and subculture of these cell lines. Among the 462 B-lymphocyte cell lines, 54 appeared to be monoclonal, as only one IGH or IGK/IGL rearrangement was detected (at a sequencing depth comparable to that of the other cell lines). Potential biallelic rearrangement or biclonality (2 IGH or IGL or 2–4 IGK) was detected in another 53 cell lines. For the remaining 355 cell lines, bi/oligo or polyclonality (> 2 IGH or > 2 IGL or > 4 IGK) was detected. Most of these have fewer than 20 clonal rearrangements (with a threshold of > 100 reads) and many of them have one highly expressed dominant rearrangement with a high clonotype fraction (Fig. 5a, Additional file 1: Figure S4). The clonal status of different loci appears correlated, and cell lines with polyclonality at one locus tend to display polyclonality at the other loci (Fig. 5a, b). A few cell lines still retained a relatively diversified population. For example, the cell lines ERR188358 and ERR188025 contain more than 80 different IGK (83 and 85, respectively) and more than 30 different IGH rearrangements (Fig. 5a). The inferred phylogenetic trees based on the CDR3 region of the dominant rearrangements of these 462 lymphocyte lines are shown in Fig. 5c.
We hypothesize that EBV mediated transformation occurred in a number of B cells, and during culture, some clones outcompeted the others and gradually became dominant clones. Therefore, diversity was lost and the population became oligoclonal or even monoclonal [39, 40]. In total, 9827 different rearrangements of IGH/IGK/IGL (with a cut off of at least 100 sequencing reads, hereafter referred to as index rearrangements) were detected in 426 EBV transformed B lymphocyte lines; 8 of them are highly expressed as prominent rearrangements which occupy more than 90% of the clonotype fraction in their corresponding cell lines (Additional file 1: Table S3). All (except two) of the dominant rearrangements of the 462 B lymphocyte lines encode a productive BCR. Among the 9827 index rearrangements detected in these cell lines, only 363 subclonal rearrangements code for a nonproductive BCR (353 are out of frame and 50 are interrupted by an internal stop codon, with 40 falling into both categories). The majority of these non-productive BCR rearrangements were barely expressed. Among all the 462 lymphocyte cell lines, only one TCR rearrangement was detected (TRBV7–9-TRBJ2–1, 143 reads), in cell line ERR188361 (Additional file 1: Table S4). To gain further insight into the BCR repertoire of EBV transformed normal B lymphocytes, we generated a heatmap of heavy/light chain gene segment usage of these 462 lymphocyte lines (Fig. 6, Additional file 1: Figure S4). Analysis of the IGH gene usage revealed a modest, potential population bias in IGHV segment usage (for example, an increase of IGHV3–23 usage was noted in cell lines established from Finnish (FIN) and British (GBR) individuals, Fig. 6a), which may reflect an exposure history to certain common antigens in each population [41,42,43].
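The clonotype fraction used above to call a rearrangement prominent (more than 90%) can be illustrated with a short sketch. The example assumes that the fraction is a clonotype's read count divided by the summed read counts of all clonotypes considered at the same locus; the exact denominator used in the study is not stated, so this is an assumption, and the function names are hypothetical. As input it uses the six NU-DUL-1 IGL read counts quoted earlier in the text, which were given as examples and may not be the complete list for that cell line.

```python
def clonotype_fractions(read_counts):
    """Map each clonotype label to its share of the pooled read count."""
    total = sum(read_counts.values())
    return {name: reads / total for name, reads in read_counts.items()}


def prominent_clonotype(read_counts, min_fraction=0.9):
    """Return the top clonotype if its fraction exceeds min_fraction, else None."""
    fractions = clonotype_fractions(read_counts)
    top = max(fractions, key=fractions.get)
    return top if fractions[top] > min_fraction else None


# IGL read counts quoted in the text for cell line NU-DUL-1.
# The (a)/(b) suffixes are labels added here to keep the two
# IGLV1-44-IGLJ3 rearrangements (different CDR3) apart.
nu_dul_1 = {
    "IGLV1-40-IGLJ2": 10266,
    "IGLV1-44-IGLJ3 (a)": 3738,
    "IGLV4-60-IGLJ3": 3005,
    "IGLVI-70-IGLJ3": 2974,
    "IGLV1-44-IGLJ3 (b)": 1180,
    "IGLV9-49-IGLJ3": 507,
}
for name, fraction in clonotype_fractions(nu_dul_1).items():
    print(f"{name}: {fraction:.1%}")
print(prominent_clonotype(nu_dul_1))
```

Under these assumptions the largest NU-DUL-1 rearrangement accounts for a little under half of the listed IGL reads, so no single rearrangement would count as prominent by the 90% criterion.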
One of the potential pitfalls of profiling the BCR/TCR repertoire using RNA sequencing is that non-transcribed rearrangements may not be detected, and the clonality analysis may be biased by the expression level of the BCR/TCR loci. Nevertheless, in comparison to the traditional PCR based method, which is labor intensive and requires well trained staff, NGS/RNA sequencing based analysis is more standardized and can simply be outsourced to large sequencing centers. In contrast to PCR based analysis, which is limited by the number of loci that can be feasibly examined, NGS based profiling can comprehensively detect all expressed TCR/BCR rearrangements, including some rearrangements which are not usually included in PCR based analysis (e.g., TCR alpha rearrangement). The BCR/TCR repertoire identified using RNA sequencing was consistent with previous studies using BIOMED-2 PCR Sanger sequencing [29, 30] (the TCR gene names in those papers use old aliases and need to be converted to standard gene names using Genecards, https://www.genecards.org). In addition, the feasibility of profiling BCR repertoires from RNA sequencing data has recently been explored in CLL. In that study comparing the two approaches, BCR repertoire profiling using RNA sequencing showed equal or superior results as compared with traditional PCR and Sanger sequencing (the clinical technique). In addition, with a few exceptions which are likely caused by accidental mix-up, a consistent BCR/TCR repertoire could be obtained when comparing the RNA sequencing data from CCLE with sequencing results for blood cancer cell lines generated by different researchers (for example, cancer cell line RNA sequencing data in the SRA database, https://www.ncbi.nlm.nih.gov/sra). This suggests that the BCR/TCR repertoire pattern may be used as an alternative/complementary authentication method for lymphocyte derived cell lines.
The advantage of this approach is that when RNA sequencing data are available (RNA sequencing is already widely applied in many kinds of research), the cell lines used in these studies (which need to be lymphocyte derived) can be simultaneously validated based on the BCR/TCR profile. As such, the RNA sequencing data deposited in online public databases (GEO, SRA) can be further checked and authenticated independently by any researcher. Crucially, we noticed that poly A selection before sequencing is important for examining the BCR/TCR repertoire, as poly A enriched samples generated 100-fold more BCR/TCR sequencing reads as compared to samples which were not subjected to poly A selection but were sequenced at a similar sequencing depth. Nonetheless, RNA sequencing of samples without poly A selection can still discover the majority of BCR/TCR rearrangements. However, the accuracy of the clonotype fraction of subclonal rearrangements may be significantly affected.
Currently, multiplex PCR based T/B cell receptor rearrangement testing is used as a clinical approach to detect suspected lymphoproliferative disease. This has occasionally been hindered by a deletion/translocation (e.g., t(11;14)) event in the Ig loci, and different sets of primer panels may need to be tested before clonality can be inferred. Recently, the cost of next-generation sequencing (NGS) has quickly decreased and has gradually become comparable with that of traditional PCR-Sanger analysis. RNA sequencing is straightforward with standardized procedures, eliminating the need for patient based personalized BCR/TCR primer set selection and optimization. Furthermore, RNA sequencing can be scaled up to a large number of samples easily, allowing simultaneous examination of gene expression, SNPs and somatic mutations, in addition to the B/TCR rearrangement repertoire. Our analysis highlights the potential of using RNA sequencing as a diagnostic test to examine the BCR/TCR clonal rearrangement in lymphoid malignancy.
The observation of subclonality or potential biclonality in a number of blood cancer cell lines is interesting. In most of these cases, the subclones appear to have been derived from the major clones through somatic hypermutation (e.g., Burkitt lymphoma cell lines NAMALWA, GA-10, EB1, CA46 and Hodgkin lymphoma cell line HS611.T etc.). However, in the B cell lymphoma cell lines NU-DUL-1 and JM1 and the multiple myeloma cell line AMO-1, more than three different IGL (cell lines NU-DUL-1, JM1) or IGH (cell line AMO-1) rearrangements were detected, suggesting independent biclonality in these cell lines. As cancer cell lines are generally regarded as being of monoclonal origin, the potential biclonality in these cell lines is interesting and may require further detailed study. On the other hand, our observation of high clonality and potential clonal selection/evolution in EBV transformed normal B lymphocytes suggests that careful experimental design and interpretation of the results may be required when using EBV transformed lines as a model to study the normal B cell population, B cell gene expression or quantitative trait loci (QTL) [40, 46].
In summary, we comprehensively profiled the B/T cell receptor repertoire in 936 cancer cell lines and 462 samples of EBV transformed normal B lymphocytes. The relatively “pure” nature of cancer cell lines circumvents the problem of tumor infiltrating T/B and stromal cells in primary tumor samples. Our analysis provides unique insights into the BCR/TCR rearrangement repertoire and clonality of cell lines derived from lymphocytes.
B/TCRs: B/T cell receptors
CCLE: Cancer Cell Line Encyclopedia
V(D)J: variable (V), diversity (D) and joining (J)
van Dongen JJ, Langerak AW, Bruggemann M, Evans PA, Hummel M, Lavender FL, Delabesse E, Davi F, Schuuring E, Garcia-Sanz R, et al. Design and standardization of PCR primers and protocols for detection of clonal immunoglobulin and T-cell receptor gene recombinations in suspect lymphoproliferations: report of the BIOMED-2 concerted action BMH4-CT98-3936. Leukemia. 2003;17(12):2257–317.
Warren EH, Matsen FA, Chou J. High-throughput sequencing of B- and T-lymphocyte antigen receptors in hematology. Blood. 2013;122(1):19–22.
Klinger M, Zheng J, Elenitoba-Johnson KS, Perkins SL, Faham M, Bahler DW. Next-generation IgVH sequencing CLL-like monoclonal B-cell lymphocytosis reveals frequent oligoclonality and ongoing hypermutation. Leukemia. 2016;30(5):1055–61.
Barretina J, Caponigro G, Stransky N, Venkatesan K, Margolin AA, Kim S, Wilson CJ, Lehar J, Kryukov GV, Sonkin D, et al. The Cancer cell line encyclopedia enables predictive modelling of anticancer drug sensitivity. Nature. 2012;483(7391):603–7.
Lappalainen T, Sammeth M, Friedlander MR, t Hoen PA, Monlong J, Rivas MA, Gonzalez-porta M, Kurbatova N, Griebel T, Ferreira PG, et al. Transcriptome and genome sequencing uncovers functional variation in humans. Nature. 2013;501(7468):506–11.
Bolotin DA, Poslavsky S, Mitrophanov I, Shugay M, Mamedov IZ, Putintseva EV, Chudakov DM. MiXCR: software for comprehensive adaptive immunity profiling. Nat Methods. 2015;12(5):380–1.
Langerak AW, van Dongen JJM. Multiple clonal Ig/TCR products: implications for interpretation of clonality findings. J Hematop. 2012;5(1):35–43.
Pfeifer S, Perez-Andres M, Ludwig H, Sahota SS, Zojer N. Evaluating the clonal hierarchy in light-chain multiple myeloma: implications against the myeloma stem cell hypothesis. Leukemia. 2011;25(7):1213–6.
Catherwood MA, Gonzalez D, Patton C, Dobbin E, Venkatraman L, Alexander HD. Improved clonality assessment in germinal Centre/post-germinal Centre non-Hodgkin's lymphomas with high rates of somatic hypermutation. J Clin Pathol. 2007;60(5):524–8.
Jiang Y, Nie K, Redmond D, Melnick AM, Tam W, Elemento O. VDJ-Seq: deep sequencing analysis of rearranged immunoglobulin heavy chain gene to reveal clonal evolution patterns of B cell lymphoma. J Vis Exp. 2015;106:e53215.
Kuppers R, Engert A, Hansmann ML. Hodgkin lymphoma. J Clin Invest. 2012;122(10):3439–47.
Matsuo Y, Drexler HG. Establishment and characterization of human B cell precursor-leukemia cell lines. Leuk Res. 1998;22(7):567–79.
Eswaran J, Sinclair P, Heidenreich O, Irving J, Russell LJ, Hall A, Calado DP, Harrison CJ, Vormoor J. The pre-B-cell receptor checkpoint in acute lymphoblastic leukaemia. Leukemia. 2015;29(8):1623–31.
Szczepanski T, Beishuizen A, Pongers-Willemse MJ, Hahlen K, Van Wering ER, Wijkhuijs AJ, Tibbe GJ, De Bruijn MA, Van Dongen JJ. Cross-lineage T cell receptor gene rearrangements occur in more than ninety percent of childhood precursor-B acute lymphoblastic leukemias: alternative PCR targets for detection of minimal residual disease. Leukemia. 1999;13(2):196–205.
van der Velden VH, Szczepanski T, Wijkhuijs JM, Hart PG, Hoogeveen PG, Hop WC, van Wering ER, van Dongen JJ. Age-related patterns of immunoglobulin and T-cell receptor gene rearrangements in precursor-B-ALL: implications for detection of minimal residual disease. Leukemia. 2003;17(9):1834–44.
Ding L-W, Tan K-T, Sun Q-Y, Lao Z-T, Yang H, Jiang N, Chien W, Xiao J-F, Loh X-Y, Huang M-L, et al. Clonality and clonal evolution analysis of paediatric ALL based on B-cell receptor/T-cell receptor rearrangement. Br J Haematol. 2018. https://doi.org/10.1111/bjh.15179. [Epub ahead of print].
Geng H, Hurtz C, Lenz KB, Chen Z, Baumjohann D, Thompson S, Goloviznina NA, Chen WY, Huan J, LaTocha D, et al. Self-enforcing feedback activation between BCL6 and pre-B cell receptor signaling defines a distinct subtype of acute lymphoblastic leukemia. Cancer Cell. 2015;27(3):409–25.
Trageser D, Iacobucci I, Nahar R, Duy C, von Levetzow G, Klemm L, Park E, Schuh W, Gruber T, Herzog S, et al. Pre-B cell receptor-mediated cell cycle arrest in Philadelphia chromosome-positive acute lymphoblastic leukemia requires IKAROS function. J Exp Med. 2009;206(8):1739–53.
Muschen M. Rationale for targeting the pre-B-cell receptor signaling pathway in acute lymphoblastic leukemia. Blood. 2015;125(24):3688–93.
Varano G, Raffel S, Sormani M, Zanardi F, Lonardi S, Zasada C, Perucho L, Petrocelli V, Haake A, Lee AK, et al. The B-cell receptor controls fitness of MYC-driven lymphoma cells via GSK3beta inhibition. Nature. 2017;546(7657):302–6.
Buchner M, Swaminathan S, Chen Z, Muschen M. Mechanisms of pre-B-cell receptor checkpoint control and its oncogenic subversion in acute lymphoblastic leukemia. Immunol Rev. 2015;263(1):192–209.
Swaminathan S, Duy C, Muschen M. BACH2-BCL6 balance regulates selection at the pre-B cell receptor checkpoint. Trends Immunol. 2014;35(3):131–7.
Swaminathan S, Huang C, Geng H, Chen Z, Harvey R, Kang H, Ng C, Titz B, Hurtz C, Sadiyah MF, et al. BACH2 mediates negative selection and p53-dependent tumor suppression at the pre-B cell receptor checkpoint. Nat Med. 2013;19(8):1014–22.
Ding LW, Sun QY, Tan KT, Chien W, Thippeswamy AM, Eng Juh Yeoh A, Kawamata N, Nagata Y, Xiao JF, Loh XY, et al. Mutational landscape of pediatric acute lymphoblastic leukemia. Cancer Res. 2017;77(2):390–400.
Barbey S, Gogusev J, Mouly H, Le Pelletier O, Smith W, Richard S, Soulie J, Nezelof C. DEL cell line: a “malignant histiocytosis” CD30+ t(5;6)(q35;p21) cell line. Int J Cancer. 1990;45(3):546–53.
Piao W, Chau D, Yue LM, Kwong YL, Tse E. Arsenic trioxide degrades NPM-ALK fusion protein and inhibits growth of ALK-positive anaplastic large cell lymphoma. Leukemia. 2017;31(2):522–6.
Whittaker S, Hoppe R, Prince HM. How I treat mycosis fungoides and Sezary syndrome. Blood. 2016;127(25):3142–53.
Langerak AW, van Den Beemd R, Wolvers-Tettero IL, Boor PP, van Lochem EG, Hooijkaas H, van Dongen JJ. Molecular and flow cytometric analysis of the Vbeta repertoire for clonality assessment in mature TCRalphabeta T-cell proliferations. Blood. 2001;98(1):165–73.
Sandberg Y, Verhaaf B, van Gastel-Mol EJ, Wolvers-Tettero IL, de Vos J, Macleod RA, Noordzij JG, Dik WA, van Dongen JJ, Langerak AW. Human T-cell lines with well-defined T-cell receptor gene rearrangements as controls for the BIOMED-2 multiplex polymerase chain reaction tubes. Leukemia. 2007;21(2):230–7.
Yao R, Schneider E. Detection of B- and T-cell-specific gene rearrangements in 13 cell lines and 50 clinical specimens using the BIOMED-2 and the original InVivoScribe primers. Leuk Lymphoma. 2007;48(4):837–40.
Cao S, Strong MJ, Wang X, Moss WN, Concha M, Lin Z, O'Grady T, Baddoo M, Fewell C, Renne R, et al. High-throughput RNA sequencing-based virome analysis of 50 lymphoma cell lines from the Cancer Cell Line Encyclopedia project. J Virol. 2015;89(1):713–29.
Strong MJ, Xu G, Morici L, Splinter Bon-Durant S, Baddoo M, Lin Z, Fewell C, Taylor CM, Flemington EK. Microbial contamination in next generation sequencing: implications for sequence-based analysis of clinical samples. PLoS Pathog. 2014;10(11):e1004437.
Cibulskis K, McKenna A, Fennell T, Banks E, DePristo M, Getz G. ContEst: estimating cross-contamination of human samples in next-generation sequencing data. Bioinformatics. 2011;27(18):2601–2.
Koeffler HP, Golde DW. Chronic myelogenous leukemia--new concepts (first of two parts). N Engl J Med. 1981;304(20):1201–9.
Minowada J, Tsubota T, Greaves MF, Walters TR. A non-T, non-B human leukemia cell line (NALM-1): establishment of the cell line and presence of leukemia-associated antigens. J Natl Cancer Inst. 1977;59(1):83–7.
Uesugi H, Atari E. Establishment and pathological study of a new poorly differentiated mucinous gastric cancer cell line. Nihon Shokakibyo Gakkai Zasshi. 1995;92(1):19–25.
Munari F, Lonardi S, Cassatella MA, Doglioni C, Cangi MG, Amedei A, Facchetti F, Eishi Y, Rugge M, Fassan M, et al. Tumor-associated macrophages as major source of APRIL in gastric MALT lymphoma. Blood. 2011;117(24):6612–6.
Otsuki T, Nakazawa N, Taniwaki M, Yamada O, Sakaguchi H, Wada H, Yawata Y, Ueki A. Establishment of a new human myeloma cell line, KMS-18, having t(4;14)(p16.3;q32.3) derived from a case phenotypically transformed from Ig A-lambda to BJP-lambda, and associated with hyperammonemia. Int J Oncol. 1998;12(3):545–52.
Ryan JL, Kaufmann WK, Raab-Traub N, Oglesbee SE, Carey LA, Gulley ML. Clonal evolution of lymphoblastoid cell lines. Lab Investig. 2006;86(11):1193–200.
Plagnol V, Uz E, Wallace C, Stevens H, Clayton D, Ozcelik T, Todd JA. Extreme clonality in lymphoblastoid cell lines with implications for allele specific expression analyses. PLoS One. 2008;3(8):e2966.
Vardi A, Vlachonikola E, Karypidou M, Stalika E, Bikos V, Gemenetzi K, Maramis C, Siorenta A, Anagnostopoulos A, Pospisilova S, et al. Restrictions in the T-cell repertoire of chronic lymphocytic leukemia: high-throughput immunoprofiling supports selection by shared antigenic elements. Leukemia. 2017;31(7):1555–61.
Bikos V, Stalika E, Baliakas P, Darzentas N, Davis Z, Traverse-Glehen A, Dagklis A, Kanellis G, Anagnostopoulos A, Tsaftaris A, et al. Selection of antigen receptors in splenic marginal-zone lymphoma: further support from the analysis of the immunoglobulin light-chain gene repertoire. Leukemia. 2012;26(12):2567–9.
Dagklis A, Ponzoni M, Govi S, Cangi MG, Pasini E, Charlotte F, Vino A, Doglioni C, Davi F, Lossos IS, et al. Immunoglobulin gene repertoire in ocular adnexal lymphomas: hints on the nature of the antigenic stimulation. Leukemia. 2012;26(4):814–21.
Blachly JS, Ruppert AS, Zhao W, Long S, Flynn J, Flinn I, Jones J, Maddocks K, Andritsos L, Ghia EM, et al. Immunoglobulin transcript sequence and somatic hypermutation computation from unselected RNA-seq reads in chronic lymphocytic leukemia. Proc Natl Acad Sci U S A. 2015;112(14):4322–7.
Langerak AW, Groenen PJ, Bruggemann M, Beldjord K, Bellan C, Bonello L, Boone E, Carter GI, Catherwood M, Davi F, et al. EuroClonality/BIOMED-2 guidelines for interpretation and reporting of Ig/TCR clonality testing in suspected lymphoproliferations. Leukemia. 2012;26(10):2159–71.
Gimelbrant A, Hutchinson JN, Thompson BR, Chess A. Widespread monoallelic expression on human autosomes. Science. 2007;318(5853):1136–40.
Kim E, Hurtz C, Koehrer S, Wang Z, Balasubramanian S, Chang BY, Muschen M, Davis RE, Burger JA. Ibrutinib inhibits pre-BCR+ B-cell acute lymphoblastic leukemia progression by targeting BTK and BLK. Blood. 2017;129(9):1155–65.
Pegoraro L, Matera L, Ritz J, Levis A, Palumbo A, Biagini G. Establishment of a Ph1-positive human cell line (BV173). J Natl Cancer Inst. 1983;70(3):447–53.
Computational resources were provided by the National Supercomputing Centre Singapore.
This work was funded by the Leukemia and Lymphoma Society Transforming CURES Initiative, by the Singapore Ministry of Health’s National Medical Research Council (NMRC) under its Singapore Translational Research (STaR) Investigator Award (NMRC/STaR/0021/2014), the Singapore Ministry of Education Academic Research Fund Tier 2 (MOE2013-T2-2-150), the NMRC Centre Grant awarded to the National University Cancer Institute of Singapore (NMRC/CG/012/2013), and the National Research Foundation Singapore and the Singapore Ministry of Education under its Research Centres of Excellence initiatives, as well as by generous donations from the Melamed family and Reuben Yeroushalmi. This research was also supported by the RNA Biology Center at the Cancer Science Institute of Singapore, NUS, as part of funding under the Singapore Ministry of Education’s Tier 3 grants, grant number MOE2014-T3-1-006. The funding sources had no role in the design of the study; the collection, analysis, and interpretation of the data; or the writing of the manuscript.
Availability of data and materials
The data generated in this study are available in the Additional files for this manuscript.
Ethics approval and consent to participate
Consent for publication
Competing interests
The authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Figure S1. Expression of CD19/CD20, CD79A/B and RAG1/RAG2 in blood cancer cell lines. X axis: 164 blood cancer cell lines grouped by disease type (rectangular color bar below the X axis); each dot on the X axis represents one cell line. Y axis: expression level (log2 value) of the indicated genes.
Figure S2. Clonal fraction (filtered by >30 reads) of B-ALL, multiple myeloma, diffuse large B cell lymphoma, Burkitt lymphoma, B cell lymphoma (unspecified) and mantle cell lymphoma based on IGH or IGK/L. Blue indicates the clonotype fraction of the most dominant clone, red indicates that of the second most dominant clone, and yellow indicates that of the third most dominant clone; any smaller subclones were aggregated and labelled in grey.
Figure S3. Clonal fraction (filtered by >30 reads) of T-ALL and anaplastic large cell lymphoma based on TRCA or TRCB. Blue indicates the clonotype fraction of the dominant clone, red indicates that of the second most dominant clone, and grey indicates that of the third most dominant clone.
Figure S4. Heatmap showing the usage of IGK/L V genes (A), IGK/L J genes (B), and constant region (C) in 462 samples of EBV transformed normal B lymphocytes.
Figure S5. The phylogenetic tree inferred from the rearrangement of the CDR3 region of IGH, IGK and IGL of EBV transformed B lymphocyte samples ERR188025, ERR188358 and ERR188212. These three cell lines have a much higher number of rearrangement types than the other B lymphocyte lines.
Clonal fraction (upper panel) and read counts (lower panel) of the dominant clone of 462 samples of EBV transformed normal B lymphocytes.
(ZIP 1290 kb)