Abstract
Transcription factors (TFs) are frequently mutated in cancer. Paediatric cancers exhibit few mutations genome-wide but frequently harbour sentinel mutations that affect TFs, which provides a context to precisely study the transcriptional circuits that support mutant TF-driven oncogenesis. A broadly relevant mechanism that has garnered intense focus involves the ability of mutant TFs to hijack wild-type lineage-specific TFs in self-reinforcing transcriptional circuits. However, it is not known whether this specific type of circuitry is equally crucial in all mutant TF-driven cancers. Here we describe an alternative yet central transcriptional mechanism that promotes Ewing sarcoma, wherein constraint, rather than reinforcement, of the activity of the fusion TF EWS–FLI supports cancer growth. We discover that ETV6 is a crucial TF dependency that is specific to this disease because it, counter-intuitively, represses the transcriptional output of EWS–FLI. This work discovers a previously undescribed transcriptional mechanism that promotes cancer.
Similar content being viewed by others
Main
As fundamental drivers of cell-type-specific identity and function, aberrant TFs represent an important class of genetic dependencies across distinct cancer types1. Paediatric cancers exhibit few mutations genome-wide, but typically harbour sentinel mutations that alter TF proteins2,3,4,5. Mutant TFs can hijack wild-type lineage-specific TFs into self-reinforcing, feed-forward core regulatory circuits (CRCs)6,7,8,9,10,11,12,13,14. For example, MYCN in MYCN-amplified neuroblastoma and the PAX3–FOXO1 and PAX7–FOXO1 fusion proteins in alveolar rhabdomyosarcoma promote tumour growth by hijacking tumour-type-specific CRC TFs15,16,17,18,19. It is not known, however, to what extent distinct cancer types harbouring mutant TFs rely on this specific type of circuitry.
Ewing sarcoma, the second most common paediatric bone cancer, is defined by pathognomonic chromosomal translocations that fuse a gene member of the FET family of RNA-binding proteins with members of the ETS family of TFs20,21. In 85–90% of cases, a translocation fuses the EWSR1 and FLI1 genes to encode the EWS–FLI fusion protein. EWS–FLI proteins exhibit the neomorphic ability to pioneer de novo enhancers at microsatellites that contain tandem ETS 5′-GGAA-3′ motif repeats22,23,24,25,26,27,28,29,30,31 via multimerization and recruitment of chromatin-modifying complexes, which in turn lead to an altered gene expression programme20,32.
Efforts to establish key dependencies in Ewing sarcoma have prioritized the identification of specific gene targets of EWS–FLI. Studies have described cell-type-specific TFs that are activated by, and cooperate with, EWS–FLI to reinforce oncogenic programmes23,24,32,33,34,35,36,37,38,39, including in CRCs40. Unbiased and systematic approaches are needed, however, to reveal crucial disease mechanisms specific to Ewing sarcoma.
Here we describe the results of a genome-scale CRISPR–Cas9 screen revealing that the wild-type ETS TF ETS variant 6 (ETV6; also known as TEL) is a crucial Ewing-sarcoma-selective TF dependency. We validate this dependency in vitro and in vivo. In contrast to selective TF dependencies that reinforce the oncogenic programmes of mutant TFs in other cancer types, the repressive activity of ETV6 constrains EWS–FLI gene activation at 5′-GGAA-3′ repeat enhancers to promote Ewing sarcoma growth. We therefore discover a previously undescribed mechanism promoting cancer: competition on chromatin between an oncogenic fusion TF and a ‘restraining’ inhibitory TF.
Results
ETV6 is a selective TF dependency in Ewing sarcoma
We recently reported a genome-scale CRISPR–Cas9 loss-of-function screen performed in paediatric cancer cell lines (Pediatric Cancer DepMap) that identified transcriptional activators as a strongly enriched class of selective dependencies (genetic vulnerabilities unique to a specific cancer type) in several paediatric cancer subtypes41. Among the most highly selective dependencies were activating CRC TFs in neuroblastoma and rhabdomyosarcoma (Fig. 1a and Supplementary Tables 1–3). TFs involved in CRCs co-opted by MYCN in neuroblastoma (PHOX2B, HAND2, ISL1 and GATA3)15,16 and the PAX3–FOXO1 and PAX7–FOXO1 fusion proteins in alveolar rhabdomyosarcoma (MYOD1, MYOG, SOX8 and MYCN)17,18 constituted the strongest dependencies specific to each tumour type. By contrast, selective TF gene dependencies in Ewing sarcoma were transcriptional repressors, including the known dependencies BCL11B and ZEB2 (refs. 36, 37), which are activated by EWS–FLI, and the previously uncharacterized dependency ETV6 (Fig. 1a and Extended Data Fig. 1a). These TFs also scored in independent screens (Extended Data Fig. 1b). ETV6 is not recurrently mutated in Ewing sarcoma42,43,44, and ETV6 dependency was not associated with a specific EWS–ETS fusion (that is, EWS–FLI or EWS–ERG) (Extended Data Fig. 1c). ETV6 was not defined as a gene regulated by EWS–FLI in 18 of 19 gene sets (Supplementary Table 4) and did not exhibit a unique pattern of expression in cell lines (Fig. 1a) or in primary tumours45 (Extended Data Fig. 1d and Supplementary Table 5) of Ewing sarcoma. By contrast, CRC TFs in neuroblastoma and rhabdomyosarcoma exhibited tumour-type-specific expression (Fig. 1a), as did BCL11B and ZEB2 in Ewing sarcoma45 (Extended Data Fig. 1e and Supplementary Table 5).
We validated an ETV6 dependency in three cell lines of Ewing sarcoma, A673, EW8 and TC32, via CRISPR–Cas9 disruption. Loss of ETV6 reduced cell growth in vitro (Fig. 1b and Extended Data Fig. 1f) and reduced anchorage-independent growth in methylcellulose (Fig. 1c and Extended Data Fig. 1g). We established a biochemical dTAG approach46,47 to perturb ETV6 abundance with precise temporal control and without eliciting acute DNA damage. FKBP12F36V-tagged proteins can be acutely degraded following exposure to the dTAG small-molecule dTAGV-1, which recruits the von Hippel–Lindau E3 ligase to ubiquitinate FKBP12F36V (ref. 46). In the Ewing sarcoma cell lines A673 and EW8, we exogenously expressed ETV6 carboxy-terminally tagged with FKBP12F36V and a human influenza haemagglutinin (HA) epitope (Fig. 1d). Simultaneously, we knocked out endogenous ETV6 such that FKBP12F36V-tagged ETV6 constituted the dominant form of ETV6 protein. ETV6–FKBP12F36V degradation reduced anchorage-independent growth (Fig. 1e,f and Extended Data Fig. 1h). Degradation of ETV6 (Fig. 1g and Extended Data Fig. 2a) as well as CRISPR–Cas9-mediated knockout of endogenous ETV6 in parental A673 cells (Extended Data Fig. 2b) led to G1/G0 cell cycle arrest but did not induce apoptosis (Extended Data Fig. 2c and Supplementary Fig. 1).
In vivo, CRISPR–Cas9-mediated knockout of ETV6 reduced the growth of subcutaneous TC32 tumours (Extended Data Fig. 2d). Using an orthotopic-like mouse model, in which A673 Ewing sarcoma cells implanted intramuscularly in the hindlimb are capable of metastasis48, we observed that ETV6 loss reduced primary tumour growth (Fig. 1h). ETV6 loss reduced metastasis to liver tissues (Fig. 1i, left), and lung tissues displayed the same trend in one out of two ETV6 knockout conditions (Fig. 1i, right).
Next we asked whether the DNA-binding domain (DBD) of ETV6 was crucial to its function. We knocked out endogenous ETV6 and exogenously expressed wild-type ETV6 or mutant ETV6 bearing a C-terminal DBD deletion, which precluded ETV6 binding to chromatin and partially impeded its nuclear localization. This result is consistent with the report that the nuclear localization signal of ETV6 protein lies in its C terminus49 (Extended Data Fig. 2e). Whereas wild-type ETV6 expression rescued ETV6 knockout, expression of the mutant ETV6 did not (Extended Data Fig. 2f), which suggests that the specific activity of ETV6 on chromatin is crucial to its function in Ewing sarcoma.
ETV6 and EWS–FLI co-occupy loci genome-wide
ETV6 and EWS–FLI harbour the ETS family DBD, which recognizes consensus 5′-GGA(A/T)-3′ motifs. We therefore asked whether they co-localized on chromatin. We profiled endogenous ETV6 binding sites in parental A673 cells using cleavage under targets and tagmentation (CUT&Tag)50 and profiled ETV6–FKBP12F36V–HA binding sites in ETV6–dTAG cells using anti-HA chromatin immunoprecipitation with sequencing (ChIP-seq). These analyses defined a consensus list of ETV6-binding sites (Extended Data Fig. 3a and Fig. 2a). dTAGV-1 treatment reduced ETV6 abundance on chromatin in both dTAG models (Fig. 2b and Extended Data Fig. 3b). In parental Ewing sarcoma cells, we performed histone H3 lysine 27 acetylation (H3K27ac) ChIP-seq and analysed public histone H3 lysine 4 trimethylation (H3K4me3) ChIP-seq data26 to annotate ETV6-binding sites. The results showed that these sites occurred at active promoters and enhancers (Fig. 2c and Extended Data Fig. 3c). We performed ChIP-seq for EWS–FLI in A673 and EW8 parental cells by immunoprecipitating the C-terminal FLI1 domain. This is an accepted approach to identify EWS–FLI-binding sites because wild-type FLI1 typically is not expressed in Ewing sarcoma cells39,43. EWS–FLI bound ubiquitously at ETV6-binding sites in both models (Fig. 2c and Extended Data Fig. 3c), although co-occupied binding sites constituted only a small proportion of total EWS–FLI-binding sites (Fig. 2d and Extended Data Fig. 3d). EWS–FLI pioneers closed chromatin at GGAA repeat microsatellites27, including at repeats of four or more26. ETV6 localized at these longer consecutive GGAA repeats at a higher frequency in Ewing sarcoma than in B lymphocytes or in K-562 leukaemia cells51,52, which express ETV6 (P < 2.2 × 10−16) (Fig. 2e and Extended Data Fig. 3e).
Loss of ETV6 increases EWS–FLI occupancy
We next asked whether loss of ETV6 alters EWS–FLI chromatin occupancy. We degraded ETV6 and profiled EWS–FLI binding by ChIP-seq at 6 and 72 h. At 6 h, in both dTAG models, significant alterations in EWS–FLI occupancy primarily constituted increases in binding (Fig. 2f). At 72 h, alterations were more dynamic, exhibiting both increases and decreases (Fig. 2f). We categorized loci by whether they gained or lost EWS–FLI binding at 6 h and whether they occurred at transcription start sites (TSSs) or at H3K27ac-defined enhancers (Fig. 2g,h). Regions that lost EWS–FLI binding did not change to as great a degree as regions that gained binding (Fig. 2i). Thus, the loss of ETV6 led acutely and predominantly to increased EWS–FLI binding, which provides support for the hypothesis that these TFs compete for binding. Additionally, ChIP-seq of H3K27ac at 6 h in both models (Fig. 2g,h) demonstrated a modest increase in H3K27ac abundance at enhancer regions that gained EWS–FLI binding (Extended Data Fig. 3f).
Differential EWS–FLI binding was highly dynamic at tandem 5′-GGAA-3′ repeats (Extended Data Fig. 3g). Notably, genomic regions that gained EWS–FLI binding were more likely to contain shorter tandem repeats of 2, 3 or 4 motifs compared with regions that lost EWS–FLI binding (P < P = 6.974 × 10−15). Consistent differences were not observed for single GGAA motifs or >4 GGAA repeats.
ETV6 is a transcriptional repressor in Ewing sarcoma
We next characterized genes regulated by ETV6, a reported transcriptional repressor53,54,55,56,57. We performed RNA sequencing (RNA-seq) in both dTAG models at 6, 24 and 72 h following treatment with dimethylsulfoxide (DMSO) or dTAGV-1 (Fig. 1e and Extended Data Fig. 4a). Globally, the expression profiles of each of the engineered dTAG cell lines approximated that of their corresponding parental cell lines (Extended Data Fig. 4b). At 6 h, the majority of differentially expressed genes were upregulated, which suggests that ETV6 acts predominantly as a transcriptional repressor in Ewing sarcoma (Fig. 3a). Strongly ETV6-repressed genes increased in expression over time following ETV6 degradation (Fig. 3b). We observed concordance in regulated genes between dTAG models (Extended Data Fig. 4c) and identified a common set of 85 ETV6-repressed genes (Fig. 3c and Supplementary Table 6). We performed RNA-seq on parental A673 cells transduced with ETV6 CRISPR knockout (Fig. 3d and Extended Data Fig. 4d). The results showed that most of the 85 genes were also repressed by endogenous levels of wild-type ETV6 (P = 2.66 × 10−20). Consistent with the localization of ETV6 at active promoters and enhancers, ETV6-repressed genes were expressed and not completely silenced (Extended Data Fig. 4e). Additionally, ETV6-binding sites were enriched in ETV6-regulated genes (Fig. 3e and Extended Data Fig. 4f).
ETV6 is a master TF implicated in the normal development of neural and mesenchymal lineages58,59. Developmental lineage-specific gene sets were enriched in ETV6-repressed genes (Fig. 3f and Supplementary Tables 6–11) and in ETV6-activated genes (Extended Data Fig. 4g and Supplementary Tables 12–17). ETV6-repressed genes, but not activated genes, were strongly enriched for genes regulated by histone deacetylases (HDACs), which may reflect the ability of ETV6 to recruit HDACs54,60,61,62,63. We also observed strong enrichment of EWS–FLI-regulated genes in ETV6-regulated genes (Fig. 3f), consistent with their co-localization on chromatin.
Loss of ETV6 alters gene expression and the chromatin state
We next sought to associate locus-specific alterations in chromatin with differential gene expression after ETV6 degradation. Alterations in EWS–FLI binding at individual loci at 6 h were sustained at 72 h, and these alterations correlated between dTAG models (Fig. 4a). Consistently, the greatest degree of differential EWS–FLI binding was exhibited by loci that gained EWS–FLI binding; loci that lost EWS–FLI binding exhibited smaller changes in magnitude (Fig. 4a). This pattern was paralleled by alterations in H3K27ac abundance at EWS–FLI-binding sites (Fig. 4b and Extended Data Fig. 5a) and by alterations in chromatin accessibility (Fig. 4c). We assigned EWS–FLI-binding sites to nearby genes and examined their expression following ETV6 loss (Fig. 4d). Genes that gained or lost EWS–FLI binding exhibited significantly increased or decreased expression, respectively (P < 1 × 10−10), with genes in the former category exhibiting the greatest degree of change on average (Fig. 4d and Extended Data Fig. 5b). Thus, the most profound consequences of ETV6 loss are increased EWS–FLI binding, opening of chromatin and increased gene expression.
Increased EWS–FLI occupancy upregulates gene expression
We knocked out EWS–FLI in A673 ETV6–dTAG cells to evaluate whether loss of EWS–FLI rescues gene expression changes with ETV6 loss. FAS, ACTA2, TRIB1 and SEMA5B were identified as ETV6-repressed genes that exhibit increased EWS–FLI binding, H3K27ac and chromatin accessibility at ETV6-vacated sites, some of which occurred at GGAA repeats (Fig. 4e,f and Extended Data Fig. 5c). We compared these genes to BCL11B because it is activated by EWS–FLI but is not repressed by ETV6 and does not exhibit altered EWS–FLI binding acutely following ETV6 loss (Extended Data Fig. 5d). Quantitative PCR (qPCR) demonstrated that degradation of ETV6 led to upregulation of ETV6-repressed genes but not BCL11B (Fig. 4g, top plots). EWS–FLI knockout significantly reduced the upregulation of ETV6-repressed genes (Fig. 4g, bottom plots). Immunoblotting validated that the attenuation of mRNA upregulation also affected protein levels (Fig. 4h). Thus, ETV6 and EWS–FLI antagonistically regulate FAS, ACTA2, TRIB1 and SEMA5B expression.
ETV6 functions similarly in clinically relevant Ewing sarcoma models
Well-established cancer cell lines may use distinct biological mechanisms to that of primary tumour cells. We therefore tested the relevance of our findings from cell lines in two newly derived Ewing sarcoma cell lines: CCLF_PEDS_0009_T (PEDS0009) and CCLF_PEDS_0010_T (PEDS0010)64. ETV6 knockout impaired cell growth in vitro and colony formation in methylcellulose (Fig. 5a,b and Extended Data Fig. 6a,b). Additionally, we tested cells from a minimally passaged Ewing sarcoma patient-derived xenograft (PDX): ES-PDX-001 (refs. 65, 66). Again, knockout of ETV6 impaired cell growth in vitro (Extended Data Fig. 6c). In PEDS0009 cells, we observed ETV6 and EWS–FLI binding at previously defined EWS–FLI consensus binding sites (Fig. 5c). Concordant with our cell line data, ETV6 bound to GGAA microsatellites (Fig. 5d), and ETV6 loss resulted in increased EWS–FLI binding at the same loci that exhibited increased EWS–FLI occupancy in cell lines (Fig. 5e–g and Extended Data Fig. 6d,e). Genomic regions that gained EWS–FLI binding were more likely to contain shorter GGAA repeats of 2, 3 or 4 compared with regions that lost EWS–FLI (Extended Data Fig. 6f) (P = 5.186 × 10−11). These observations in minimally passaged cells were concordant with the data from well-established cell lines.
ETV6 and EWS–FLI antagonism at SOX11 is functional
We next asked whether the antagonistic relationship between EWS–FLI and ETV6 is responsible for the dependency of Ewing sarcoma cells on ETV6. Almost half of the gene sets enriched in ETV6-repressed genes were related to developmental pathways (Extended Data Fig. 7a and Supplementary Table 18), and 46 of these included SOX11 (Supplementary Table 19). SOX11 expression exerts context-dependent effects on cancer cell survival, growth and metastasis67,68. SOX11 acts as an oncogene in mantle cell lymphoma69 and promotes metastasis in breast cancer70,71. Conversely, it also reduces proliferation and metastasis in prostate cancer72 and induces differentiation of glioma cells73. In Ewing sarcoma cells, the exogenous expression of SOX11 impaired cell growth, whereas the expression of a DBD-deleted mutant did not (Extended Data Fig. 7b). These results provide support for a tumour-suppressive role for SOX11 activity.
We observed differential EWS–FLI binding at a distal enhancer that mapped to SOX11 as the nearest expressed gene (Fig. 6a, left). RNA-seq data from the Cancer Cell Line Encyclopedia74 show that the neighbouring genes, SILC1 and LOC400940, are not expressed in Ewing sarcoma. This enhancer occurred at tandem GGAA repeats and exhibited increased EWS–FLI binding, H3K27ac abundance and chromatin accessibility following ETV6 loss (Fig. 6a, right). RNA-seq confirmed that SOX11 is repressed by ETV6-FKBP12F36V in dTAG cells and by endogenous ETV6 in parental A673 cells (Extended Data Fig. 7c). EWS–FLI was required for SOX11 upregulation after ETV6 loss (Fig. 6b,c). Knockout of SOX11 in A673 ETV6–dTAG cells (Fig. 6d) rescued the effects of ETV6 degradation (Fig. 6e). Additionally, knockout of SOX11 in A673 and TC32 cells (Fig. 6f and Extended Data Fig. 7d) rescued ETV6 knockout (Fig. 6g and Extended Data Fig. 7e,f). In vivo, we observed rescue in TC32 cells grown as subcutaneous tumours in mice (Fig. 6h). These findings support the hypothesis that ETV6 dependency is specific to Ewing sarcoma cells because ETV6 constrains EWS–FLI activation of SOX11 expression.
Finally, we asked whether co-regulation at SOX11 by ETV6 and EWS–FLI could be recapitulated with ectopic expression of EWS–FLI. In rhabdomyosarcoma RD cells, we exogenously expressed wild-type EWS–FLI or the R340N DNA-binding mutant of EWS–FLI, which cannot bind to DNA75. SOX11 protein expression was induced by wild-type EWS–FLI but not the mutant (Extended Data Fig. 7g). Knockout of ETV6 further upregulated SOX11 abundance in the setting of wild-type EWS–FLI but not in the context of mutant EWS–FLI expression (Extended Data Fig. 7g). These findings demonstrate that the DBD of EWS–FLI is required for its activation of SOX11 expression, an activity that is repressed by ETV6.
Discussion
In this study, we discovered an oncogenic mechanism underlying the paediatric cancer Ewing sarcoma. We demonstrated that the ETS TF ETV6 is a selective dependency in Ewing sarcoma because it antagonizes the transcriptional activity of EWS–FLI at ETS motifs. To our knowledge, this report constitutes the first description of transcriptional constraint of a fusion TF on chromatin as a crucial driver of tumour growth.
Although previous studies have described specific TFs as dependencies that reinforce the EWS–FLI transcriptional programme in Ewing sarcoma32,33,34,35,37, including in CRCs40, these targets were not identified in DepMap screening as selective gene dependencies. Instead, our discovery that ETV6 constrains EWS–FLI activity highlights a distinct, but equally central, epigenetic mechanism that drives tumour growth and reveals an unexpected contrast between Ewing sarcoma and other paediatric tumours in which CRCs are functionally dominant.
Cancer cells frequently co-opt mechanisms that underlie normal development76. The competition between EWS–FLI and ETV6 in Ewing sarcoma bears resemblance to a mechanism of ETS TF competition governing cell-fate decisions in developing Drosophila. Pointed, the activating orthologue of human Ets-1, competes for binding at ETS motifs within specific enhancers with Yan, the repressive orthologue of ETV6, to regulate the expression of key differentiation genes in distinct tissues77,78,79,80,81. Here we described a similar mechanism that has been co-opted in cancer to regulate the transcriptional output of a fusion TF.
The epigenetic activity of ETS TFs other than EWS–FLI may contribute to the phenotype of ETV6 loss. Notably, ETV7, the homologue of ETV6, is not expressed in Ewing sarcoma cells (Supplementary Table 20), and we did not observe strong changes in the expression of other ETS TFs with ETV6 loss. The maximum change exhibited by one gene was roughly threefold, and only five genes displayed a significant alteration in expression across the models evaluated. Moreover, none of the genes that displayed a change in expression were scored as dependencies or tumour suppressors in DepMap in Ewing sarcoma.
Although most human TF families contain paralogues that are co-expressed within distinct cell types82,83,84, an understanding of their interactions at shared motifs is lacking. We began to unravel key cis regulatory principles that distinguish the specific functions of ETV6 and EWS–FLI, the antagonism of which on chromatin frequently occurred at shorter 5′-GGAA-3′ repeats. As the pathogenesis of EWS–FLI is typically associated with its activity at longer repeats or true microsatellites, we highlight a previously undescribed cis regulatory role for shorter GGAA repeats in this disease, which facilitates ETV6 fine-tuning of EWS–FLI. Even though the reconstitution of EWS–FLI for biochemical assays has been a challenge for the field, future work is needed to delineate the precise GGAA repeat code that determines the activities of each TF. Similarly, ETV6 and wild-type FLI1 proteins can engage in an inhibitory heterodimer85, an interaction mediated by the amino-terminal Pointed (PNT) domain of ETV6, and further studies are needed to determine whether ETV6 and EWS–FLI engage in a protein–protein interaction. Notably, however, our experiments using an ETS DBD-deleted mutant of ETV6, with an intact PNT domain, demonstrated that the DNA-binding activity of ETV6 is crucial to its function in Ewing sarcoma.
ETV6 is a master TF in normal development and is recurrently mutated in cancer. ETV6 mutations include deletions and chromosomal translocations involving 30 distinct gene partners53. Germline and somatic loss-of-function mutations frequently occur in pre-malignant disorders and leukaemias. For example, in B cell acute lymphoblastic leukaemia, ETV6 deletions frequently co-occur with ETV6–RUNX1 rearrangements, which result in biallelic loss of the ETV6 protein86,87. Chromosomal translocations also fuse the N terminus of ETV6 with the tyrosine kinase domain from a number of receptor tyrosine kinases, which facilitate constitutive autophosphorylation and growth signalling53. ETV6, however, has not been reported as recurrently mutated in Ewing sarcoma42,43,44. Furthermore, ETV6 is not regulated by EWS–FLI (Supplementary Table 4) and does not exhibit a marked pattern of expression specific to this cancer type (Extended Data Fig. 1d). Nonetheless, we discovered its role as a crucial tumour-type-selective dependency in regulating EWS–FLI activity. As such, this report reaffirms the importance of performing unbiased functional screens at scale to reveal oncogenic mechanisms sustained by wild-type proteins.
Our findings suggest that a hallmark of Ewing sarcoma biology may involve the reliance on mechanisms constraining EWS–FLI activity to promote tumour growth. Indeed, we previously described mechanisms mediated by an E3 ligase (TRIM8) and cohesin that restrain EWS–FLI activity48,66,88. Here we discovered a distinct mechanism in support of an EWS–FLI Goldilocks phenomenon66 that is operative on chromatin. Future translational efforts could ultimately seek to modulate the activity of this pharmacologically challenging protein, either by decreasing or paradoxically increasing its activity.
In conclusion, we discovered the oncogenic role of TF competition on chromatin between a mutant TF and a wild-type paralogue. Our work contributes to an understanding of the dysregulated epigenetic mechanisms that can promote cancer, raising the possibility that similar mechanisms are relevant in other disease contexts.
Methods
Our research complied with all ethical guidelines determined by the Dana-Farber Cancer Institute Institutional Animal Care and Use Committee under Animal Welfare Assurance number D16-00010 (A3023-01). No human studies were performed. In Extended Data Fig. 6c, a minimally passaged cell line previously derived in our laboratory66 from a previously characterized Ewing sarcoma PDX65 (HSJD-ES-PDX-001) was studied. As such, this experiment was performed in vitro and did not involve the use of animals. As previously described65, this PDX originated from a biopsy in a 21.7-year-old patient whose sex was not reported65. It was collected with informed consent without compensation under an Institutional Review Board-approved protocol at Sant Joan de Déu Hospital (HSJD, Barcelona, Spain), protocol number HSJD 135/11 (ref. 65).
CRISPR–Cas9 screen dependency analysis
All genome-scale dependency data are available at the DepMap portal website: https://depmap.org. DepMap AVANA 21Q1 dependency data were used (18,333 genes in 808 cell lines, https://figshare.com/articles/dataset/public_21q1/13681534). Twelve cell lines were not included in the analyses: four cell lines are classified as engineered lines; the origin of one cell line, CHLA57, is unknown as it is incorrectly identified as Ewing sarcoma; seven cell lines are listed as commonly misidentified cell lines in the ICLAC Register of Misidentified Cell Lines (https://iclac.org/databases/cross-contaminations/). Therefore, dependency data for 796 cell lines were examined. CERES gene effect scores were calculated as previously described41,90. A lower CERES gene effect score indicates an increased likelihood that a specific gene is required for viability in that cell line. A CERES score of 0 indicates that gene deletion exhibited no effect on growth, whereas a score of −1 is comparable with the median of all commonly essential genes, that is, genes that were essential for growth in nearly every cell line across the entire screen. Tumour-type-enriched ‘selective’ dependencies were determined by performing a two-class comparison between the gene effect scores for cell lines of each tumour type (in-group) and the remainder of all other cell lines in the screen (out-group) for a specific gene as previously described41. In brief, effect size was calculated as the difference in the mean gene effect dependency score in the in-group compared with that in the out-group. In addition to two-sided P values, one-sided P values were generated to test whether the in-group exhibited, on average, greater or lesser dependency on a specific gene than the out-group. All P values were corrected for multiple hypothesis testing using the Benjamini–Hochberg correction and reported as q values. Tumour-type-enriched dependencies were identified in each tumour type as those with a q value of <0.05 and with a negative effect size (the mean of dependency gene effect score was more negative in the in-group than in the out-group). The same analyses were performed on the genome-scale CRISPR–Cas9 screens using the Broad Institute GeCKO library (18,478 genes in 43 cell lines, https://figshare.com/articles/dataset/DepMap_GeCKO_19Q1/7668407) as well as the Sanger library (17,799 genes in 318 cell lines, https://figshare.com/articles/dataset/Project_SCORE_processed_with_CERES/9116732/1).
Cancer cell line and primary tumour gene expression
RNA-seq gene expression data from the Cancer Cell Line Encyclopedia74 were downloaded (19,177 genes in 1,376 cell lines) from the 21Q1 DepMap portal website (https://depmap.org). Tumour-type-enriched expression for each gene was calculated by performing a two-class comparison between the log2(transcripts per million (TPM) + 1) gene expression for cell lines of each tumour type (in-group) and the remainder of all other cell lines profiled (out-group). All P values were corrected for multiple hypothesis testing using the Benjamini–Hochberg correction and reported as q values. RNA-seq gene expression data for primary tumours were downloaded from the Treehouse Childhood Cancer Initiative45 (UCSC Genomics Institute, https://treehousegenomics.soe.ucsc.edu/public-data).
Cell samples and culture
All cell lines were genotyped by short tandem repeat analysis and tested for Mycoplasma. Whole-exome sequencing and RNA-seq were performed to validate cell line identity43. The A673 cell line was purchased from the American Type Culture Collection (ATCC, CRL-1598). EW8 (originally derived by P. Houghton91) and TC32 (originally derived by T. Triche92) cell lines were obtained from the Golub Lab. A673 and EW8 cells were grown in Dulbecco’s modified Eagle’s medium (DMEM) (Thermo Fisher Scientific, MT10013CM), supplemented with 10% FBS (Sigma-Aldrich, F2442) and 1% penicillin–streptomycin (Life Technologies, 15140163). TC32 cells were grown in Roswell Park Memorial Institute (RPMI)-1640 medium (Thermo Fisher Scientific, MT10040CM), supplemented with 10% FBS and 1% penicillin–streptomycin. The PEDS0009 and PEDS0010 cell lines were obtained from the Cancer Cell Line Factory (Broad Institute) and were derived as previously described64. They were cultured in RPMI-1640 supplemented with 10% FBS and 1% penicillin–streptomycin. The Ewing sarcoma PDX (HSJD-ES-PDX-001) was provided by J. Mora (HSJD)65. To generate the minimally passaged cell line (ES-PDX-001), PDX tumours were processed as previously described66. The RD cell line (ATCC, CRL-7731) was a gift from the DepMap group at the Broad Institute. RD cells were cultured in RPMI-1640 supplemented with 10% FBS.
CRISPR–Cas9 genome editing of Ewing sarcoma cell samples
The lentiCRISPRv2 plasmid backbone93 (Addgene, 52961) encoding the Cas9 nuclease was digested with the restriction endonuclease BsmbI (Esp3I) (Thermo Fisher Scientific, FERFD0454) and gel extracted (Qiagen, 28704). Synthetic oligonucleotides containing gene-targeting single guide RNA (sgRNA) sequences and adapter sequences (provided below) were ligated into the restriction digest site. Oligonucleotides were purchased from Integrated DNA Technologies (IDT), annealed and end-phosphorylated using T4 polynucleotide kinase (New England Biolabs, M0201S). Ligated vectors were transformed into One Shot Stbl3 Escherichia coli (Life Technologies, C737303), shaken at 37 °C for 1 h and grown overnight on 100 µg ml–1 ampicillin Luria broth plates (Teknova, L1004). Picked colonies were grown for 8–16 h in 100 µg ml–1 carbenicillin Luria broth starter cultures. Plasmids were DNA-extracted (Qiagen, 27104) and submitted for Sanger sequencing validation by Genewiz. Validated clones were cultured overnight, and plasmids were extracted (Qiagen, 12963).
CRISPR–Cas9 constructs were packaged into lentiviral particles. HEK293T cells were seeded at 70–80% confluence in 10 cm plates and co-transfected with 9 µg of lentiCRISPRv2-sgRNA construct plasmid, 0.9 µg pVSVg plasmid (Addgene 8454) and 9 µg pPAX2 plasmid (Addgene 19319) using Lipofectamine 3000 (Life Technologies, L3000015). Sixteen hours after transfection, medium was supplemented with 30% FBS. The following day, virus-containing medium was collected, 0.45 µm sterile-filtered and stored at −80 °C.
One million Ewing sarcoma cells were seeded per well of a 6-well plate and spin-infected using 2 ml of virus and 8 µg ml–1 polybrene (Santa Cruz Biotechnology, SC-134220) at 37 °C at 1,190 r.c.f. for 30 min. The following day, fresh medium containing 1 µg ml–1 puromycin (InvivoGen, ant-pr-1) was added. Cells were selected for at least 48 h. In experiments requiring knockout of two genes, the cells were co-transduced with constructs encoding two distinct sgRNAs, each conferring resistance to either puromycin or blasticidin. Cells were selected with 1 µg ml–1 puromycin and 5 µg ml–1 blasticidin (Life Technologies, A1113903) for at least 5 days. Separate samples of non-infected cells treated with drug were used to confirm cell death.
All sgRNA sequences used in the Broad Institute AVANA CRISPR–Cas9 screen are available for download at the DepMap Portal (https://depmap.org). The following sequences were used: sgETV6-1: 5′-GCAGCCAATTTACTGGAGCA-3′, sgETV6-2: 5′-GCAGGGATGACGTAGCCCAG-3′, sgETV6-3: 5′-GTGTGTGTATAGAGTTTCCA-3′, sgETV6-4: 5′-GTTATGGTGCACATTATCCA-3′, sgSOX11: 5′-CACCGGAAGATCCCGTTCATCCGGG-3′, sgFLI1: 5′-CACCGTGTCGGAGAGCAGCTCCAGG-3′. As previously described94,95, sgChr2.2 served as a cutting control and targets a gene desert on chromosome 2, 5′-GGTGTGCGTATGAAGCAGTG-3′; sgLacZ served as a non-targeting control and targets a non-human gene, 5′-AACGGCGGATTGACCGTAAT-3′. For ligation into the lentiCRISPRv2 plasmid, the additional bases 5′-CACCG-3′ were added to the 5′ end of the forward sequence. 5′-AAAC-3′ and 5′-C-3′ were added at the 5′ and 3′ ends of the reverse sequence, respectively.
Generation, culture and application of polyclonal dTAG cell lines
Four sgRNAs targeting exon–intron junctions of the endogenous ETV6 locus were cloned, as described above, into the lentiCRISPRv2 plasmid backbone conferring blasticidin resistance (Addgene, 83480). The following sgRNA sequences were used: (1) sgETV6-endo-1, 5′-TCCTGCTCAGTGTAGCATTA-3′, (2) sgETV6-endo-2, 5′-GAACACTCACGCAGGTGCGC-3′, (3) sgETV6-endo-3, 5′-TCCAGACTCTCACCTGAATG-3′, and (4) sgETV6-endo-4, 5′-AGTTCATAGAGCACATCACC-3′. A codon-optimized gBlock encoding coding sequences of ETV6 was cloned into the pLEX_305 vector backbone (Addgene, 91798) to C-terminally tag the ETV6 protein with the FKBP12F36V protein domain and a HA epitope tag. A673 and EW8 parental Ewing sarcoma cells were spin-infected and selected with puromycin and blasticidin as described above. Cells were cultured in DMEM supplemented with 15% FBS, 1% penicillin–streptomycin and 0.5 µg ml–1 puromycin and 2.5 µg ml–1 blasticidin to maintain selection. Cells were split at a ratio of 1:5 every other day.
The dTAGV-1 molecule was provided by the Gray Laboratory (Dana-Farber Cancer Institute, Boston, MA) and used at a stock concentration of 10 mM suspended in DMSO. For 6, 24 and 72 h RNA-seq time points, A673 ETV6–dTAG cells were seeded at 1 million cells per 6 cm dish, 0.5 million cells per 6 cm dish and 0.5 million cells per 10 cm dish, respectively. EW8 ETV6–dTAG cells were seeded at 0.75 million cells per 6 cm dish, 0.5 million cells per 10 cm dish and 0.2 million cells per 10 cm dish, respectively. For each time point, three separate dishes were seeded and treated per DMSO or dTAGV-1 condition. Cells were collected for total RNA extraction and western blot validation (described below). For 6 and 72 h ChIP-seq time points, A673 ETV6–dTAG cells were seeded at 5 million cells per 15 cm dish and 1 million cells per 15 cm dish, respectively. EW8 ETV6–dTAG cells were seeded at 7.7 million cells per 15 cm dish and 1 million cells per 15 cm dish, respectively. Twenty-four hours after seeding, existing medium was exchanged for DMSO or dTAGV-1-containing medium. For all experiments, dTAGV-1 was used at a final concentration of 1 µM. Equivalent volumes of DMSO were used as control.
A673 ETV6–dTAG cells were further perturbed to knockout EWS–FLI or SOX11. sgRNAs targeting FLI1 (5′-TGTCGGAGAGCAGCTCCAGG-3′) or SOX11 (5′-GAAGATCCCGTTCATCCGGG-3′) from the Broad AVANA screen were cloned into lentiCRISPRv2 as described above. In total, 250,000 cells per well of a 6-well plate were spin-infected with 2 ml virus. Knockout was validated by western blotting.
Relative viability studies
Cells transduced with lentivirally packaged CRISPR–Cas9 constructs were seeded in 384-well plates at densities of 3,500 (TC32), 2,000 (A673), 250 (EW8) and 1,000 (PEDS0009, PEDS0010 and ES-PDX-001) cells per well suspended in 40–50 µl of medium per well containing 0.5–1 µg ml–1 puromycin. Cells from each condition were grown separately in 6–8 wells per plate across 4 plates, which corresponded to day 0, 3, 5 and 7 time points. Wells at plate edges were filled with 50 µl of PBS to maintain humidity. To measure cell viability, 10 µl of CellTiter-Glo reagent (Promega, G7573) was added to each well, luminescing at an intensity proportional to ATP abundance, and plates were shaken at room temperature for 15 min. Luminescence was measured using a FLUOstar Omega microplate reader (BMG LabTech). Relative viability was calculated by dividing the luminescence measurement of each well on day 7 by luminescence at day 0 using Microsoft Excel 16.50. In parallel, whole cell lysate was collected on day 7 for western blotting to confirm ETV6 knockout. Statistics shown compare mean relative viability between conditions at day 7, analysed using GraphPad Prism 9.0.0.
Anchorage-independent growth
A 16-gauge blunt-end needle was used to transfer 12 ml of semi-solid methylcellulose-based medium (Stemcell Technologies, 03814) to a 50 ml conical tube and 3 ml of cell suspension containing 15,000 (A673 and /TC32), 5,000 (EW8) or 20,000 (PEDS0009, PEDS0010 and ES-PDX-001) cells. The mixture was vortexed and left at room temperature for 10–15 min until bubbles dissolved. A blunt-end needle was used to transfer 3 ml of the mixture to separate 6 cm dishes, which were placed inside a 15 cm plate containing a PBS-filled 6 cm dish used to maintain humidity. Colonies were stained 7 days later by adding 1 ml of a 1:1 mixture of PBS and MTT dye (Roche Diagnostics, 11465007001) per dish and incubating for 30–45 min at 37 °C. Colonies in each dish were imaged using an ImageQuant LAS 4000 imager (GE Healthcare) and quantified using ImageQuant TL 8.2 software (Cytiva). In parallel, whole cell lysate was collected from cultured cells for western blotting.
Flow cytometry and cell cycle analysis
Cell cycle analysis was performed using Click-iT Plus EdU Alexa Fluor 647 Flow Cytometry Assay kits (Life Technologies, C10424) per kit instructions with minor modifications. Cells were seeded and cultured separately before being pulsed with 10 µM of the modified nucleotide analogue 5-ethynyl-2′-deoxyuridine (EdU) for 90 min at 37 °C. Around 1–2 million cells per sample were trypsinized, washed, fixed, permeabilized and then treated with a reaction cocktail containing Alexa Fluor-647-conjugated picolyl azide to label incorporated EdU. Cells were stained with a RNAse-containing propidium iodide solution (Cell Signaling, 4087S) for 45 min at 37 °C. Cells were analysed by flow cytometry at 5,000–10,000 cells per sample on a BD FacsCelesta instrument. Live cells were gated using FSC-A and SSC-A. The data were analysed using FlowJo v.10.6.1 software. Cells were collected from each sample for western blotting.
Mouse studies
All mouse studies were approved by the Dana-Farber Cancer Institute Institutional Animal Care and Use Committee (Animal Welfare Assurance number: D16-00010 (A3023-01)) and were performed in accordance with NIH guidelines for the humane care and use of animals. The intramuscular mouse xenograft experiment (Fig. 1h,i) studied immunodeficient NOD.Cg-Prkdcscid Il2rgtm1Wjl/SzJ (NSG) mice ordered from Jackson Laboratory in a semi-orthotopic manner as previously described48. A673 cells were lentivirally transduced to express luciferase and CRISPR–Cas9 constructs targeting ETV6. These cells were intramuscularly implanted in the hindlimbs of 7-week-old female mice. On the day of implantation, cells were suspended in a 1:1 mixture of PBS and Matrigel (Thermo Fisher Scientific, CB40230C) and injected directly into the hindlimb cranial thigh muscle, away from the sciatic nerve, at a concentration of 50,000 cells per mouse in 50 µl. Five mice per condition (sgChr2.2, sgLacZ, sgETV6-1 and sgETV6-2) were implanted. Disease progression was monitored by serial bioluminescence imaging of the whole body. Bioluminescence was measured 10 min following subcutaneous injection of luciferin using a PerkinElmer IVIS Spectrum (exposure time, 0.5–180 s; binning, 2–16; luminescent, 25,000) to determine the maximum bioluminescence exhibited by each mouse. Mice in each condition were imaged at the same time. Mice were euthanized at the end point. Lung and liver tissue samples were collected following euthanasia and placed in a 6-well dish for bioluminescence imaging. Subcutaneous mouse xenograft experiments were conducted in Jackson NSG mice (Extended Data Fig. 2d) and CrTac:NCR-Foxn1<nu>(nude) mice from Taconic Biosciences (Fig. 6h). In the former study, 12-week-old males were used. In the latter study, 6–8-week-old females were used. Cells were suspended in a 7:3 mixture of culture medium and Matrigel and injected bilaterally subcutaneously into sublethally irradiated mice at 3 million cells in 100 μl. Three to four mice per condition received transplants. Tumours were measured with calipers serially twice weekly.
Animals were euthanized when tumours reached maximal 2 cm in at least one dimension or a humane end point such as ulceration or reduced mobility, in adherence to the NIH/NCI guidelines on limits of tumour size (equal to or less than 2.0 cm per tumour in any one dimension). This limit was not exceeded. Randomization was not appropriate in any study as drug treatments were not used. Mice from the same conditions were kept in different cages to minimize confounding environmental factors. Mice were housed with strictly controlled temperature and humidity and kept on 12-h light and dark cycles. No statistical methods were used to predetermine sample sizes, but sample sizes were similar to those reported in previous publications in which statistical significance was achieved48,66. Data distribution was assumed to be normal, but this was not formally tested, with the exception of data shown in Fig. 1i, for which the data were not normal (Shapiro–Wilk P < 0.05) and thus log-transformed. Data collection and analysis were not performed blind to the conditions of the experiments. No animals or data points were excluded from the analyses.
Crystal violet staining and quantification
Cell samples were cultured separately and re-seeded at normalized cell densities (50,000 cells per well in 6-well plate) every 5 days with refreshed DMSO or 1 μM dTAGV-1. On day 20, each well was incubated with 1 ml of crystal violet stain, composed of 20% methanol and 1% w/v crystal violet powder (Sigma Aldrich, C6158) at room temperature for 20 min. Wells were washed with 3 ml of deionized H2O five times and dried at room temperature. Plates were imaged using an ImageQuant LAS 4000 imager (GE Healthcare). The median intensity of stain in each well was quantified using ImageQuant TL 8.2 image analysis software (Cytiva).
Western blotting
Cells were lysed using cell lysis buffer (Cell Signaling Technology, 9803S), supplemented with protease inhibitor (Sigma Aldrich, 11836170001) and phosphatase inhibitor (Sigma Aldrich, 04906837001). Protein quantification of whole cell lysate was measured using a Bradford-based colorimetric assay (Bio-Rad, 5000006). Around 50–60 µg of whole cell lysate was mixed with loading buffer (Life Technologies, NP0007), reducing buffer (Life Technologies, NP0009) and water and heated to 75 °C for 10 min. Samples were loaded onto 4–12% bis-tris 10-well gels (Life Technologies, NP0335BOX) and run at 100 V for 30 min followed by 150 V for 90 min using MOPS buffer (Life Technologies, NP0001). Gels were transferred to polyvinylidene difluoride membranes (Thermo Fisher Scientific, IPVH00010) at 100 V for 90 min using transfer buffer (Boston BioProducts, BP-190-1L) at 4 °C. Membranes were blocked in milk (Cell Signaling Technology, 9999S) for 60 min at room temperature. Membranes were rocked overnight at 4 °C in a solution of Tris-buffered saline and Tween-20 (TBST; Cell Signaling Technology, 9997S) containing 5% w/v BSA (Research Products International, A30075-1000.0), 0.02% sodium azide (Santa Cruz Biotechnology, SC-208393) and primary antibody. The following day, membranes were washed in TBST five times, 5 min per wash. For a subset of western blots, membranes were rocked for 1 h at room temperature in milk containing 1:5,000 dilution of HRP-conjugated secondary antibody against mouse (Cell Signaling Technology, 7076S) or rabbit (Cell Signaling Technology, 7074S). Membranes were then washed in TBST three times and immersed in a solution containing chemiluminescent substrate (Life Technologies, 34076), allowed to develop for 1 min, then imaged using film (Thermo Fisher Scientific, PI34091). Other western blots were imaged using a LI-COR system. Membranes were rocked in a TBST solution containing a 1:10,000 dilution of secondary antibody against mouse (LI-COR Biosciences, 926-32210) and rabbit (LI-COR Biosciences, 926-68071) and 1:10,000 dilution of 10% SDS solution (Life Technologies, 15553027) for 1 h at room temperature. Membranes were washed in TBST three times and then briefly rinsed in PBS and imaged on an Odyssey CLx machine at medium resolution (ImageStudioLite 5.2.5).
The following primary antibodies were used at the following dilutions: anti-GAPDH at 1:10,000 (2118S, rabbit, monoclonal, Cell Signaling Technology); anti-ETV6 primary at 1:500 (WH0002120M1-100UG, mouse, monoclonal, Sigma Aldrich or SC-166835, mouse, monoclonal, Santa Cruz Biotechnology); anti-HA at 1:2,000 (3724S, rabbit, monoclonal, Cell Signaling Technology); anti-PARP at 1:1,000 (9542S, rabbit, polyclonal, Cell Signaling Technology); anti-cleaved caspase-3 at 1:1,000 (9664S, rabbit, monoclonal, Cell Signaling Technology); anti-FLI1 primary antibody at 1:1,000 (ab15289, rabbit, polyclonal, Abcam); anti-FAS at 1:500 (SC-8009, mouse, monoclonal, Santa Cruz Biotechnology); anti-SEMA5B at 1:500 (PA5113369, rabbit, polyclonal, Thermo Fisher Scientific); anti-BCL11B at 1:500 (12120S, rabbit, monoclonal, Cell Signaling Technology); and anti-SOX11 at 1:1,000 (58207S, rabbit, monoclonal, Cell Signaling Technology).
SOX11 overexpression
Complementary DNAs of wild-type SOX11 and SOX11 mutants harbouring a deletion of the DBD (H48-R119) were synthesized as gBlocks fragments (IDT), and then cloned into a pLX_TRC307 lentiviral expression vector co-expressing a puromycin resistance gene (obtained from the Genetic Perturbation Platform at the Broad Institute) using a Gibson Assembly Cloning kit (New England Biolabs E5510S). Constructs were lentivirally delivered to cells as described above.
Inducible EWS–FLI overexpression
The previously described vector pINDUCER20-EWS/FLI-HA66, which encodes HA epitope-tagged EWS–FLI, was mutated to create the R340N mutation96 by site-directed mutagenesis using a NEB Q5 Quick Change Site Directed Mutagenesis kit (E0554) using the following primers: forward 5′-CCGGGCCCTCAATTATTACTATGATAAAAAC-3′; reverse 5′ CTCAGCTTGTCGTAATTC-3′. The correct mutation was confirmed by Sanger sequencing with both a forward and reverse sequencing primer (forward: 5′-TCCCACACCGACCAGTCCTCAC-3′; reverse: 5′-AGACTGCCTTGGGAAAAGCGCC-3′). pINDUCER20-GFP-HA vector was used as a control. RD cells stably expressing these three vectors were generated. For knockout experiments, RD cells stably expressing the inducible vectors were transduced with sgRNAs targeting Chr2.2 (cutting control) or ETV6. Expression of EWS–FLI was induced using 1 µg ml–1 doxycycline replenished every 48 h.
Rescue of ETV6 knockout with wild-type and ETS-deleted ETV6 overexpression
DNA fragments encoding codon-optimized ETV6 wild-type (ETV6-WT) and mutant ETV6 harbouring deletion of the ETS domain (ETV6-ΔETS) were purchased from gBlock (IDT) and cloned into pDONR-221 via BP gateway cloning. Constructs were further cloned into pINDUCER20 (Addgene, 44012) by LR cloning and lentivirally packaged as described above. A673 and EW8 cells were transduced with lentivirus encoding either ETV6-WT or ETV6-ΔETS and incubated with 100 ng ml–1 of doxycycline or vehicle for 24 h. Subcellular fractionation was performed according to the manufacturer’s protocol (Thermo Fisher, PI78840). Western blotting and cell viability experiments were performed as described above.
qPCR
Total RNA was extracted from cells using an extraction kit with column-based genomic DNA removal (Qiagen, 74134). RNA was reverse transcribed to cDNA using an iScript kit (Bio-Rad Laboratories, 1708841) and diluted 1:7 with H2O. For sgFLI rescue experiments, A673 ETV6–dTAG cells were transduced with sgChr2.2 or sgFLI CRISPR–Cas9 constructs and treated separately with DMSO or dTAGV-1 in duplicate. All qPCR reactions were performed using a TaqMan system (Thermo Fisher Scientific) with technical triplicates. Probes were selected to span exon–exon junctions when possible. Specific probes were as follows: GAPDH: Hs02758991_g1; FAS: Hs00236330_m1; ACTA2: Hs00426835_g1; TRIB1: Hs00179769_m1; SEMA5B: Hs00400720_m1; BCL11B: Hs01102259_m1; and SOX11: Hs00846583_s1. In each qPCR reaction, the gene of interest was measured using FAM dye, whereas GAPDH control was measured using VIC dye. Samples were analysed in 384-well plate format using 5 µl TaqMan gene expression master mix (Thermo Fisher Scientific, 4369016), 0.5 µl of FAM-emitting probe, 0.5 µl of VIC-emitting GAPDH probe and 4 µl of diluted cDNA for a total of 10 µl per reaction. qPCR plates were analysed using a QuantStudio 6Flex Real-Time PCR machine and the accompanying QuantStudio Real-Time PCR software v.1.7 (Thermo Fisher Scientific). The delta-threshold cycle number (ΔCt) was calculated as the difference in threshold cycle number (Ct) between the gene of interest and GAPDH. The ΔΔCt was calculated as the difference between the ΔCt of a particular sample and the average ΔCt of the DMSO-treated, sgChr2.2 control samples. Fold increase in gene expression (after the loss of ETV6) was calculated as the ratio of 2−ΔΔCt in dTAGV-1-treated cells to the average 2−ΔΔCt in DMSO-treated cells, in either the sgChr2.2 or the sgFLI conditions.
RNA-seq
All RNA-seq experiments were performed using total RNA extracted using a column-based kit (Qiagen, 74104) and treated with DNAse digestion. The Life Technologies external RNA control consortium (ERCC) RNA spike-in samples were added to each sample for normalization per kit instructions (Thermo Fisher Scientific, 4456740). For all RNA-seq experiments, except the A673 sgETV6 CRISPR–Cas9 experiments, RNA-seq library preparation and sequencing were performed by Novogene (https://en.novogene.com) at a depth of roughly 20 million reads per sample. Per Novogene correspondence, the quality control for the RNA samples was performed using Qubit fluorometric quantitation (Thermo Fisher Scientific) and a Bioanalyzer instrument (Agilent). Libraries were then prepared using a New England Biolabs NEBNext Ultra II non-directional RNA Library Prep kit. Library quality and concentrations were assessed using Labchip (Perkin Elmer) and qPCR. Libraries were sequenced in 150-bp paired-end fashion on a Novaseq6000 instrument (Illumina). For the A673 sgETV6 CRISPR–Cas9 experiments, polyA-tailed mRNA was isolated from 1 μg total RNA using a magnetic bead-based kit per kit instructions (New England Biolabs, E7490S). RNA-seq library preparation was performed using a NEBNext Ultra II Directional RNA Library Prep Kit for Illumina (New England Biolabs, E7760S). Libraries were quantified using a Qubit dsDNA high sensitivity assay (Q32851). The distribution of DNA fragment sizes was measured using a High Sensitivity D1000 assay (Agilent, ScreenTape, 5067-5584; reagents, 5067-5585). The molarity of each library was calculated and normalized to 4 nM. Libraries were pooled and sequenced on a Nextseq 500 instrument (Illumina) (single-end; 75 cycles at a depth of roughly 40 million reads per sample) using a Nextseq 500 sequencing kit (Illumina, 20024906).
CUT&Tag
CUT&Tag was performed as previously described50 with slight modifications by the Lessnick Laboratory (Nationwide Children’s Hospital, Columbus, OH). About 250,000 cells per CUT&Tag condition were bound to BioMag Plus Concanavalin A-coated magnetic beads (Bangs Laboratories, BP531) and incubated with primary antibodies (ETV6 rabbit, Bethyl A303-674, 1:50; ETV6 mouse, Sigma WH0002120M1, 1:50; rabbit anti-mouse, Abcam, ab46540, 1:50) overnight at 4 °C, and secondary antibodies (guinea pig anti-rabbit IgG, Antibodies-Online ABIN101961, 1:100; rabbit anti-mouse, Abcam ab46540, 1:100) for 1 h at room temperature. Adapter-loaded protein A–Tn5 fusion protein was added at a dilution of 1:250 and incubated for 1 h at room temperature. To activate Tn5, tagmentation buffer containing MgCl2 was added and samples were incubated for 1 h at 37 °C. Reactions were stopped by addition of EDTA, and DNA was solubilized with SDS and proteinase K for 1 h at 50 °C. Total DNA was purified using phenol–chloroform extraction followed by ethanol precipitation. CUT&Tag libraries were prepared using NEBNext HiFi 2× PCR master mix (NEB, M0541S) and indexed primers97 using a combined annealing–extension step at 63 °C for 10 s and 15 cycles followed by a 1.1× post-amplification AMPure XP (Beckman Coulter, A63880) bead clean-up. Libraries were pooled and sequenced (2 × 150 bp paired end) on an Illumina HiSeq4000 platform (Nationwide Children’s Hospital Institute for Genomic Medicine). Two independent replicates of each CUT&Tag sample were prepared.
CUT&RUN
CUT and release using nuclease (CUT&RUN) was performed as previously described98,99,100 with slight modifications. In brief, 500,000 cells per condition were bound to activated ConA beads (EpiCypher 21-1401). Next, the ConA bead–cell mixture was resuspended in a cold antibody buffer and FLI-1-(ab133485; 1 μg per sample) antibody or 0.5 μg H3K4me3 (EpiCyper, 13-0041) as positive and 0.5 μg IgG (EpiCypher, 13-0042) as negative control were per sample added overnight. pAG-MNase (EpiCypher, 15-1016) was then added to each reaction to allow binding to the antibody-labelled chromatin. E. coli spike-in DNA (EpiCypher, 18-1401) was added following MNase activation. Subsequently, targeted chromatin was digested and released by the addition of CaCl2. The fragmented chromatin was purified using a CUTANA DNA Purification kit (EpiCypher, 14-0050). Quantification, library preparation and sequencing were performed by the genomics core at Dana-Farber Cancer Institute.
ChIP-seq
Antibodies were conjugated to magnetic beads. For each immunoprecipitation (IP), 100 µl of protein A Dynabeads (Thermo Fisher Scientific, 10002D) were washed three times in 1 ml BSA blocking solution (0.5% w/v sterile-filtered BSA in H2O) and resuspended in 250 µl. Beads were then rotated overnight at 4 °C with antibody, using 5 µg to target H3K27ac (Abcam, 4729) or 10 µg to target TFs (anti-HA, Abcam, ab9110; anti-FLI1, Abcam, ab15289). For comparative studies (that is, comparing the relative binding of EWS–FLI), 2 µg of spike-in antibody recognizing a Drosophila-specific histone variant was added (Active Motif, 61686). The following morning, the antibody-conjugated beads were washed four times in 1 ml BSA blocking solution and then resuspended in 100 µl of the solution and stored at 4 °C.
To prepare sheared chromatin, Ewing sarcoma cells (20 million cells per ChIP reaction) were collected in a 1.5 ml tube and washed twice in 1 ml PBS. Cells were then crosslinked by resuspension in 1 ml PBS containing 1% methanol-free formaldehyde (Thermo Fisher Scientific, 28906) and rotated for 10 min at room temperature at 12 r.p.m. The reaction was quenched with 100 µl of 1.25 M glycine (Sigma Aldrich, G7126) and 100 µl 1 M Tris-HCl pH 8.0 (Thermo Fisher Scientific, 15568025). Cell pellets were washed twice with 1 ml PBS at room temperature and resuspended in 1 ml of SDS lysis buffer (0.5% SDS, 5 mM EDTA, 50 mM Tris-HCl pH 8.0) supplemented with protease inhibitor cocktail (Thermo Fisher Scientific, PI78429) and incubated at room temperature for 2 min with gentle agitation. Lysates were centrifuged at 15,000g for 10 min at 4 °C. The nuclear pellet was re-suspended in 950 µl of ChIP IP buffer (2 parts SDS lysis buffer and 1 part Triton dilution buffer, which was composed of 100 mM Tris-HCl pH 8.0, 100 mM NaCl, 5 mM EDTA, 0.2% NaN3 and 5% Triton X-100) supplemented with protease inhibitor and transferred to a milliTUBE (Covaris, 520130). Sonication was performed on an E220 Focus Ultra sonicator (Covaris) at 5% duty cycle, 140 W peak power, 200 cycles per burst, at 4 °C for 30 min per milliTUBE. Sheared chromatin was transferred to a 1.5 ml tube and centrifuged at 15,000g for 10 min at 4 °C. The supernatant of sheared chromatin was transferred to a new reaction tube. To prepare the ChIP DNA input sample, 5 µl of sheared chromatin was transferred to a PCR strip-tube and mixed with 40 µl de-crosslinking buffer (100 mM NaHCO3 and 1% SDS buffer), 1 µl RNAse A (Thermo Fisher Scientific, 12091021) and 1 µl proteinase K (Thermo Fisher Scientific, AM2546). The tube was incubated for 2 h at 65 °C in a thermal cycler to de-crosslink DNA–protein covalent bonds. DNA was isolated using Agencourt AMPure XP bead-based purification at a 1.2× ratio (Beckman Coulter, A63881), eluted in 50 µl H2O and stored at −20 °C. The remaining sheared chromatin was divided or pooled according to the target of interest; at least 5 million cells were used for IP of histone marks and 40 million cells for TFs. Each IP reaction was brought up to a total volume of at least 1 ml with ChIP IP buffer. Pooled reactions were conducted in 2 ml or 5 ml reaction tubes. 50 ng or 20 ng of Drosophila spike-in chromatin was added for each H3K27ac or TF ChIP reaction, respectively. The 100 µl conjugated bead–antibody solution was then added to the sheared chromatin. IP reactions were rotated overnight at 4 °C.
ChIP reactions were washed twice in 1 ml low-salt buffer (0.1% SDS, 1% Triton X-100, 2 mM EDTA, 20 mM Tris-HCl pH 8.0, and 150 mM NaCl), high-salt buffer (0.1% SDS, 1% Triton X-100, 2 mM EDTA, 20 mM Tris-HCl pH 8.0, and 500 mM NaCl), lithium chloride buffer (0.25 M LiCl, 1% IGEPAL-CH 630, 1% sodium deoxycholate, 10 mM Tris-HCl pH 8.0, and 1 mM EDTA) and then once in 700 µl Tris-EDTA buffer (Sigma Aldrich, 93283). Chromatin was eluted using 100 µl fresh ChIP elution buffer (1% SDS and 0.1 M NaHCO3) and rotated at room temperature for 15 min. Eluate was transferred to PCR tubes and mixed with 8 µl 2.5 M NaCl, 1 µl RNAse A and 1 µl proteinase K. Samples were de-crosslinked for 12–16 h at 65 °C on a thermal cycler. ChIP DNA was extracted from the de-crosslinked samples using AMPure XP beads at a 1.2× ratio and eluted in 20 µl of H2O. DNA was quantified using a Qubit dsDNA high sensitivity assay (Q32851). DNA fragment sizes were measured with a Tapestation instrument using a High Sensitivity D1000 assay (Agilent, ScreenTape, 5067-5584; reagents, 5067-5585).
ChIP-seq libraries were prepared using a SMARTer ThruPLEX single-index DNA-Seq kit (Takara Bio, R400674, R400695). H3K27ac and TF samples were PCR-amplified 4 and 10 cycles, respectively. Libraries were prepared as described above and sequenced in 37-bp paired-end fashion for 75 cycles (Illumina, 20024906) at a depth of roughly 30 million reads per sample on the NextSeq 500.
ATAC-seq
A673 ETV6–dTAG cells were seeded and treated separately with DMSO or 1 µM dTAGV-1 for 72 h. ATAC-seq was performed as previously described97 on samples of 100,000 cells using a publicly available protocol (available at https://www.med.upenn.edu/kaestnerlab/assets/user-content/documents/ATAC-seq-Protocol-(Omni)-Kaestner-Lab.pdf) without modifications. The molarity of each library was calculated using a Qubit dsDNA Broad Range Assay kit (Thermo Fisher Scientific, Q32850) and a Tapestation D5000 dsDNA assay (Agilent; ScreenTape, 5067–5588; Ladder, 5067–5590; reagents, 5067–5589). Libraries were pooled and sequenced in 37-bp paired-end fashion for 75 cycles on an Illumina NextSeq 500 instrument.
ChIP-seq data analysis
The raw Illumina sequencer output was converted to fastq format using the program bcl2fastq (v.2.17). Sequencing read quality was examined using FastQC (http://www.bioinformatics.babraham.ac.uk) (v.0.11.9). Trimming of low-quality reads and clipping of sequencing adapters was done using the program Trimmomatic (v.0.36)101, and all reads shorter than 40 bp after trimming were discarded. Reads were aligned to the human genome (hg19) using Bowtie2 (v.2.3.5)102,103 using the ‘—very_sensitive’ preset collection of parameters. File conversion of .bam to .sam was done using SamTools (v.1.9q)104, and duplicate reads were removed using Picard-tools (v.2.19.0) (http://picard.sourceforge.net). ChIP-seq peaks were called using MACS2 (ref. 105) with a false discovery rate (FDR) q < 0.01 unless otherwise stated. The MACS2 algorithm utilizes a dynamic Poisson distribution to capture local biases in the genomic sequence, which allows for a sensitive and robust prediction of peaks. Unless otherwise noted, peaks were assigned to the closest gene within ±400 kb using the ChIPseeker package in R106. Visualizations of the ChIP-seq data tracks were produced with the R Bioconductor Gviz package107.
CUT&Tag data analysis
Quality control on raw sequencing reads were performed using FastQC (http://www.bioinformatics.babraham.ac.uk) (v.0.11.4). Adapter sequences and/or low-quality reads were trimmed using trim_galore (http://www.bioinformatics.babraham.ac.uk) (0.4.4_dev). Reads were aligned to human (hg19) and spike-in E. coli (Escherichia_coli_K_12_DH10B NCBI 2008-03-17) genomes using Bowtie2 (v.2.3.4.3)102,103 with the following options: --no-unal --no-mixed --no-discordant --dovetail --phred33 -q -I 10 -X 700. The option --very-sensitive was added when aligning to the spike-in genome. SamTools (v.1.9)104 was used to convert .sam to .bam with the ‘-bq 10’ option. Counts of mapped reads were spike-in normalized by calculating a scale factor using the ‘median ratio method’ from DESeq2. Spike-in normalization in conjunction with the median ratio method provide a robust normalization method to appropriately account for global changes of ETV6 occupancies108,109. Peaks in each biological replicate were called using MACS2 (v.2.2.7.1)105 with the spike-in normalization. All duplicate reads were kept in the analysis. To ensure reproducibility and consistency of peaks called across multiple biological replicates, we calculated irreproducibility discovery rate109 values and combined the replicates with rabbit anti-mouse as controls using DiffBind (v.2.14.0)110 and DESEq2 (v.1.26.0)108. To ensure high-quality peaks that are most likely to represent biological signals, the final peak lists were generated with following thresholds: irreproducibility discovery rate < 0.005, FDR < 0.05, log2(fold change) > 3 and mean normalized counts of ETV6 > 80 (Bethyl antibody) and >300 (Sigma antibody).
CUT&RUN data analysis
CUT&RUN FLI1 data for the PEDS0009 sample used a pipeline based on the bulk-level method outlined in CUT&RUNTools 2.0 (ref. 111) that is largely the same as the ChIP-seq pipeline. The changes to the ChIP-seq pipeline are an extra adapter trimming step run after Trimmomatic using kseq from CUT&RUN Tools and the addition of the ‘—dovetail’ flag to the Bowtie2 command. CUT&RUN samples also included E. coli spike-in for sample normalization and it was aligned to the E. coli (Escherichia_coli_K_12_DH10B NCBI 2008-03-17) genome.
Differential ChIP-seq binding
Differential binding analysis in ETV6–dTAG ChIP-seq samples was performed with the R Bioconductor package CSAW89. CSAW uses a sliding window approach to count reads across the genome from sorted and indexed .bam files, for which each window is tested for significant differences between libraries using statistical methods from the edgeR package. Differential CSAW analysis was performed on A673 and EW8 ETV6–dTAG at 6 and 72 h in FLI1 and H3K27ac. The differential analysis performed here normalized samples based on Drosophila spike-in values, the reads of which were aligned to the dm6 version of the Drosophila genome. The differential ChIP-seq analysis procedure generally followed the approach outlined in the CSAW introductory usage tutorial as follows. The .bam files were read in allowing a maximum fragment length of 800, a minimum q = 20 and discarding any reads that fell in the hg19 or dm6 ENCODE blacklist files. A window size of 150 bases was used for analysis and tiled across the genome in 50 base steps. The ChIP-seq input control samples were used to help filter out regions containing just background reads by binning input control reads into 10,000 base blocks with a threshold of minimum prior counts of 2. The binned input reads were then compared with the ChIP-seq binding across all regions, and all ChIP-seq regions with a fold change of less than 3 over input were filtered out. After filtering, adjacent and overlapping 150 base regions were merged together to reduce the number of hypotheses tested (for example, A673 6 h ETV6–dTAG FLI1 had an average merged window width of 494 bases). Drosophila spike-in control reads were processed similarly to the human reads except, as there was no input control for the spike-in control, the spike-in reads were filtered using a global filtering method that required regions to be threefold above background. The counts for all enriched spike-in regions were used to calculate the normalization factors by applying the trimmed mean of M-values method on these counts via the function normFactors. Differential binding is tested for significance using the quasi-likelihood framework in the edgeR package, whereby edgeR models the counts using a negative binomial distribution that accounts for over-dispersion between biological replicates. To account for multiple hypothesis testing, CSAW converts per-window statistics into a P value for each region and then applies the Benjamini–Hochberg method to calculate the corrected FDR.
ChIP-seq heatmaps
ChIP-seq heatmaps were produced by functions in the following deeptools package (v.3.3.0)112: computeMatrix, plotProfile and plotHeatmap. All heatmaps were made using data in .bigWig files that have been generated by deeptools bamCompare that generates .bigWig files based on the comparison of a ChIP-seq sample .bam file to its corresponding input (from the same cell line and same batch) while being simultaneously normalized for sequencing depth. The function computeMatrix was then used to calculate scores for genome regions and to prepare an intermediate file that can be used with plotHeatmap and plotProfiles. Unless otherwise stated, the genome regions were regions defined by a BED file corresponding to ETV6 or FLI1 peaks. For Fig. 2a–c and Extended Data Fig. 3a–c, computeMatrix was used with multiple .bigWig score files and two BED region files, in which the ETV6 peaks are split into two groups depending on whether the ETV6 peak overlapped with a region defined by gene TSSs ± 2.5 kb according to UCSC hg19 refGene transcript definitions. Figure 2g,h used regions defined by differential FLI1 regions from P < 0.05 CSAW, whereby regions not intersecting with a TSS were further divided into two groups according to whether the region intersects with a H3K27ac ChIP-seq peak from MACS2 with q < 0.01 in the parental A673 or EW8 cell line.
GGAA repeat frequency at peak locations
Stacked bar plots were created in R using frequencies of overlap from the function summarizePatternInPeaks from R Bioconductor package ChIPpeakAnno (v.3.9)113. The function summarizePatternInPeaks was used to calculate the frequency of overlap of regions of the standard hg19 reference genome with GGAA repeats (from a single GGAA up to five consecutive GGAA sequences without any gaps) with peaks in FLI1 and ETV6 as called by MACS2. The ENCODE datasets analysed were from the Gene Expression Omnibus: GSE96274 (B lymphocyte) and GSE95877 (K-562).
Differential ATAC-seq regions
Processing of ATAC-seq data (that is, Fig. 4c) used the same pipeline as the ChIP-seq data, although an extra step was added after Bowtie2 alignment that used samtools to remove mitochondrial reads (ChrM). CSAW was used for the differential analysis of ATAC-seq data in the same manner as CSAW was used with ChIP-seq data, except that there was no input control for filtering or spike-in control for sample normalization. In the absence of a matching input control, CSAW region filtering was performed by requiring regions to be threefold above the local background, whereby local background was defined by using wider windowing function of 2,000 bases and requiring regions to be threefold above the neighbouring regions. Within CSAW, ATAC-seq samples were normalized to the background using 10,000 base windows to calculate compositional biases of samples.
RNA-seq data analysis
Gene expression values were derived from paired-end RNA-seq data, except for the A673 sgETV6 CRISPR–Cas9 RNA-seq experiment, which was sequenced in single-end fashion. The RNA-seq processing pipeline was roughly modelled on the GTEx pipeline (https://github.com/broadinstitute/gtex-pipeline/)114. FastQC was used to evaluate read quality on raw RNA-seq reads. Reads were aligned to the human genome (hg19) using STAR115. Transcript-level quantifications were calculated using RSEM (v.1.3.1)116. Gene counts from STAR were then used to quantify differentially expressed genes between the experimental and control conditions using the R Bioconductor package DESeq2 (ref. 108) using the approximate posterior estimation for GLM coefficients (apeglm) method for effect size. Normalized expression values for individual samples were obtained from RSEM log2(TPM) values with the RSEM log2(TPM + 1) values used for GSEA and producing RNA-seq heatmap plots.
The RNA-seq samples included the ERCC spike-in control mix117. Sequences for the ERCC transcripts were added to the hg19 reference for the STAR transcript alignment, and the fold changes of ERCC probes were examined in the DESeq2 output. Fold changes for ERCC probes were typically very small between the conditions in the ETV6–dTAG sample sets (for example, average fold change for 24 h A673 ETV6–dTAG of 0.995 between conditions). As such, ERCC spike-ins were not used to perform sample normalization.
Gene set pathway enrichment analysis
Gene set pathway enrichment analysis was performed with signatures from v.6.0 of the Broad Institute’s molecular signature database (MSigDB) (http://www.broadinstitute.org/gsea/msigdb/index.jsp) using the c2 curated gene sets from various sources such as online pathway databases, the biomedical literature and knowledge of domain experts. These pathway enrichment results are shown in Fig. 3f and Extended Data Fig. 4g. Pathway enrichment analysis was performed in R using the clusterProfiler package that provides the enricher function for a hypergeometric test for a test of over-representation of pathway genes in a set of user-defined genes. Figure 3f shows a combined enrichment plot of the top MSigDB c2 pathways enriched in the ETV6-repressed genes at 6, 24 and 72 h common to both A673 and EW8 (genes up in ETV6 dTAGV-1 treatment RNA-seq). The plot shows a selected subset of the top enriched c2 gene sets, and the complete set of enriched sets is shown Supplementary Tables 7–11. The dot size corresponds to the number of genes in the gene set out of the total number of significantly ETV6-repressed genes at 6, 24, and 72 h (85, 251 and 832 genes, respectively). The colour corresponds to the gene set grouping. Missing points at times along the x axis represent times at which the enrichment was not significant with P < 0.05. The pathways are ordered first by the gene group and then by the average gene ratio (count of repressed genes in a pathway/number of repressed genes) across the three time points. Extended Data Fig. 4g shows a combined enrichment plot of the top MSigDB c2 pathways enriched in the ETV6-activated genes at 6, 24 and 72 h common to both A673 and EW8 (genes down in ETV6 dTAGV-1 treatment RNA-seq). The plot shows a selected subset of the top enriched c2 gene sets, and the complete set of enriched sets is shown Supplementary Tables 12–17. The dot size corresponds to the number of genes in the gene set out of the total number of significantly ETV6-activated genes at 6, 24 and 72 h (33, 130 and 543 genes, respectively). The colour corresponds to the gene set grouping. Missing points at times along the x axis represent times at which the enrichment was not significant. The pathways are ordered first by the gene group and then by the average gene ratio (count of repressed genes in a pathway/number of repressed genes) across the three time points. Extended Data Fig. 7a shows a pie chart of the top 100 enriched c5 gene sets, ranked by significance, in A673 ETV6–dTAG cells at 24 h. Each c5 gene signature was assigned to one of the categories listed; a complete list is shown in Supplementary Table 19.
GSEA
The GSEA algorithm118,119 was used to evaluate the association of gene sets with ETV6 regulation. GSEA was run with 2,500 permutations of the phenotype using signal-to-noise to rank genes. This GSEA algorithm was used for Fig. 3e to test enrichment and generate enrichment plots of ETV6-bound genes in ETV6-regulated genes. The A673 ETV6 peak locations are defined by the peaks that overlap in all three A673 ETV6 samples (two A673 ETV6 CUT&Tag samples from two ETV6 antibodies and one untreated A673 ETV6 dTAG HA sample) and the EW8 ETV6 peak locations are defined by peaks in the EW8 ETV6 HA sample. ETV6-bound genes were identified by mapping the peaks to their nearest genes using the R package ChIPseeker.
Statistics and reproducibility
Further information is available in the Nature Portfolio Reporting Summary linked to this article. Figure panels displaying data from experiments with n = 1 include Figs. 2a–e, 4h, 5c–g and 6c,f,h and Extended Data Figs. 2c,e, 3a–d, 6d–g and 7g. Figure panels displaying data from experiments with n = 2 include Figs. 2f–i, 4a–c,e–g and 6a,b and Extended Data Figs. 3f,g and 5a,c,d. All other figure panels display data from experiments with at least n = 3.
Reporting summary
Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.
Data availability
CRISPR–Cas9 screen data and the genomic characterization of cancer cell lines (RNA-seq) used in this study are publicly available at https://depmap.org. Gene expression data from the Treehouse Childhood Cancer Initiative characterizing primary tumours is publicly available at https://treehousegenomics.soe.ucsc.edu/public-data/. The Broad Institute’s MSigDB is publicly available at http://www.broadinstitute.org/gsea/msigdb/index.jsp. Genomics data shown in this study have been deposited in the Gene Expression Omnibus under accession code GSE181554. Source data are provided with this paper.
Code availability
No custom code was used in this study. Analysis was done with publicly available pipelines using approaches described in the Methods.
References
Tsherniak, A. et al. Defining a cancer dependency map. Cell 170, 564–576.e16 (2017).
Filbin, M. & Monje, M. Developmental origins and emerging therapeutic opportunities for childhood cancer. Nat. Med. 25, 367–376 (2019).
Panditharatna, E. & Filbin, M. G. The growing role of epigenetics in childhood cancers. Curr. Opin. Pediatr. 32, 67–75 (2020).
Lawrence, M. S. et al. Mutational heterogeneity in cancer and the search for new cancer-associated genes. Nature 499, 214–218 (2013).
Gröbner, S. N. et al. The landscape of genomic alterations across childhood cancers. Nature 555, 321–327 (2018).
Jahangiri, L. et al. Core regulatory circuitries in defining cancer cell identity across the malignant spectrum. Open Biol. 10, 200121 (2020).
Saint-André, V. et al. Models of human core transcriptional regulatory circuitries. Genome Res. 26, 385–396 (2016).
Ott, C. J. et al. Enhancer architecture and essential core regulatory circuitry of chronic lymphocytic leukemia. Cancer Cell 34, 982–995.e7 (2018).
Riddick, G. et al. A core regulatory circuit in glioblastoma stem cells links MAPK activation to a transcriptional program of neural stem cell identity. Sci. Rep. 7, 43605 (2017).
Sanda, T. et al. Core transcriptional regulatory circuit controlled by the TAL1 complex in human T cell acute lymphoblastic leukemia. Cancer Cell 22, 209–221 (2012).
Kron, K. J. et al. TMPRSS2–ERG fusion co-opts master transcription factors and activates NOTCH signaling in primary prostate cancer. Nat. Genet. 49, 1336–1345 (2017).
Bradner, J. E., Hnisz, D. & Young, R. A. Transcriptional addiction in cancer. Cell 168, 629–643 (2017).
Chapuy, B. et al. Discovery and characterization of super-enhancer-associated dependencies in diffuse large B cell lymphoma. Cancer Cell 24, 777–790 (2013).
Sengupta, S. & George, R. E. Super-enhancer-driven transcriptional dependencies in cancer. Trends Cancer 3, 269–281 (2017).
Boeva, V. et al. Heterogeneity of neuroblastoma cell identity defined by transcriptional circuitries. Nat. Genet. 49, 1408–1413 (2017).
Durbin, A. D. et al. Selective gene dependencies in MYCN-amplified neuroblastoma include the core transcriptional regulatory circuitry. Nat. Genet. 50, 1240–1246 (2018).
Gryder, B. E. et al. Histone hyperacetylation disrupts core gene regulatory architecture in rhabdomyosarcoma. Nat. Genet. 51, 1714–1722 (2019).
Gryder, B. E. et al. PAX3–FOXO1 establishes myogenic super enhancers and confers BET bromodomain vulnerability. Cancer Discov. 7, 884–899 (2017).
van Groningen, T. et al. Neuroblastoma is composed of two super-enhancer-associated differentiation states. Nat. Genet. 49, 1261–1266 (2017).
Grünewald, T. G. P. et al. Ewing sarcoma. Nat. Rev. Dis. Prim. 4, 5 (2018).
Delattre, O. et al. Gene fusion with an ETS DNA-binding domain caused by chromosome translocation in human tumours. Nature 359, 162–165 (1992).
Toomey, E. C., Schiffman, J. D. & Lessnick, S. L. Recent advances in the molecular pathogenesis of Ewing’s sarcoma. Oncogene 29, 4504–4516 (2010).
Gangwal, K., Close, D., Enriquez, C. A., Hill, C. P. & Lessnick, S. L. Emergent properties of EWS/FLI regulation via GGAA microsatellites in Ewing’s sarcoma. Genes Cancer 1, 177–187 (2010).
Gangwal, K. et al. Microsatellites as EWS/FLI response elements in Ewing’s sarcoma. Proc. Natl Acad. Sci. USA 105, 10149–10154 (2008).
Guillon, N. et al. The oncogenic EWS–FLI1 protein binds in vivo GGAA microsatellite sequences with potential transcriptional activation function. PLoS ONE 4, e4932 (2009).
Riggi, N. et al. EWS–FLI1 utilizes divergent chromatin remodeling mechanisms to directly activate or repress enhancer elements in Ewing sarcoma. Cancer Cell 26, 668–681 (2014).
Boulay, G. et al. Cancer-specific retargeting of BAF complexes by a prion-like domain. Cell 171, 163–178.e19 (2017).
Boulay, G. et al. Epigenome editing of microsatellite repeats defines tumor-specific enhancer functions and dependencies. Genes Dev. 32, 1008–1019 (2018).
Riggi, N., Suva, M. L. & Stamenkovic, I. Ewing’s sarcoma origin: from duel to duality. Expert Rev. Anticancer Ther. 9, 1025–1030 (2009).
Tanaka, M. et al. Ewing’s sarcoma precursors are highly enriched in embryonic osteochondrogenic progenitors. J. Clin. Invest. 124, 3061–3074 (2014).
von Levetzow, C. et al. Modeling initiation of Ewing sarcoma in human neural crest cells. PLoS ONE 6, e19305 (2011).
Cidre-Aranaz, F. & Alonso, J. EWS/FLI1 target genes and therapeutic opportunities in Ewing sarcoma. Front. Oncol. 5, 162 (2015).
Fadul, J. et al. EWS/FLI utilizes NKX2-2 to repress mesenchymal features of Ewing sarcoma. Genes Cancer 6, 129–143 (2015).
García-Aragoncillo, E. et al. DAX1, a direct target of EWS/FLI1 oncoprotein, is a principal regulator of cell-cycle progression in Ewing’s tumor cells. Oncogene 27, 6034–6043 (2008).
Kinsey, M., Smith, R., Iyer, A. K., McCabe, E. R. B. & Lessnick, S. L. EWS/FLI and its downstream target NR0B1 interact directly to modulate transcription and oncogenesis in Ewing’s sarcoma. Cancer Res. 69, 9047–9055 (2009).
Wiles, E. T., Bell, R., Thomas, D., Beckerle, M. & Lessnick, S. L. ZEB2 represses the epithelial phenotype and facilitates metastasis in Ewing sarcoma. Genes Cancer 4, 486–500 (2013).
Wiles, E. T., Lui-Sargent, B., Bell, R. & Lessnick, S. L. BCL11B is up-regulated by EWS/FLI and contributes to the transformed phenotype in Ewing sarcoma. PLoS ONE 8, e59369 (2013).
Charville, G. W. et al. EWSR1 fusion proteins mediate PAX7 expression in Ewing sarcoma. Mod. Pathol. 30, 1312–1320 (2017).
Smith, R. et al. Expression profiling of EWS/FLI identifies NKX2.2 as a critical target gene in Ewing’s sarcoma. Cancer Cell 9, 405–416 (2006).
Shi, X. et al. EWS–FLI1 regulates and cooperates with core regulatory circuitry in Ewing sarcoma. Nucleic Acids Res. 48, 11434–11451 (2020).
Dharia, N. V. et al. A first-generation pediatric cancer dependency map. Nat. Genet. 53, 529–538 (2021).
Brohl, A. S. et al. The genomic landscape of the Ewing sarcoma family of tumors reveals recurrent STAG2 mutation. PLoS Genet. 10, e1004475 (2014).
Crompton, B. D. et al. The genomic landscape of pediatric Ewing sarcoma. Cancer Discov. 4, 1326–1341 (2014).
Tirode, F. et al. Genomic landscape of Ewing sarcoma defines an aggressive subtype with co-association of STAG2 and TP53 mutations. Cancer Discov. 4, 1342–1353 (2014).
Vaske, O. M. Comparative tumor RNA sequencing analysis for difficult-to-treat pediatric and young adult patients with cancer. JAMA Netw. Open 2, e1913968 (2019).
Nabet, B. et al. Rapid and direct control of target protein levels with VHL-recruiting dTAG molecules. Nat. Commun. 11, 4687 (2020).
Nabet, B. et al. The dTAG system for immediate and target-specific protein degradation. Nat. Chem. Biol. 14, 431–441 (2018).
Adane, B. et al. STAG2 loss rewires oncogenic and developmental programs to promote metastasis in Ewing sarcoma. Cancer Cell 39, 827–844.e10 (2021).
Park, H., Seo, Y., Kim, J. I., Kim, W.-J. & Choe, S. Y. Identification of the nuclear localization motif in the ETV6 (TEL) protein. Cancer Genet. Cytogenet. 167, 117–121 (2006).
Kaya-Okur, H. S. et al. CUT&Tag for efficient epigenomic profiling of small samples and single cells. Nat. Commun. 10, 1930 (2019).
Venkataraman, A. et al. A toolbox of immunoprecipitation-grade monoclonal antibodies to human transcription factors. Nat. Methods 15, 330–338 (2018).
ENCODE Project Consortium. The ENCODE (ENCyclopedia Of DNA Elements) Project. Science 306, 636–640 (2004).
Rasighaemi, P. & Ward, A. C. ETV6 and ETV7: siblings in hematopoiesis and its disruption in disease. Crit. Rev. Oncol. Hematol. 116, 106–115 (2017).
Chakrabarti, S. R. & Nucifora, G. The leukemia-associated gene TEL encodes a transcription repressor which associates with SMRT and mSin3A. Biochem. Biophys. Res. Commun. 264, 871–877 (1999).
Irvin, B. J. et al. TEL, a putative tumor suppressor, induces apoptosis and represses transcription of Bcl-XL. J. Biol. Chem. 278, 46378–46386 (2003).
Kim, C. A. et al. Polymerization of the SAM domain of TEL in leukemogenesis and transcriptional repression. EMBO J. 20, 4173–4182 (2001).
Lopez, R. G. et al. TEL is a sequence-specific transcriptional repressor. J. Biol. Chem. 274, 30132–30138 (1999).
Wang, L. C. et al. Yolk sac angiogenic defect and intra-embryonic apoptosis in mice lacking the Ets-related factor TEL. EMBO J. 16, 4374–4383 (1997).
Wang, L. C. et al. The TEL/ETV6 gene is required specifically for hematopoiesis in the bone marrow. Genes Dev. 12, 2392–2402 (1998).
Wang, L. & Hiebert, S. W. TEL contacts multiple co-repressors and specifically associates with histone deacetylase-3. Oncogene 20, 3716–3725 (2001).
Guidez, F. et al. Recruitment of the nuclear receptor corepressor N-CoR by the TEL moiety of the childhood leukemia-associated TEL-AML1 oncoprotein. Blood 96, 2557–2561 (2000).
Fenrick, R. et al. TEL, a putative tumor suppressor, modulates cell growth and cell morphology of ras-transformed cells while repressing the transcription of stromelysin-1. Mol. Cell. Biol. 20, 5828–5839 (2000).
Fisher, M. H. ETV6 germline mutations cause HDAC3/NCOR2 mislocalization and upregulation of interferon response genes. JCI Insight 5, e140332 (2020).
Guenther, L. M. et al. A combination CDK4/6 and IGF1R inhibitor strategy for Ewing sarcoma. Clin. Cancer Res. 25, 1343–1357 (2019).
García-Domínguez, D. J. et al. The combination of epigenetic drugs SAHA and HCI-2509 synergistically inhibits EWS-FLI1 and tumor growth in Ewing sarcoma. Oncotarget 9, 31397–31410 (2018).
Seong, B. K. A. et al. TRIM8 modulates the EWS/FLI oncoprotein to promote survival in Ewing sarcoma. Cancer Cell 39, 1262–1278.e7 (2021).
Tsang, S. M., Oliemuller, E. & Howard, B. A. Regulatory roles for SOX11 in development, stem cells and cancer. Semin. Cancer Biol. 67, 3–11 (2020).
Yang, Z. et al. SOX11: friend or foe in tumor prevention and carcinogenesis? Ther. Adv. Med. Oncol. 11, 1758835919853449 (2019).
Beekman, R., Amador, V. & Campo, E. SOX11, a key oncogenic factor in mantle cell lymphoma. Curr. Opin. Hematol. 25, 299–306 (2018).
Shepherd, J. H. et al. The SOX11 transcription factor is a critical regulator of basal-like breast cancer growth, invasion, and basal-like gene expression. Oncotarget 7, 13106–13121 (2016).
Zvelebil, M. et al. Embryonic mammary signature subsets are activated in Brca1–/– and basal-like breast cancers. Breast Cancer Res. 15, 25 (2013).
Yao, Z. et al. The role of tumor suppressor gene SOX11 in prostate cancer. Tumour Biol. 36, 6133–6138 (2015).
Hide, T. et al. Sox11 prevents tumorigenesis of glioma-initiating cells by inducing neuronal differentiation. Cancer Res. 69, 7953–7959 (2009).
Barretina, J. et al. The Cancer Cell Line Encyclopedia enables predictive modelling of anticancer drug sensitivity. Nature 483, 603–607 (2012).
Liang, H. et al. Solution structure of the ets domain of Fli-1 when bound to DNA. Nat. Struct. Biol. 1, 871–876 (1994).
Boyer, L. A. et al. Core transcriptional regulatory circuitry in human embryonic stem cells. Cell 122, 947–956 (2005).
Boisclair Lachance, J. F., Webber, J. L., Hong, L., Dinner, A. R. & Rebay, I. Cooperative recruitment of Yan via a high-affinity ETS supersite organizes repression to confer specificity and robustness to cardiac cell fate specification. Genes Dev. 32, 389–401 (2018).
Webber, J. L., Zhang, J., Massey, A., Sanchez-Luege, N. & Rebay, I. Collaborative repressive action of the antagonistic ETS transcription factors Pointed and Yan fine-tunes gene expression to confer robustness in Drosophila. Development https://doi.org/10.1242/dev.165985 (2018).
Webber, J. L., Zhang, J., Mitchell-Dick, A. & Rebay, I. 3D chromatin interactions organize Yan chromatin occupancy and repression at the even-skipped locus. Genes Dev. 27, 2293–2298 (2013).
Rebay, I. & Rubin, G. M. Yan functions as a general inhibitor of differentiation and is negatively regulated by activation of the Ras1/MAPK pathway. Cell 81, 857–866 (1995).
O’Neill, E. M., Rebay, I., Tjian, R. & Rubin, G. M. The activities of two Ets-related transcription factors required for Drosophila eye development are modulated by the Ras/MAPK pathway. Cell 78, 137–147 (1994).
Zhang, Y., Ho, T. D., Buchler, N. E. & Gordân, R. Competition for DNA binding between paralogous transcription factors determines their genomic occupancy and regulatory functions. Genome Res. 31, 1216–1229 (2021).
Sizemore, G. M., Pitarresi, J. R., Balakrishnan, S. & Ostrowski, M. C. The ETS family of oncogenic transcription factors in solid tumours. Nat. Rev. Cancer 17, 337–351 (2017).
Lambert, S. A. et al. The human transcription factors. Cell 172, 650–665 (2018).
Kwiatkowski, B. A. et al. The ets family member Tel binds to the Fli-1 oncoprotein and inhibits its transcriptional activity. J. Biol. Chem. 273, 17525–17530 (1998).
Raynaud, S. et al. The 12;21 translocation involving TEL and deletion of the other TEL allele: two frequently associated alterations found in childhood acute lymphoblastic leukemia. Blood 87, 2891–2899 (1996).
Golub, T. R. et al. Fusion of the TEL gene on 12p13 to the AML1 gene on 21q22 in acute lymphoblastic leukemia. Proc. Natl Acad. Sci. USA 92, 4917–4921 (1995).
Surdez, D. et al. STAG2 mutations alter CTCF-anchored loop extrusion, reduce cis-regulatory interactions and EWSR1–FLI1 activity in Ewing sarcoma. Cancer Cell 39, 810–826.e9 (2021).
Lun, A. T. & Smyth, G. K. csaw: a Bioconductor package for differential binding analysis of ChIP-seq data using sliding windows. Nucleic Acids Res. 44, e45 (2016).
Meyers, R. M. et al. Computational correction of copy number effect improves specificity of CRISPR–Cas9 essentiality screens in cancer cells. Nat. Genet. 49, 1779–1784 (2017).
Khoogar, R. et al. Single-cell RNA profiling identifies diverse cellular responses to EWSR1/FLI1 downregulation in Ewing sarcoma cells. Cell Oncol. 45, 19–40 (2022).
Reynolds, C. P. et al. Biological classification of cell lines derived from human extra-cranial neural tumors. Prog. Clin. Biol. Res. 271, 291–306 (1988).
Sanjana, N. E., Shalem, O. & Zhang, F. Improved vectors and genome-wide libraries for CRISPR screening. Nat. Methods 11, 783–784 (2014).
Price, C. et al. Genome-wide interrogation of human cancers identifies EGLN1 dependency in clear cell ovarian cancers. Cancer Res. 79, 2564–2579 (2019).
Malone, C. F. et al. Selective modulation of a pan-essential protein as a therapeutic strategy in cancer. Cancer Discov. 11, 2282–2299 (2021).
Welford, S. M., Hebert, S. P., Deneen, B., Arvand, A. & Denny, C. T. DNA binding domain-independent pathways are involved in EWS/FLI1-mediated oncogenesis. J. Biol. Chem. 276, 41977–41984 (2001).
Buenrostro, J. D., Wu, B., Chang, H. Y. & Greenleaf, W. J. ATAC-seq: a method for assaying chromatin accessibility genome-wide. Curr. Protoc. Mol. Biol. 109, 21 (2015).
Skene, P. J., Henikoff, J. G. & Henikoff, S. Targeted in situ genome-wide profiling with high efficiency for low cell numbers. Nat. Protoc. 13, 1006–1019 (2018).
Skene, P. J. & Henikoff, S. An efficient targeted nuclease strategy for high-resolution mapping of DNA binding sites. eLife 6, e21856 (2017).
Schmid, M., Durussel, T. & Laemmli, U. K. ChIC and ChEC; genomic mapping of chromatin proteins. Mol. Cell 16, 147–157 (2004).
Bolger, A. M., Lohse, M. & Usadel, B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics 30, 2114–2120 (2014).
Langmead, B. & Salzberg, S. L. Fast gapped-read alignment with Bowtie 2. Nat. Methods 9, 357–359 (2012).
Langmead, B., Wilks, C., Antonescu, V. & Charles, R. Scaling read aligners to hundreds of threads on general-purpose processors. Bioinformatics 35, 421–432 (2019).
Li, H. et al. The Sequence Alignment/Map format and SAMtools. Bioinformatics 25, 2078–2079 (2009).
Zhang, Y. et al. Model-based analysis of ChIP-Seq (MACS). Genome Biol. 9, R137 (2008).
Yu, G., Wang, L. G. & He, Q. Y. ChIPseeker: an R/Bioconductor package for ChIP peak annotation, comparison and visualization. Bioinformatics 31, 2382–2383 (2015).
Hahne, F. & Ivanek, R. Visualizing genomic data using Gviz and Bioconductor. Methods Mol. Biol. 1418, 335–351 (2016).
Anders, S. & Huber, W. Differential expression analysis for sequence count data. Genome Biol. 11, R106 (2010).
Chen, K. et al. The overlooked fact: fundamental need for spike-in control for virtually all genome-wide analyses. Mol. Cell. Biol. 36, 662–667 (2015).
Stark, R. & Brown, G. DiffBind: Differential Binding Analysis of ChIP-Seq Peak Data (Bioconductor, 2021).
Yu, F., Sankaran, V. G. & Yuan, G.-C. CUT&RUNTools 2.0: a pipeline for single-cell and bulk-level CUT&RUN and CUT&Tag data analysis. Bioinformatics 38, 252–254 (2021).
Ramírez, F. et al. deepTools2: a next generation web server for deep-sequencing data analysis. Nucleic Acids Res. 44, W160–W165 (2016).
Zhu, L. J. et al. ChIPpeakAnno: a Bioconductor package to annotate ChIP-seq and ChIP-chip data. BMC Bioinformatics 11, 237 (2010).
Battle, A., Brown, C. D., Engelhardt, B. E. & Montgomery, S. B. Genetic effects on gene expression across human tissues. Nature 550, 204–213 (2017).
Dobin, A. et al. STAR: ultrafast universal RNA-seq aligner. Bioinformatics 29, 15–21 (2013).
Li, B. & Dewey, C. N. RSEM: accurate transcript quantification from RNA-Seq data with or without a reference genome. BMC Bioinformatics 12, 323 (2011).
Jiang, L. et al. Synthetic spike-in standards for RNA-seq experiments. Genome Res. 21, 1543–1551 (2011).
Mootha, V. K. et al. PGC-1α-responsive genes involved in oxidative phosphorylation are coordinately downregulated in human diabetes. Nat. Genet. 34, 267–273 (2003).
Subramanian, A. et al. Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles. Proc. Natl Acad. Sci. USA 102, 15545–15550 (2005).
Acknowledgements
D.L. was supported by a NIH NIGMS Medical Scientist Training Program Award (T32GM007753), a T32 Award (T32GM722643), a Harvard Landry Cancer Biology Consortium Fellowship, and a NIH F30 Award (F30CA246925); K.S. by the National Cancer Institute R35 CA210030, R01 CA204915 and a Cancer Moonshot U54 CA231637, a St. Baldrick’s Foundation Robert J. Arceci Innovation Award, the St Jude Children’s Research Hospital Collaborative Research Program, and the Brian MacIsaac Sarcoma Foundation; J.M.E. by The Helen Gurley Brown Presidential Initiative (The Pussycat Foundation); C.F.M. by a Ruth L. Kirschstein National Research Service Award (F32CA243266); S.L. by NCI K99 CA263161 and a Leukemia & Lymphoma Society fellowship; N.V.D. by the Julia’s Legacy of Hope St. Baldrick’s Foundation Fellowship; B.K.A.S. by the Department of Defense PRCRP Horizon Award CA181249; N.S.G. by the Katherine L. and Steven C. Pinard Research Fund and the Hale Center for Pediatric Cancer; and S.L.L. by U54 CA231641. We thank J. Mora for the gift of PDX Ewing cells, J. Perry for administrative support, A. Krivtsov and J. Gadrey for experimental guidance, C. Hatton for bioinformatics assistance, and A. Balboni Iniguez and A. Rubin for mentorship and scientific discussions.
Author information
Authors and Affiliations
Contributions
D.L. and K.S. conceived and designed the experiments, interpreted results and wrote the manuscript. K.N.R. performed computational analyses of the genomics data and interpreted results. J.M.E., C.F.M., S.L., N.W.M. and A.H.S. assisted with experimental design, execution and interpretation. N.V.D. performed analyses of CRISPR–Cas9 screen data. A.M. and A.C. assisted with animal experiments. B.K.A.S. assisted with cloning for the dTAG system. B.A. assisted with cell cycle experiments. J.S.-A., C.T. and A.K.B. performed CUT&Tag, computational analyses and interpretation. N.S.G. provided expertise and materials for establishment of the dTAG system. S.L.L. and M.N.R. provided reagents, resources and interpreted results.
Corresponding author
Ethics declarations
Competing interests
K.S. receives grant funding from the DFCI/Novartis Drug Discovery Program and from KronosBio, is a member of the SAB and has stock options with Auron Therapeutics and has consulted for AstraZeneca. S.L.L. declares a competing interest as a member of the advisory board for Salarius Pharmaceuticals and is a listed the inventor on US patent no. US 7,939,253 B2, ‘Methods and compositions for the diagnosis and treatment of Ewing’s sarcoma’ and US patent no. US 8,557,532, ‘Diagnosis and treatment of drug-resistant Ewing’s sarcoma.’ N.V.D. is an employee of Genentech, a member of the Roche Group. N.S.G. is a founder, scientific advisory board member and equity holder in Syros, C4, Allorion, Jengu, B2S, Inception, EoCys, Larkspur (board member) and Soltego (board member). The remaining authors declare no competing interests.
Peer review
Peer review information
Nature Cell Biology thanks Thomas Grunewald, Tiebang Kang, and Aykut Uren for their contribution to the peer review of this work.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Extended data
Extended Data Fig. 1 The repressive ETS transcription factor, ETV6, is a selective dependency in Ewing sarcoma cells.
a. Volcano plot of genes in the DepMap screen in Ewing sarcoma cell lines (n = 14) compared to all other cell lines (n = 782). Effect size (x-axis) indicates the impact of gene deletion on growth. Log10(q-value) (y-axis) indicates specificity of dependency in Ewing sarcoma. Blue marks known selective TF dependencies. b. Venn diagram of Ewing sarcoma selective TF dependencies in the DepMap, GeCKO, and Sanger CRISPR/Cas9 screens. c. Scaled rank plot depicting all cell lines in DepMap. Gene effect (x-axis) measures ETV6 deletion impact in each cell line. Ewing lines are enlarged and color-coded by specific EWS/ETS fusion (EWS/FLI n = 11, EWS/ERG n = 2, EWS/FEV n = 1). d and e. Expression (log2(TPM + 1); TPM, transcripts per million) of ETV6 (d) and BCL11B, ZEB2 (e) in primary tumors (Treehouse Childhood Cancer Initiative37, Ewing sarcoma n = 85; other n = 12,571). Points show the full range between maxima and minima. Boxes show values for the 25th and 75th percentiles; middle line shows median (50th percentile). Whiskers extend no further than 1.5 times the inter-quartile range. Gene expression in Ewing sarcoma was different from other tumor types (Welch 2−sample T-test, ETV6 p = 2.067 × 10−15, ZEB2 p = 4.995 × 10−31, BCL11B p = 1.515 × 10−37). f. Line graphs depicting mean cell viability ±SEM in Ewing sarcoma cell lines (n = 8 biological replicates). ETV6 knock-out cells exhibited lower viability than control (2−way ANOVA, Dunnett multiple comparisons EW8 p-adj<0.0001; TC32 p-adj<0.0001). Represents two independent experiments. Westerns show ETV6 and GAPDH loading control (kDa, kiloDaltons). g. Bar plots showing mean ±SEM number of methylcellulose cell colonies. ETV6 knock-out samples formed fewer colonies (EW8 two-tailed t-test, n = 4 biological replicates, p = 0.0001; TC32 two-tailed t-test, n = 4 biological replicates, p = 0.0052). h. (Left) Bar plot showing mean ±SEM number of EW8 ETV6-dTAG cell colonies. dTAGV-1-treated cells formed fewer colonies than control (n = 3 biological replicates, two-tailed t-test, p < 0.0001). (Right) Western blot of dTAG cells shown here and in Fig. 1f.
Extended Data Fig. 2 ETV6 promotes growth in Ewing sarcoma cells.
a. Cell cycle analysis in EW8 ETV6-dTAG cells treated for 96 hours with DMSO or dTAGV-1 (n = 3 biological replicates, two-tailed t-test, Sidak’s multiple comparisons, G1/G0 p-adjust=0.002, S p-adj=1.8 × 10−5). b. (Left) Cell cycle analysis in A673 cells transduced with CRISPR/Cas9 constructs (n = 3 biological replicates, two-tailed t-test, Sidak’s multiple comparisons, G1/G0 p-adjust=2.25 × 10−11, S p-adj=4.47 × 10−11). (Right) Western blot of ETV6. c. Western blot of A673 and EW8 ETV6-dTAG cells treated with 1 µM cisplatin (as a positive control) for 24 hours or with DMSO or dTAGV-1 for 72 hours. Represents one independent experiment. d. (Left) Mean subcutaneous tumor volume in cubic millimeters ±SEM (n = 5, biological replicates) in mice implanted with CRISPR/Cas9-transduced TC32 cells (2-way ANOVA, p = 0.0191). (Right) Western blot showing ETV6. e. Western blots detecting exogenous expression of wild-type ETV6 protein (ETV6-WT) or mutant ETV6 harboring an ETS DNA binding domain deletion (ETV6-ΔETS) in cytosolic (Cyt), nuclear (Nuc), and chromatin (Chrom) subcellular fractions in A673 (left) and EW8 (right) cell lines. Vinculin and Histone 3 demonstrate the quality of fractionation. Represents one independent experiment. f. Line graphs depicting mean cell viability ±SEM (n = 4 biological replicates) in A673 and EW8 cells transduced with control (sgChr2.2, black) or ETV6-targeting (sgETV6, red) CRISPR/Cas9 constructs and expressing doxycycline-induced wild-type ETV6 (ETV6-WT) or ETS DNA binding domain-deleted ETV6 (ETV6-ΔETS). Cells were treated with either vehicle (circles, solid lines) or doxycycline (squares, dashed lines). Knockout of ETV6 alone reduced cell viability (2-way ANOVA, Tukey’s multiple comparisons, A673 p-adj=0.0169; EW8 p-adj=0.0060). ETV6 knockout in combination with wild-type ETV6 exogenous expression did not alter cell viability (ns, A673 p-adj=0.9769; EW8 p-adj=0.0972), but ETV6 knockout in combination with DBD-deleted ETV6 expression reduced cell viability (A673 p-adj=0.0003; EW8 p-adj<0.0001).
Extended Data Fig. 3 Acute loss of ETV6 leads to increased EWS/FLI binding.
a. (Left) Heatmaps of endogenous ETV6 CUT&Tag in parental A673 cells using two commercial antibodies and anti-ETV6-FKBP12F36V-HA ChIP-seq in A673 ETV6-dTAG cells. Shown are ETV6 consensus binding sites sub-plotted by TSS overlap, ranked by height. (Right) Venn diagram showing that consensus binding sites were detected in at least 2 data sets. b. Heatmaps at consensus binding sites detected by anti-HA ChIP-seq in EW8 ETV6-dTAG cells at 24 hours DMSO or dTAGV-1 treatment. c. (Left to right): Anti-HA ChIP-seq in EW8 ETV6-dTAG cells, anti-FLI1 ChIP-seq in parental EW8 cells, anti-H3K27ac ChIP-seq in parental EW8 cells. d. Venn diagram showing overlap between 718 ETV6-FKBP12F36V-HA binding sites in EW8 ETV6-dTAG cells and 16,525 EWS/FLI binding sites in parental EW8 cells. e. Log2(TPM + 1) expression of ETV6 in Ewing sarcoma cell lines (n = 21) and the K-562 leukemia cell line (n = 1) from CCLE. Specific cell lines are in red. Points show the full range between maxima and minima. Boxes show values for the 25th and 75th percentiles; middle line shows median (50th percentile). Whiskers extend no further than 1.5 times the inter-quartile range. f. Metaplots of H3K27ac abundance at regions shown in Fig. 3g, h. g. Stacked column plot showing varying lengths of tandem 5′-GGAA-3′ motif repeats occurring at genomic regions exhibiting significantly altered EWS/FLI binding (CSAW using the edgeR generalized linear model; p < 0.05) in A673 or EW8 dTAG cells at 72 hours. The number of peaks in each data set is listed. No genomic regions lost EWS/FLI binding in A673 dTAG at 6 hours. At 72 hours, regions that gained EWS/FLI binding were enriched for repeats of 2, 3, and 4 compared to regions that lost EWS/FLI binding (Fisher exact test, A673 p = 6.974 × 10−15, EW8 p = 1.15 × 10−15).
Extended Data Fig. 4 ETV6 is primarily a transcriptional repressor in Ewing sarcoma.
a. Western blot of EW8 ETV6-dTAG cells treated for 6 hours with DMSO or dTAGV-1. Represents one experiment. b. t-SNE of RNA-seq expression in parental Ewing sarcoma cell lines from the Cancer Cell Line Encyclopedia (CCLE)66 and in A673 ETV6-dTAG and EW8 ETV6-dTAG cells treated for 6 hours with DMSO. EW8 parental and EW8 ETV6-dTAG cell samples are blue. A673 parental and A673 ETV6-dTAG cell samples are red. c. Gene set enrichment analysis (GSEA) of ETV6-repressed genes identified at 6 hours in A673 ETV6-dTAG compared to EW8 ETV6-dTAG RNA-seq data at 6 hours (top) and vice versa (bottom). d. Western blot of A673 parental cells transduced with sgChr2.2, sgLacZ, and sgETV6 CRISPR/Cas9 constructs. Represents one independent experiment. e. RNA-seq mean Log2(TPM + 1) for 81 of 85 ETV6-repressed genes, compared to all other genes, across Ewing sarcoma cell lines in CCLE66. Boxes show the values for the 25th and 75th percentiles; middle line shows the median (50th percentile). Whiskers extend up from the 75th percentile and down from the 25th percentile, no further than 1.5*IQR (where IQR is the inter−quartile range, or distance between the 25th and 75th percentiles). Points show the full range between maxima and minima. f. RNA-seq heatmap of ETV6-activated core enrichment genes, identified by GSEA described in Fig. 3e, in A673 (68 genes) and EW8 dTAG cells (126 genes). g. Combined enrichment plot of MSigDB c2 pathways enriched in ETV6-activated genes in both dTAG models (24-hour RNA-seq, hypergeometric enrichment test; p < 0.05). Gene sets are ranked by significance; missing dots indicate insignificance. Dot size indicates the number of ETV6-activated genes at 6, 24, and 72 hours also in the gene set (33, 130, and 543 genes, respectively). ‘EWS/FLI’, ‘HDAC’, and ‘Lineage’ gene sets characterize EWS/FLI-regulated genes, histone deacetylase enzyme-regulated genes, and genes exhibiting tissue-specific expression, respectively.
Extended Data Fig. 5 ETV6 constrains EWS/FLI-induced gene expression.
a. Scatter plot comparing log2 fold-change in EWS/FLI binding to log2 fold-change in H3K27ac abundance detected by ChIP-seq at 6 hours in EW8 ETV6-dTAG cells (n = 2 biological replicates). Pearson correlation value (R) is shown. b. Plot comparing genes mapped from significantly altered EWS/FLI binding sites (n = 2 biological replicates) to Log2 Fold-change in expression measured by RNA-seq in EW8 ETV6-dTAG cells (n = 3 biological replicates) at 72 hours. Gray boxes indicate median and first and third quartiles. Red diamond and error bars indicate mean expression ±standard deviation (FLI1 Up n = 404 mean=0.3; FLI1 Down n = 1,042 mean = -0.062; No Change n = 8,252 mean=0.033). FLI1 Up vs. No Change p = 1.99e-12, FLI1 Down vs. No Change p = 2.81e-18, paired t-test, Benjamini-Hochberg corrections. c and d. Gviz-generated views of the SEMA5B (c) and BCL11B (d) loci. ETV6 tracks show CUT&Tag of ETV6-FKBP12F36V-HA in A673 ETV6-dTAG cells at 24 hours. FLI1 tracks show ChIP-seq for EWS/FLI performed at 6 hours, H3K27ac tracks show ChIP-seq for H3K27ac at 6 hours, and ATAC tracks show ATAC-seq at 72 hours. FLI1 (EW8) tracks show ChIP-seq for EWS/FLI at 6 hours in EW8 ETV6-dTAG cells. GGAA tracks indicate locations of tandem GGAA motif repeats.
Extended Data Fig. 6 ETV6 competes with EWS/FLI for binding in clinically relevant Ewing sarcoma models.
a and b. Western blot of cells from the newly derived Ewing sarcoma cell lines, PEDS0009 (a) and PEDS0010 (b), transduced with CRISPR/Cas9 constructs. Represents one independent experiment. c. (Left) Western blot of cells from the Ewing sarcoma patient-derived xenograft, ES-PDX-001, transduced with CRISPR/Cas9 constructs. (Right) Line graphs depicting mean cell viability ±SEM in vitro (n = 6 biological replicates). Knockout of ETV6 and EWS/FLI reduced cell growth compared to sgChr2.2 control (2-way ANOVA, Tukey’s multiple comparisons, p-adj = <0.0001). d. Heatmaps showing 3-kilobase (kb) windows centered at 3,309 consensus ETV6 binding sites, sub-plotted by overlap within 2.5 kb of transcription start sites (TSS). Shown are ETV6 peaks detected in sgChr2.2 control and ETV6-knockout PEDS0009 cells profiled by CUT&RUN. e. Scatter plots of log2 fold-change in EWS/FLI binding in EW8 ETV6-dTAG cells following 72 hours of treatment with DMSO or dTAGV-1 (y-axis) and in control and ETV6-knockout PEDS0009 cells profiled by FLI1 CUT&RUN (x-axis). Pearson correlation value (R) is shown. f. Heatmaps of FLI1 CUT&RUN performed in control and ETV6-knockout PEDS0009 cells. Loci shown were defined in Fig. 2g as regions that exhibited increased EWS/FLI binding upon ETV6 loss in EW8 ETV6-dTAG cells. g. Stacked column plot showing varying lengths of tandem 5′-GGAA-3′ motif repeats occurring at genomic regions exhibiting significantly (CSAW; p < 0.05) increased (FLI1 Up) or decreased (FLI1 Down) EWS/FLI occupancy in PEDS0009 cells upon ETV6 knockout. FLI1 Up regions were more likely than FLI1 Down regions to contain GGAA repeats of 2, 3, or 4 (Fisher Exact test, p = 5.186e-11).
Extended Data Fig. 7 Knock-out of the ETV6-repressed gene, SOX11, rescues the phenotype of ETV6 loss.
a. Top 100 significantly enriched MSigDB c5 Gene Ontology gene set categories in ETV6-repressed genes (24-hour RNA-seq, A673 ETV6-dTAG, parentheses show gene set number; full list in Supplementary Table 19). b. Line graphs depicting mean viability ±SEM (n = 8 biological replicates) in A673 (left) and EW8 (right) cells exogenously expressing wild-type SOX11 (SOX11 WT, red), DBD-deleted mutant SOX11 (SOX11 DBD, gray), or empty pLX_TRC307 vector control (307 C, black). Wild-type SOX11 expression reduced viability compared to control (2-way ANOVA, Dunnett’s multiple comparisons, p-adj<0.0001). Western blots show SOX11 and Vinculin loading control. c. RNA-seq log2(TPM + 1) ±SEM of SOX11 expression in A673 ETV6-dTAG cells (left) (n = 3 biological replicates) and in CRISPR/Cas9-perturbed parental A673 cells (right) (controls n = 2 biological replicates; ETV6-knockout n = 3 biological replicates). d. Western blot of CRISPR/Cas9-perturbed A673 cells. e. Mean viability ±SEM (n = 6 biological replicates) of A673 cells shown in d in vitro. ETV6 knockout (red open and closed circles) reduced viability compared to control (black circles) (2-way ANOVA, Tukey’s multiple comparisons, p-adj<0.0001). Simultaneous ETV6 and SOX11 knockout (blue star) did not reduce viability compared to SOX11 knockout alone (gray square and circle) (sgSOX11 + sgETV6 vs. sgSOX11+sgLacZ, not significant ‘ns’ p-adj=0.7343) and exhibited greater viability than ETV6 knockout alone (p-adj<0.0001). f. Mean methylcellulose colony number ±SEM (n = 3 biological replicates) formed by A673 cells shown in d and e. ETV6 knockout reduced colony number compared to control (2-way ANOVA, Tukey’s multiple comparisons, sgLacZ+sgETV6 vs. sgLacZ+sgChr2.2 p-adj<0.0001, sgChr2.2 + sgETV6 vs. sgLacZ+sgChr2.2 p-adj=0.0011). Simultaneous ETV6 and SOX11 knockout did not reduce colony number compared to SOX11 knockout alone (sgSOX11 + sgLacZ vs. sgSOX11 + sgETV6, ns, p-adj=0.9984) and increased colonies compared to ETV6 knock-out alone (sgSOX11 + sgETV6 vs. sgLacZ+sgETV6 p-adj<0.0001; vs. sgChr2.2 + sgETV6 p-adj=0.0007). g. Western blot of rhabdomyosarcoma RD cells expressing doxycycline-inducible HA-tagged GFP, wild-type EWS/FLI, or DNA binding-incompetent R340N mutant EWS/FLI, in combination with CRISPR/Cas9 perturbation. One independent experiment.
Supplementary information
Supplementary Fig. 1
Example of the flow gating strategy.
Supplementary Tables 1–20
File includes all Supplementary Tables referenced in the text. Supplementary Tables 1–3. Tumour-type-specific dependencies and gene expression. List of genes interrogated in the Pediatric Cancer DepMap CRISPR–Cas9 screen in Ewing sarcoma (Table 1), neuroblastoma (Table 2) and rhabdomyosarcoma (Table 3) and their expression in the Cancer Cell Line Encyclopedia (CCLE). The columns include gene name, effect size measuring the degree of dependency in the screen, q value of the comparison of dependency in each tumour type against all other tumour types in the screen and q value of the comparison of expression in each tumour type against all other tumour types in CCLE. Supplementary Table 4. EWS–FLI-regulated genes do not consistently include ETV6. The columns list, from left to right, (1) the names of MSigDB c2 gene sets characterizing EWS–FLI-regulated genes, (2) the total number of genes in each gene set and (3) whether each gene set included the gene ETV6. Supplementary Table 5. Expression of ETV6, BCL11B and ZEB2 in primary tumour samples. List of primary tumour samples from the Treehouse Childhood Cancer Initiative and each sample’s expression of the genes ETV6, BCL11B and ZEB2. Samples are marked as being either Ewing sarcoma samples or other. Supplementary Table 6. ETV6-repressed genes defined by dTAG at 6 h. List of genes that were significantly upregulated in RNA-seq at 6 h following treatment with 1 μM dTAGV-1, compared to DMSO, in both A673 ETV6–dTAG cells and EW8 ETV6–dTAG cells (DESeq2 Wald test Benjamini–Hochberg (BH) adjusted P value; P adjust < 0.05). The columns list the gene symbols and their DESeq2 base mean (mean normalized counts of all samples), DESeq2 log2(fold change) (shrunken apeglm method), DESeq2 standard error, DESeq2 Wald test P value and DESeq2 BH FDR-adjusted P value in both models. Genes are ranked by log2(fold change) in the A673 data. Supplementary Table 7. ETV6-repressed genes defined by dTAG at 24 h. List of genes that were significantly upregulated in RNA-seq at 24 h following treatment with 1 μM dTAGV-1, compared to DMSO, in both A673 ETV6-dTAG cells and EW8 ETV6–dTAG cells (DESeq2 Wald test BH-adjusted P value; P adjust < 0.05). The columns list the gene symbols and their DESeq2 base mean (mean normalized counts of all samples), DESeq2 log2(fold change) (shrunken apeglm method), DESeq2 standard error, DESeq2 Wald test P value and DESeq2 BH FDR-adjusted P value in both models. Genes are ranked by log2(fold change) in the A673 data. Supplementary Table 8. ETV6-repressed genes defined by dTAG at 72 h. List of genes that were significantly upregulated in RNA-seq at 72 h following treatment with 1 μM dTAGV-1, compared to DMSO, in both A673 ETV6–dTAG cells and EW8 ETV6–dTAG cells (DESeq2 Wald test BH-adjusted P value; P adjust < 0.05). The columns list the gene symbols and their DESeq2 base mean (mean normalized counts of all samples), DESeq2 log2(fold change) (shrunken apeglm method), DESeq2 standard error, DESeq2 Wald test P value and DESeq2 BH FDR-adjusted P value in both models. Genes are ranked by log2(fold change) in the A673 data. Supplementary Table 9. MSigDB c2 curated gene sets enriched in 6 h ETV6-repressed genes. List of MSigDB c2 curated gene sets enriched in 6 h ETV6-repressed genes shown in Supplementary Table 6. Enrichment is determined by an over-representation analysis with P values based on the hypergeometric distribution (one-sided version of Fisher’s exact test). The columns list the Gene Set ID, gene ratio (fraction of repressed genes in set), bg ratio (background gene ratio), one-sided Fisher’s exact test P value, BH FDR-adjusted P value, q value and the number of genes scoring in each gene set. Gene sets are ranked by q value. Supplementary Table 10. MSigDB c2 curated gene sets enriched in 24 h ETV6-repressed genes. List of MSigDB c2 curated gene sets enriched in 24 h ETV6-repressed genes shown in Supplementary Table 7. Enrichment is determined by an over-representation analysis with P values based on the hypergeometric distribution (one-sided version of Fisher’s exact test). The columns list the Gene Set ID, gene ratio (fraction of repressed genes in set), bg ratio (background gene ratio), one-sided Fisher’s exact test P value, BH FDR-adjusted P value, q value and the number of genes scoring in each gene set. Gene sets are ranked by q value. Supplementary Table 11. MSigDB c2 curated gene sets enriched in 72 h ETV6-repressed genes. List of MSigDB c2 curated gene sets enriched in 72 h ETV6-repressed genes shown in Supplementary Table 8. Enrichment is determined by an over-representation analysis with P values based on the hypergeometric distribution (one-sided version of Fisher’s exact test). The columns list the Gene Set ID, gene ratio (fraction of repressed genes in set), bg ratio (background gene ratio), one-sided Fisher’s exact test P value, BH FDR-adjusted P value, q value and the number of genes scoring in each gene set. Gene sets are ranked by q value. Supplementary Table 12. ETV6-activated genes defined by dTAG at 6 h. List of genes that were significantly downregulated in RNA-seq at 6 h following treatment with 1 μM dTAGV-1, compared to DMSO, in both A673 ETV6-dTAG cells and EW8 ETV6–dTAG cells (DESeq2 Wald test BH-adjusted P value; P adjust < 0.05). The columns list the gene symbols and their DESeq2 base mean (mean normalized counts of all samples), DESeq2 log2(fold change) (shrunken apeglm method), DESeq2 standard error, DESeq2 Wald test P value and DESeq2 BH FDR-adjusted P value in both models. Genes are ranked by log2(fold change) in the A673 data. Supplementary Table 13. ETV6-activated genes defined by dTAG at 24 h. List of genes that were significantly downregulated in RNA-seq at 24 h following treatment with 1 μM dTAGV-1, compared to DMSO, in both A673 ETV6–dTAG cells and EW8 ETV6–dTAG cells (DESeq2 Wald test BH-adjusted P value; P adjust < 0.05). The columns list the gene symbols and their DESeq2 base mean (mean normalized counts of all samples), DESeq2 log2(fold change) (shrunken apeglm method), DESeq2 standard error, DESeq2 Wald test P value and DESeq2 BH FDR-adjusted P value in both models. Genes are ranked by log2(fold change) in the A673 data. Supplementary Table 14. ETV6-activated genes defined by dTAG at 72 h. List of genes that were significantly downregulated in RNA-seq at 72 h following treatment with 1 μM dTAGV-1, compared to DMSO, in both A673 ETV6-dTAG cells and EW8 ETV6–dTAG cells (DESeq2 Wald test BH-adjusted P value; P adjust < 0.05). The columns list the gene symbols and their DESeq2 base mean (mean normalized counts of all samples), DESeq2 log2(fold change) (shrunken apeglm method), DESeq2 standard error, DESeq2 Wald test P value and DESeq2 BH FDR-adjusted P value in both models. Genes are ranked by log2(fold change) in the A673 data. Supplementary Table 15. MSigDB c2 curated gene sets enriched in 6 h ETV6-activated genes. List of MSigDB c2 curated gene sets enriched in 6 h ETV6-activated genes shown in Supplementary Table 12. Enrichment is determined by an over-representation analysis with P values based on the hypergeometric distribution (one-sided version of Fisher’s exact test). The columns list the Gene Set ID, gene ratio (fraction of repressed genes in set), bg ratio (background gene ratio), one-sided Fisher’s exact test P value, BH FDR-adjusted P value, q value and the number of genes scoring in each gene set. Gene sets are ranked by q value. Supplementary Table 16. MSigDB c2 curated gene sets enriched in 24 h ETV6-activated genes. List of MSigDB c2 curated gene sets enriched in 24 h ETV6-activated genes shown in Supplementary Table 13. Enrichment is determined by an over-representation analysis with P values based on the hypergeometric distribution (one-sided version of Fisher’s exact test). The columns list the Gene Set ID, gene ratio (fraction of repressed genes in set), bg ratio (background gene ratio), one-sided Fisher’s exact test P value, BH FDR-adjusted P value, q value and the number of genes scoring in each gene set. Gene sets are ranked by q value. Supplementary Table 17. MSigDB c2 curated gene sets enriched in 72 h ETV6-activated genes. List of MSigDB c2 curated gene sets enriched in 72 h ETV6-activated genes shown in Supplementary Table 14. Enrichment is determined by an over-representation analysis with P values based on the hypergeometric distribution (one-sided version of Fisher’s exact test). The columns list the Gene Set ID, gene ratio (fraction of repressed genes in set), bg ratio (background gene ratio), one-sided Fisher’s exact test P value, BH FDR-adjusted P value, q value and the number of genes scoring in each gene set. Gene sets are ranked by q value. Supplementary Table 18. MSigDB c5 gene ontology (GO) gene sets enriched in 24 h ETV6-repressed genes. List of MSigDB c5 curated gene sets enriched in 24 h ETV6-repressed genes shown in Supplementary Table 7. Enrichment is determined by an over-representation analysis with P values based on the hypergeometric distribution (one-sided version of Fisher’s exact test). The gene sets are ranked by q value. The columns list the rank, Gene Set ID, the assigned biological category (displayed in Extended Data Fig. 7a), the gene ratio (fraction of repressed genes in set), bg ratio (background gene ratio), one-sided Fisher’s exact test P value, BH FDR-adjusted P value, q value and the number of genes scoring in each dataset. Supplementary Table 19. MSigDB c5 gene GO gene sets containing SOX11 in ETV6-repressed genes. The first, second and third columns list MSigDB c5 GO gene sets that contained the gene SOX11 and were significantly enriched (one-sided Fisher’s exact test; P < 0.05) in 6 h, 24 h and 72 h ETV6-repressed genes, respectively. Supplementary Table 20. Expression of ETS family TF genes after ETV6 loss. RNA-seq expression and log2(fold change) of ETS family TF genes identified in Lambert et al., 2018 in A673 ETV6–dTAG and EW8 ETV6–dTAG cells at 6, 24 and 72 h following DMSO or 1 μM dTAGV-1 treatment. The columns list the gene symbols and their DESeq2 base mean (mean of normalized counts of all samples), DESeq2 log2(fold change) (shrunken apeglm method), DESeq2 Wald test P value and DESeq2 BH FDR-adjusted P value.
Source data
Source Data for Fig. 1
Statistical source data analysed in Fig. 1.
Source Data for Fig. 4
Statistical source data analysed in Fig. 4.
Source Data for Fig. 5
Statistical source data analysed in Fig. 5.
Source Data for Fig. 6
Statistical source data analysed in Fig. 6.
Extended Data Fig. 1
Statistical source data analysed in Extended Data Fig. 1.
Extended Data Fig. 2
Statistical source data analysed in Extended Data Fig. 2.
Extended Data Fig. 6
Statistical source data analysed in Extended Data Fig. 6.
Extended Data Fig. 7
Statistical source data analysed in Extended Data Fig. 7.
Fig. 1 raw western blots
Unprocessed westerns shown in Fig. 1.
Fig. 4 raw western blots
Unprocessed westerns shown in Fig. 4.
Fig. 6 raw western blots
Unprocessed westerns shown in Fig. 6.
Extended Data Fig. 1
Unprocessed westerns shown in Extended Data Fig. 1.
Extended Data Fig. 2
Unprocessed westerns shown in Extended Data Fig. 2.
Extended Data Fig. 4
Unprocessed westerns shown in Extended Data Fig. 4.
Extended Data Fig. 6
Unprocessed westerns shown in Extended Data Fig. 6.
Extended Data Fig. 7
Unprocessed westerns shown in Extended Data Fig. 7.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Lu, D.Y., Ellegast, J.M., Ross, K.N. et al. The ETS transcription factor ETV6 constrains the transcriptional activity of EWS–FLI to promote Ewing sarcoma. Nat Cell Biol 25, 285–297 (2023). https://doi.org/10.1038/s41556-022-01059-8
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1038/s41556-022-01059-8