Intruduction

Recent research demonstrates that circulating tumor DNA (ctDNA) has the potential to revolutionize screening, diagnosis, and treatment of cancer by enabling a non-invasive liquid biopsy. ctDNA fragments bearing cancer-specific methylation patterns have been investigated as feasible biomarkers in cancers because they can be easily discriminated from those of normal cells; however, currently there are few validated methylation markers available, with notable exceptions such as SEPT9 in colorectal cancer. DNA methylation profiling offers several advantages over somatic mutation analysis for cancer detection, including a broader distribution across hundreds of target loci in the disease state and multiple altered CpG sites within each targeted genomic region and thus higher clinical sensitivity and dynamic range. A recent study identified 10 CpG sites for diagnosis and prognosis of hepatocellular carcinoma using DNA methylation of cfDNA. However, the application of liquid biopsies to other cancer types remains a challenge.

Our approach is broadly built on the observation that epigenetic restriction of extraembryonic lineages mirrors the somatic transition to cancer [Smith and Shi, Nature 2017]. We systematically compared DNA methylation profiles of extraembryonic ectoderm (ExE) and epiblast (Epi), and found a signature of CpG Islands (CGI) that are hyper-methylated in ExE (ExE hyper). De novo methylation of CGIs is a general feature of in vitro tissue culture, numerous cancer cell lines, and primary tumors, suggesting that somatic cells remain vulnerable to acquiring this state. To investigate a possible link between this early developmental program and the reemergence of this landscape in cancer, we mapped orthologous CGIs from mouse to human and directly compared ExE hyper CGIs across patient-matched DNA methylation profiles acquired as part of The Cancer Genome Atlas (TCGA) project, as well as other samples profiled by the ENCODE/Roadmap Epigenomics Project. Of the 15 TCGA tumor types with sufficient numbers of patient-matched normal biopsied tissues, we found that 14 significantly methylate ExE hyper CGIs (Fig. 1). The signal is surprisingly robust, such that the ExE hyper CGI set segregates tumor and normal tissue when measured as a feature across patients. Moreover, ExE and tumor methylated CGIs significantly overlap – 84% are shared with at least one other cancer type – and are conserved across cancer types. Furthermore, ExE hyper methylation seems to be an early event during disease progression and can be found in pre-cancer stages. For instance, this signature can be found in ductal carcinoma in situ (DCIS) and monoclonal B-cell lymphocytosis (MBL), which are pre-cancer stages of breast cancer and Chronic Lymphocytic Leukemia (CLL), respectively (Fig. 1). We believe that our signature is sufficiently sensitive to detect most cancer types and represents a universal signature for for cancer diagnostics.

CpG islands protected by this patent

We have filed a list of CpG Islands that are hypermethylated in either mouse ExE or human placenta. [CGIs for Universal Early Cancer Diagnostics]. It’s publicly available through Google Patents.

Session info

sessionInfo()
## R version 3.5.1 (2018-07-02)
## Platform: x86_64-conda_cos6-linux-gnu (64-bit)
## Running under: Red Hat Enterprise Linux Server 7.7 (Maipo)
## 
## Matrix products: default
## BLAS: /sibcb2/bioinformatics/software/BcbioNG/anaconda/lib/R/lib/libRblas.so
## LAPACK: /sibcb2/bioinformatics/software/BcbioNG/anaconda/lib/R/lib/libRlapack.so
## 
## locale:
## [1] C
## 
## attached base packages:
## [1] stats     graphics  grDevices utils     datasets  methods   base     
## 
## other attached packages:
## [1] rmarkdown_2.0
## 
## loaded via a namespace (and not attached):
##  [1] compiler_3.5.1  magrittr_1.5    tools_3.5.1     htmltools_0.5.0
##  [5] yaml_2.2.0      stringi_1.4.3   knitr_1.26      stringr_1.4.0  
##  [9] xfun_0.18       digest_0.6.23   rlang_0.4.6     evaluate_0.14

References