BMC Bioinformatics: CancerSiamese: one-shot learning for predicting primary and metastatic tumor types unseen during model training (Chen Lab)

Abstract

Background

The state-of-the-art deep learning-based cancer type prediction can only predict cancer types whose samples are available during the training where the sample size is commonly large. In this paper, we consider how to utilize the existing training samples to predict cancer types unseen during the training. We hypothesize the existence of a set of type-agnostic expression representations that define the similarity/dissimilarity between samples of the same/different types and propose a novel one-shot learning model called CancerSiamese to learn this common representation. CancerSiamese accepts a pair of query and support samples (gene expression profiles) and learns the representation of similar or dissimilar cancer types through two parallel convolutional neural networks joined by a similarity function.

Results

We trained CancerSiamese for cancer type prediction for primary and metastatic tumors using samples from the Cancer Genome Atlas (TCGA) and MET500. Network transfer learning was utilized to facilitate the training of the CancerSiamese models. CancerSiamese was tested for different N-way predictions and yielded an average accuracy improvement of 8% and 4% over the benchmark 1-Nearest Neighbor (1-NN) classifier for primary and metastatic tumors, respectively. Moreover, we applied the guided gradient saliency map and feature selection to CancerSiamese to examine 100 and 200 top marker-gene candidates for the prediction of primary and metastatic cancers, respectively. Functional analysis of these marker genes revealed several cancer-related functions between primary and metastatic tumors.

Conclusion

This work demonstrated, for the first time, the feasibility of predicting unseen cancer types whose samples are limited. Thus, it could inspire new and ingenious applications of one-shot and few-shot learning solutions for improving cancer diagnosis, prognostic, and our understanding of cancer.

Read Full Text

_________________________________________________________

Since 2004, UT Health San Antonio, Greehey Children’s Cancer Research Institute’s (Greehey CCRI) mission has been to advance scientific knowledge relevant to childhood cancer, contribute to the understanding of its causes, and accelerate the translation of knowledge into novel therapies. Through discovery, development, and dissemination of new scientific knowledge, Greehey CCRI strives to have a national and global impact on childhood cancer. Our mission consists of three key areas — research, clinical, and education.

Stay connected with the Greehey CCRI on FacebookTwitterLinkedIn, and Instagram.

_________________________________________________________

 

 

Article Categories: Research Paper

Since 2004, UT Health San Antonio, Greehey Children’s Cancer Research Institute’s (Greehey CCRI) mission has been to advance scientific knowledge relevant to childhood cancer, contribute to understanding its causes, and accelerate the translation of knowledge into novel therapies. Greehey CCRI strives to have a national and global impact on childhood cancer by discovering, developing, and disseminating new scientific knowledge. Our mission consists of three key areas — research, clinical, and education.

Stay connected with the Greehey CCRI on Facebook, Twitter, LinkedIn, and Instagram.