Bioinformatics: A novel significance score for gene selection and ranking

Yufei Xiao, Tzu-Hung Hsiao, Uthra Suresh, Hung-I Harry Chen, Xiaowu Wu, Steven E. Wolf, Yidong Chen

Abstract

Motivation: When identifying differentially expressed (DE) genes from high-throughput gene expression measurements, we would like to take both statistical significance (such as P-value) and biological relevance (such as fold change) into consideration. In gene set enrichment analysis (GSEA), a score that can combine fold-change and P-value together is needed for better gene ranking.

Results: We defined a gene significance score π-value by combining expression fold change and statistical significance (P-value), and explored its statistical properties. When compared to various existing methods, the π-value-based approach is more robust in selecting DE genes, with the largest area under the curve in its receiver operating characteristic curve. We applied π-value to GSEA and found it comparable to P-value and t-statistic-based methods, with added protection against false discovery in certain situations. Finally, in a gene functional study of breast cancer profiles, we showed that using π-value helps elucidating otherwise overlooked important biological functions.

Learn More Button

Article Categories: All News, Research Paper

Since 2004, UT Health San Antonio, Greehey Children’s Cancer Research Institute’s (Greehey CCRI) mission has been to advance scientific knowledge relevant to childhood cancer, contribute to understanding its causes, and accelerate the translation of knowledge into novel therapies. Greehey CCRI strives to have a national and global impact on childhood cancer by discovering, developing, and disseminating new scientific knowledge. Our mission consists of three key areas — research, clinical, and education.

Stay connected with the Greehey CCRI on Facebook, Twitter, LinkedIn, and Instagram.