2023 Vol. 9, No. 2

Cover Story

Identifying cancer-related differentially expressed genes provides sig-nificant information for diagnosing tumors, predicting prognoses, and effective treatments. Recently, deep learning methods have been used to perform gene differential expression analysis using microarray-based high- throughput gene profiling and have achieved good results. In this study, the authors proposed a new robust multiple-datasetsbased semi-supervised learning model, MSSL, to perform tumor type classification and candidate    cancer-specific biomarkers discovery across multiple tumor types and mul-tiple datasets, which addressed the following long-lasting obstacles: (1) the data volume of the existing single dataset is not enough to fully exert the advantages of deep learning; (2) a large number of datasets from different research institutions cannot be effectively used due to inconsistent internal variances and low quality; (3) relatively uncommon cancers have limited effects on deep learning methods. In this article, the authors applied MSSL to The Cancer Genome Atlas (TCGA) and the Gene Expression Comprehen-sive Database (GEO) pan-cancer normalized-level3 RNA-seq data and got 97.6% final classification accuracy, which had a significant performance leap compared with previous approaches. Finally, they got the ranking of the importance of the corresponding genes for each cancer type based on classification results and validated that the top genes selected in this way were biologically meaningful for corresponding tumors and some of them had been used as biomarkers, which showed the efficacy of our method.

column
METHOD
Tumor type classification and candidate cancer-specific biomarkers discovery via semi-supervised learning
PROTOCOL
Integrated proteomic and phosphoproteomic data-independent acquisition data evaluate the personalized drug responses of primary and metastatic tumors in colorectal cancer
A basic phosphoproteomic-DIA workflow integrating precise quantification of phosphosites in systems biology
A hydrogen–deuterium exchange mass spectrometry-based protocol for protein–small molecule interaction analysis