publications
*: equal contribution.
2025
- Genom. Proteom. Bioinform.LEGEND: Identifying Co-expressed Genes in Multimodal Transcriptomic Sequencing DataTao Deng*, Mengqian Huang, Kaichen Xu, Yan Lu, Yucheng Xu, Siyu Chen, Nina Xie, and 3 more authorsGenomics, Proteomics & Bioinformatics, Sep 2025
Identifying co-expressed genes across tissue domains and cell types is essential for revealing co-functional genes involved in biological or pathological processes. While both single-cell RNA sequencing (scRNA-seq) and spatially resolved transcriptomics (SRT) data offer insights into gene co-expression patterns, current methods typically utilize either data type alone, potentially diluting the co-functionality signals within co-expressed gene groups. To bridge this gap, we introduce muLtimodal co-Expressed GENes finDer (LEGEND), a novel computational method that integrates scRNA-seq and SRT data for identifying groups of co-expressed genes at both cell type and tissue domain levels. LEGEND employs an innovative hierarchical clustering algorithm designed to maximize intra-cluster redundancy and inter-cluster complementarity, effectively capturing more nuanced patterns of gene co-expression and spatial coherence. Enrichment and co-function analyses further showcase the biological relevance of these gene clusters and their utilities in exploring context-specific novel gene functions. Notably, LEGEND can reveal shifts in gene–gene interactions under different conditions, providing insights into disease-associated gene crosstalk. Moreover, LEGEND can enhance the annotation accuracy of both spatial spots in SRT and single cells in scRNA-seq, and serve as a pioneering tool for identifying genes with designated spatial expression patterns. LEGEND is available at https://github.com/ToryDeng/LEGEND.
@article{deng2025LEGEND, title = {{LEGEND}: Identifying Co-expressed Genes in Multimodal Transcriptomic Sequencing Data}, author = {Deng, Tao and Huang, Mengqian and Xu, Kaichen and Lu, Yan and Xu, Yucheng and Chen, Siyu and Xie, Nina and Tao, Qiuyue and Wu, Hao and Sun, Xiaobo}, year = {2025}, month = sep, journal = {Genomics, Proteomics \& Bioinformatics}, volume = {23}, number = {4}, pages = {qzaf056}, doi = {10.1093/gpbjnl/qzaf056}, }
2024
- Genome Biol.scCTS: Identifying the Cell Type-Specific Marker Genes from Population-Level Single-Cell RNA-seqLuxiao Chen*, Zhenxing Guo*, Tao Deng, and Hao WuGenome Biology, Oct 2024
Single-cell RNA-sequencing (scRNA-seq) provides gene expression profiles of individual cells from complex samples, facilitating the detection of cell type-specific marker genes. In scRNA-seq experiments with multiple donors, the population level variation brings an extra layer of complexity in cell type-specific gene detection, for example, they may not appear in all donors. Motivated by this observation, we develop a statistical model named scCTS to identify cell type-specific genes from population-level scRNA-seq data. Extensive data analyses demonstrate that the proposed method identifies more biologically meaningful cell type-specific genes compared to traditional methods.
@article{chen2024scCTS, title = {{{scCTS}}: Identifying the Cell Type-Specific Marker Genes from Population-Level Single-Cell {{RNA-seq}}}, author = {Chen, Luxiao and Guo, Zhenxing and Deng, Tao and Wu, Hao}, year = {2024}, month = oct, journal = {Genome Biology}, volume = {25}, number = {1}, pages = {269}, doi = {10.1186/s13059-024-03410-8}, }
2023
- Brief. Bioinform.A Cofunctional Grouping-Based Approach for Non-Redundant Feature Gene Selection in Unannotated Single-Cell RNA-seq AnalysisTao Deng*, Siyu Chen, Ying Zhang, Yuanbin Xu, Da Feng, Hao Wu, and Xiaobo Sun*Briefings in Bioinformatics, Feb 2023
Feature gene selection has significant impact on the performance of cell clustering in single-cell RNA sequencing (scRNA-seq) analysis. A well-rounded feature selection (FS) method should consider relevance, redundancy and complementarity of the features. Yet most existing FS methods focus on gene relevance to the cell types but neglect redundancy and complementarity, which undermines the cell clustering performance. We develop a novel computational method GeneClust to select feature genes for scRNA-seq cell clustering. GeneClust groups genes based on their expression profiles, then selects genes with the aim of maximizing relevance, minimizing redundancy and preserving complementarity. It can work as a plug-in tool for FS with any existing cell clustering method. Extensive benchmark results demonstrate that GeneClust significantly improve the clustering performance. Moreover, GeneClust can group cofunctional genes in biological process and pathway into clusters, thus providing a means of investigating gene interactions and identifying potential genes relevant to biological characteristics of the dataset. GeneClust is freely available at https://github.com/ToryDeng/scGeneClust.
@article{deng2023cofunctional, title = {A Cofunctional Grouping-Based Approach for Non-Redundant Feature Gene Selection in Unannotated Single-Cell {{RNA-seq}} Analysis}, author = {Deng, Tao and Chen, Siyu and Zhang, Ying and Xu, Yuanbin and Feng, Da and Wu, Hao and Sun, Xiaobo}, year = {2023}, month = feb, journal = {Briefings in Bioinformatics}, volume = {24}, number = {2}, pages = {bbad042}, doi = {10.1093/bib/bbad042}, copyright = {All rights reserved}, }
2021
- SN Appl. Sci.Short Term Prediction of Wireless Traffic Based on Tensor Decomposition and Recurrent Neural NetworkTao Deng, Mengxuan Wan, Kaiwen Shi, Ling Zhu, Xichen Wang, and Xuchu JiangSN Applied Sciences, Sep 2021
This paper proposes a wireless network traffic prediction model based on Bayesian Gaussian tensor decomposition and recurrent neural network with rectified linear unit (BGCP-RNN-ReLU model), which can effectively predict the changes in the upstream and downstream network traffic in a short period of time in the future. The research is divided into two parts: (i) The missing observations are imputed by an algorithm based on Bayesian Gaussian tensor decomposition. (ii) The recurrent neural network is used to forecast the true observations only rather than both true and estimated observations. The results show that, compared with other combined models of missing data imputation and neural networks, the BGCP-RNN-ReLU model proposed in this paper has the smallest prediction error for both the upstream and downstream traffic. The new model achieves better forecasting precision, and thus can help to regulate the load of communication station to reduce resource consumption.
@article{deng2021short, title = {Short Term Prediction of Wireless Traffic Based on Tensor Decomposition and Recurrent Neural Network}, author = {Deng, Tao and Wan, Mengxuan and Shi, Kaiwen and Zhu, Ling and Wang, Xichen and Jiang, Xuchu}, year = {2021}, month = sep, journal = {SN Applied Sciences}, volume = {3}, number = {9}, pages = {779}, doi = {10.1007/s42452-021-04761-8}, }