Huang, Zimo and Wang, Jun and Lu, Xudong and Mohd. Zain, Azlan and Yu, Guoxian (2023) scGGAN: single-cell RNA-seq imputation by graph-based generative adversarial network. Briefings in Bioinformatics, 24 (2). pp. 1-15. ISSN 1467-5463
PDF
2MB |
Official URL: http://dx.doi.org/10.1093/bib/bbad040
Abstract
Single-cell RNA sequencing (scRNA-seq) data are typically with a large number of missing values, which often results in the loss of critical gene signaling information and seriously limit the downstream analysis. Deep learning-based imputation methods often can better handle scRNA-seq data than shallow ones, but most of them do not consider the inherent relations between genes, and the expression of a gene is often regulated by other genes. Therefore, it is essential to impute scRNA-seq data by considering the regional gene-to-gene relations. We propose a novel model (named scGGAN) to impute scRNA-seq data that learns the gene-to-gene relations by Graph Convolutional Networks (GCN) and global scRNA-seq data distribution by Generative Adversarial Networks (GAN). scGGAN first leverages single-cell and bulk genomics data to explore inherent relations between genes and builds a more compact gene relation network to jointly capture the homogeneous and heterogeneous information. Then, it constructs a GCN-based GAN model to integrate the scRNA-seq, gene sequencing data and gene relation network for generating scRNA-seq data, and trains the model through adversarial learning. Finally, it utilizes data generated by the trained GCN-based GAN model to impute scRNA-seq data. Experiments on simulated and real scRNA-seq datasets show that scGGAN can effectively identify dropout events, recover the biologically meaningful expressions, determine subcellular states and types, improve the differential expression analysis and temporal dynamics analysis. Ablation experiments confirm that both the gene relation network and gene sequence data help the imputation of scRNA-seq data.
Item Type: | Article |
---|---|
Uncontrolled Keywords: | Generative Adversarial Networks, Graph Convolutional Networks, data imputation, gene relation network, single-cell RNA-seq |
Subjects: | Q Science > QA Mathematics > QA75 Electronic computers. Computer science |
Divisions: | Computing |
ID Code: | 105572 |
Deposited By: | Widya Wahid |
Deposited On: | 06 May 2024 06:28 |
Last Modified: | 06 May 2024 06:28 |
Repository Staff Only: item control page