Universiti Teknologi Malaysia Institutional Repository

scGGAN: single-cell RNA-seq imputation by graph-based generative adversarial network

Huang, Zimo and Wang, Jun and Lu, Xudong and Mohd. Zain, Azlan and Yu, Guoxian (2023) scGGAN: single-cell RNA-seq imputation by graph-based generative adversarial network. Briefings in Bioinformatics, 24 (2). pp. 1-15. ISSN 1467-5463

[img] PDF
2MB

Official URL: http://dx.doi.org/10.1093/bib/bbad040

Abstract

Single-cell RNA sequencing (scRNA-seq) data are typically with a large number of missing values, which often results in the loss of critical gene signaling information and seriously limit the downstream analysis. Deep learning-based imputation methods often can better handle scRNA-seq data than shallow ones, but most of them do not consider the inherent relations between genes, and the expression of a gene is often regulated by other genes. Therefore, it is essential to impute scRNA-seq data by considering the regional gene-to-gene relations. We propose a novel model (named scGGAN) to impute scRNA-seq data that learns the gene-to-gene relations by Graph Convolutional Networks (GCN) and global scRNA-seq data distribution by Generative Adversarial Networks (GAN). scGGAN first leverages single-cell and bulk genomics data to explore inherent relations between genes and builds a more compact gene relation network to jointly capture the homogeneous and heterogeneous information. Then, it constructs a GCN-based GAN model to integrate the scRNA-seq, gene sequencing data and gene relation network for generating scRNA-seq data, and trains the model through adversarial learning. Finally, it utilizes data generated by the trained GCN-based GAN model to impute scRNA-seq data. Experiments on simulated and real scRNA-seq datasets show that scGGAN can effectively identify dropout events, recover the biologically meaningful expressions, determine subcellular states and types, improve the differential expression analysis and temporal dynamics analysis. Ablation experiments confirm that both the gene relation network and gene sequence data help the imputation of scRNA-seq data.

Item Type:Article
Uncontrolled Keywords:Generative Adversarial Networks, Graph Convolutional Networks, data imputation, gene relation network, single-cell RNA-seq
Subjects:Q Science > QA Mathematics > QA75 Electronic computers. Computer science
Divisions:Computing
ID Code:105572
Deposited By: Widya Wahid
Deposited On:06 May 2024 06:28
Last Modified:06 May 2024 06:28

Repository Staff Only: item control page