Skip to Content

Genetics and Biochemistry Profiles

Profile Photo

LiangJiang (LJ) Wang

Genetics and Biochemistry

Associate Professor

Life Sciences Building 228B [Office]


Educational Background

Ph.D., Botany, University of Georgia, 1999
M.S., Computer Science, Mississippi State University, 2001
M.S., Biology, Zhejiang University, China, 1989
B.S., Biology, Zhejiang University, China, 1986

Research Interests

The research in my lab has focused on biological knowledge discovery, genomic data integration and mining, and computational RNA biology. We previously developed machine learning models and web-based tools for biomedical research, including BindN and BindN+ for predicting DNA/RNA-binding residues in protein sequence, MuStab for protein stability prediction, and seeSUMO for protein sumoylation site prediction. We also integrated and mined the vast amount of publicly available gene expression data for understanding the molecular pathways involved in human diseases, including intellectual disability, autism, and cancer.

Recently, we have been developing machine learning and data mining approaches for the functional annotation of human long noncoding RNAs (lncRNAs) and circular RNAs (circRNAs) by leveraging the vast amount of genetic and genomic data ("biological big data"). We have constructed machine learning models and gene co-expression networks to predict and prioritize candidate lncRNAs associated with intellectual disability and autism spectrum disorders. We have also applied deep learning techniques to the prediction and pattern analysis of lncRNA subcellular localization, circRNA back-splicing code, and RNA-protein interactions. Our studies demonstrate that genomic data mining can not only give insights into RNA functions in gene regulation and 3D genome organization, but also provide valuable information for experimental studies of candidate genes associated with human diseases.

Research Group (Lab)

Current Students:
Snehal Shah (2019 - present, PhD student in Healthcare Genetics)
Anqi Wei (2017 - present, PhD student in Biochemistry)

Former Students:
Shuzhen Kuang (2017 - 2020, PhD in Biology, Postdoc at UCSF School of Medicine)
Jun Wang (2015 - 2020, PhD in Biochemistry, Postdoc at Yale School of Medicine)
Brian Gudenas (2014 - 2018, PhD in Genetics, Postdoc at St Jude Research Hospital)
Jose Guevara (2012 - 2017, PhD in Biochemistry, Professor at University of Costa Rica)
Steven Cogill (2012 - 2016, PhD in Genetics, Postdoc at Stanford School of Medicine)
Shaolei Teng (2007 - 2011, PhD in Biochemistry, Postdoc at Cold Spring Harbor Laboratory)

Courses Taught

GEN/BCHM 4400/4400H/6400 Bioinformatics
GEN/BCHM 4930 Senior Seminar
GEN 3020/3020H Introduction to Genetics
BCHM 3050 Essential Elements of Biochemistry
GEN/BCHM 8100 Principles of Molecular Biology

Selected Publications

(Selected from 79 peer-reviewed publications. *Correspondence author)

Kuang, S., Wei, Y.. Wang, L.* (2021) Expression-based prediction of human essential genes and candidate lncRNAs in cancer cells. Bioinformatics, btaa717. Online ahead of print.

Kuang, S., Wang, L.* (2021) Deep learning of sequence patterns for CTCF-mediated chromatin loop formation. Journal of Computational Biology, 28(2):133-145.

Wang, J., Wang, L.* (2020) Prediction and prioritization of autism-associated long non-coding RNAs using gene expression and sequence features. BMC Bioinformatics, 21(1):505.

Kuang, S., Wang, L.* (2020) Identification and analysis of consensus RNA motifs binding to the genome regulator CTCF. NAR Genomics and Bioinformatics, 2(2):lqaa031.

Wang, J., Wang, L.* (2020) Deep analysis of RNA N6-adenosine methylation (m6A) patterns in human cells. NAR Genomics and Bioinformatics, 2(1):lqaa007.

Wang, J., Wang, L.* (2019) Deep learning of the back-splicing code for circular RNA formation. Bioinformatics, 35(24):5235-5242.

Gudenas, B.L., Wang, J., Kuang, S., Wei, A., Cogill, S.B., Wang, L.* (2019) Genomic data mining for functional annotation of human long noncoding RNAs. Journal of Zhejiang University - Science B, 20(6):476-487.

Gudenas, B.L., Wang, L.* (2018) Prediction of lncRNA subcellular localization with deep learning from sequence features. Scientific Reports, 8:16385.

Cogill, S.B., Srivastava, A.K., Yang, M.Q., Wang, L.* (2018) Co-expression of long non-coding RNAs and autism risk genes in the developing human brain. BMC Systems Biology, 12(Suppl 7):91.

Yang, X., Kuang, S., Wang, L.*, Wei, Y.* (2018) MHC class I chain-related A: Polymorphism, regulation and therapeutic value in cancer. Biomedicine & Pharmacotherapy, 103:111-117.

Gudenas, B.L., Srivastava, A.K., Wang, L.* (2017) Integrative genomic analyses for identification and prioritization of long non-coding RNAs associated with autism. PLOS ONE, 12(5):e0178532.

Cogill, S.B., Wang, L.* (2016) Support vector machine model of developmental brain gene expression data for prioritization of autism risk gene candidates. Bioinformatics, 32 (23):3611-3618.

Gudenas, B.L., Wang, L.* (2015) Gene co-expression networks in human brain developmental transcriptomes implicate the association of long non-coding RNAs with intellectual disability. Bioinformatics and Biology Insights, 9(Suppl 1):21-27.

Cogill, S.B., Wang, L.* (2014) Co-expression network analysis of human lncRNAs and cancer genes. Cancer Informatics, 13(S5):49-59.

Teng, S., Yang, J.Y., Wang, L.* (2013) Genome-wide prediction and analysis of human tissue-selective genes using microarray expression data. BMC Medical Genomics, 6(Suppl 1):S10.

Teng, S., Luo, H., Wang, L.* (2012) Predicting protein sumoylation sites from sequence features. Amino Acids, 43(1):447-455.

Wang, L. *, Huang, C., Yang, J.Y. (2010) Predicting siRNA potency with random forests and support vector machines. BMC Genomics, 11(Suppl 3):S2.

Teng, S., Srivastava, A.K., Wang, L.* (2010) Sequence feature-based prediction of protein stability changes upon amino acid substitutions. BMC Genomics, 11(Suppl 2):S5.

Wang, L.*, Huang, C., Yang, M.Q., Yang, J.Y. (2010) BindN+ for accurate prediction of DNA and RNA-binding residues from protein sequence features. BMC Systems Biology, 4(Suppl 1):S3.

Tribolium Genome Sequencing Consortium (Wang, L. as a coauthor) (2008) The genome of the model beetle and pest Tribolium castaneum. Nature, 452(7190):949-955.

Wang, L., Wang, S., Li, Y., Paradesi, M.S.R., Brown, S.J.* (2007) BeetleBase: the model organism database for Tribolium castaneum. Nucleic Acids Research, 35(Database issue):D476-D479.

Wang, L.*, Brown, S.J. (2006) BindN: a web-based tool for efficient prediction of DNA and RNA binding sites in amino acid sequences. Nucleic Acids Research, 34(Web Server issue):W243-W248.

Casa, A.M., Brouwer, C., Nagel, A., Wang, L., Zhang, Q., Kresovich, S., Wessler, S.R.* (2000) Inaugural Article: The MITE family Heartbreaker (Hbr): Molecular markers in maize. Proc. Natl. Acad. Sci. USA, 97(18):10083-10089.

Wang, L., Wessler, S.R.* (1998) Inefficient reinitiation is responsible for upstream open reading frame-mediated translational repression of the maize R gene. Plant Cell, 10(10):1733-1745.


Google Scholar - Liangjiang Wang
GitHub - BioDataLearning