Skip to content

About

Contact Information

P: 864-656-6237
E: kgthoma@clemson.edu

Campus Location

190 Collings St.

Hours

Monday - Friday:
8 a.m. - 4:30 p.m.

Profile


Profile Photo

LiangJiang (LJ) Wang

Genetics and Biochemistry

Associate Professor

Life Sciences Building 228B [Office]

liangjw@clemson.edu

Educational Background

Ph.D., Botany, University of Georgia, 1999
M.S., Computer Science, Mississippi State University, 2001
M.S., Biology, Zhejiang University, China, 1989
B.S., Biology, Zhejiang University, China, 1986

Research Interests

The research in my lab has focused on biological knowledge discovery, genomic data integration and mining, and computational RNA biology. We previously developed machine learning models and web-based tools for biomedical research, including BindN and BindN+ for predicting DNA/RNA-binding residues in protein sequence, MuStab for protein stability prediction, and seeSUMO for protein sumoylation site prediction. We also integrated and mined the vast amount of publicly available gene expression data for understanding the molecular pathways involved in human diseases, including intellectual disability, autism, and cancer.

Recently, we have been developing machine learning and data mining approaches for the functional annotation of human long noncoding RNAs (lncRNAs) and circular RNAs (circRNAs) by leveraging the vast amount of genetic and genomic data ("biological big data"). We have constructed machine learning models and gene co-expression networks to predict and prioritize candidate lncRNAs associated with intellectual disability and autism spectrum disorders. We have also applied deep learning techniques to the prediction and pattern analysis of lncRNA subcellular localization, circRNA back-splicing code, and RNA-protein interactions. Our studies demonstrate that genomic data mining can not only give insights into RNA functions in gene regulation and 3D genome organization, but also provide valuable information for experimental studies of candidate genes associated with human diseases.

Research Group (Lab)

Current Students:
Snehal Shah (2019 - present, PhD student in Healthcare Genetics)
Anqi Wei (2017 - present, PhD student in Biochemistry)

Former Students:
Shuzhen Kuang (2017 - 2020, PhD in Biology, Postdoc at UCSF School of Medicine)
Jun Wang (2015 - 2020, PhD in Biochemistry, Postdoc at Yale School of Medicine)
Brian Gudenas (2014 - 2018, PhD in Genetics, Postdoc at St Jude Research Hospital)
Jose Guevara (2012 - 2017, PhD in Biochemistry, Professor at University of Costa Rica)
Steven Cogill (2012 - 2016, PhD in Genetics, Postdoc at Stanford School of Medicine)
Shaolei Teng (2007 - 2011, PhD in Biochemistry, Postdoc at Cold Spring Harbor Laboratory)

Courses Taught

GEN/BCHM 4400/4400H/6400 Bioinformatics
GEN/BCHM 4930 Senior Seminar
GEN 3020/3020H Introduction to Genetics
BCHM 3050 Essential Elements of Biochemistry
GEN/BCHM 8100 Principles of Molecular Biology

Selected Publications

(Selected from 79 peer-reviewed publications. *Correspondence author)

Kuang, S., Wei, Y. and Wang, L.* (2021) Expression-based prediction of human essential genes and candidate lncRNAs in cancer cells. Bioinformatics, 37(3):396-403. https://pubmed.ncbi.nlm.nih.gov/32790840/

Kuang, S., Wang, L.* (2021) Deep learning of sequence patterns for CTCF-mediated chromatin loop formation. Journal of Computational Biology, 28(2):133-145. https://pubmed.ncbi.nlm.nih.gov/33232622/

Wang, J., Wang, L.* (2020) Prediction and prioritization of autism-associated long non-coding RNAs using gene expression and sequence features. BMC Bioinformatics, 21(1):505. https://pubmed.ncbi.nlm.nih.gov/33160303/

Kuang, S. and Wang, L.* (2020) Identification and analysis of consensus RNA motifs binding to the genome regulator CTCF. NAR Genomics and Bioinformatics, 2(2):lqaa031. https://pubmed.ncbi.nlm.nih.gov/33575587/

Wang, J. and Wang, L.* (2020) Deep analysis of RNA N6-adenosine methylation (m6A) patterns in human cells. NAR Genomics and Bioinformatics, 2(1):lqaa007. https://pubmed.ncbi.nlm.nih.gov/33575554/

Wang, J., Wang, L.* (2019) Deep learning of the back-splicing code for circular RNA formation. Bioinformatics, 35(24):5235-5242. https://www.ncbi.nlm.nih.gov/pubmed/31077303

Gudenas, B.L., Wang, J., Kuang, S., Wei, A., Cogill, S.B., Wang, L.* (2019) Genomic data mining for functional annotation of human long noncoding RNAs. Journal of Zhejiang University - Science B, 20(6):476-487. https://www.ncbi.nlm.nih.gov/pubmed/31090273

Gudenas, B.L., Wang, L.* (2018) Prediction of lncRNA subcellular localization with deep learning from sequence features. Scientific Reports, 8:16385. https://www.ncbi.nlm.nih.gov/pubmed/30401954

Cogill, S.B., Srivastava, A.K., Yang, M.Q., Wang, L.* (2018) Co-expression of long non-coding RNAs and autism risk genes in the developing human brain. BMC Systems Biology, 12(Suppl 7):91. https://www.ncbi.nlm.nih.gov/pubmed/30547845

Yang, X., Kuang, S., Wang, L.*, Wei, Y.* (2018) MHC class I chain-related A: Polymorphism, regulation and therapeutic value in cancer. Biomedicine & Pharmacotherapy, 103:111-117. https://www.ncbi.nlm.nih.gov/pubmed/29635123

Gudenas, B.L., Srivastava, A.K., Wang, L.* (2017) Integrative genomic analyses for identification and prioritization of long non-coding RNAs associated with autism. PLOS ONE, 12(5):e0178532. https://www.ncbi.nlm.nih.gov/pubmed/28562671

Cogill, S.B., Wang, L.* (2016) Support vector machine model of developmental brain gene expression data for prioritization of autism risk gene candidates. Bioinformatics, 32 (23):3611-3618. https://www.ncbi.nlm.nih.gov/pubmed/27506227

Gudenas, B.L., Wang, L.* (2015) Gene co-expression networks in human brain developmental transcriptomes implicate the association of long non-coding RNAs with intellectual disability. Bioinformatics and Biology Insights, 9(Suppl 1):21-27. https://www.ncbi.nlm.nih.gov/pubmed/26523118

Cogill, S.B., Wang, L.* (2014) Co-expression network analysis of human lncRNAs and cancer genes. Cancer Informatics, 13(S5):49-59. https://www.ncbi.nlm.nih.gov/pubmed/25392693

Teng, S., Yang, J.Y., Wang, L.* (2013) Genome-wide prediction and analysis of human tissue-selective genes using microarray expression data. BMC Medical Genomics, 6(Suppl 1):S10. https://www.ncbi.nlm.nih.gov/pubmed/23369200

Teng, S., Luo, H., Wang, L.* (2012) Predicting protein sumoylation sites from sequence features. Amino Acids, 43(1):447-455. https://www.ncbi.nlm.nih.gov/pubmed/21986959

Wang, L. *, Huang, C., Yang, J.Y. (2010) Predicting siRNA potency with random forests and support vector machines. BMC Genomics, 11(Suppl 3):S2. https://www.ncbi.nlm.nih.gov/pubmed/21143784

Teng, S., Srivastava, A.K., Wang, L.* (2010) Sequence feature-based prediction of protein stability changes upon amino acid substitutions. BMC Genomics, 11(Suppl 2):S5. https://www.ncbi.nlm.nih.gov/pubmed/21047386

Wang, L.*, Huang, C., Yang, M.Q., Yang, J.Y. (2010) BindN+ for accurate prediction of DNA and RNA-binding residues from protein sequence features. BMC Systems Biology, 4(Suppl 1):S3. https://www.ncbi.nlm.nih.gov/pubmed/20522253

Tribolium Genome Sequencing Consortium (Wang, L. as a coauthor) (2008) The genome of the model beetle and pest Tribolium castaneum. Nature, 452(7190):949-955. https://www.ncbi.nlm.nih.gov/pubmed/18362917

Wang, L., Wang, S., Li, Y., Paradesi, M.S.R., Brown, S.J.* (2007) BeetleBase: the model organism database for Tribolium castaneum. Nucleic Acids Research, 35(Database issue):D476-D479. https://www.ncbi.nlm.nih.gov/pubmed/17090595

Wang, L.*, Brown, S.J. (2006) BindN: a web-based tool for efficient prediction of DNA and RNA binding sites in amino acid sequences. Nucleic Acids Research, 34(Web Server issue):W243-W248. https://www.ncbi.nlm.nih.gov/pubmed/16845003

Casa, A.M., Brouwer, C., Nagel, A., Wang, L., Zhang, Q., Kresovich, S., Wessler, S.R.* (2000) Inaugural Article: The MITE family Heartbreaker (Hbr): Molecular markers in maize. Proc. Natl. Acad. Sci. USA, 97(18):10083-10089. https://www.ncbi.nlm.nih.gov/pubmed/10963671

Wang, L., Wessler, S.R.* (1998) Inefficient reinitiation is responsible for upstream open reading frame-mediated translational repression of the maize R gene. Plant Cell, 10(10):1733-1745. https://www.ncbi.nlm.nih.gov/pubmed/9761799

Links

Google Scholar - Liangjiang Wang
GitHub - BioDataLearning

Contact Information

P: 864-656-6237
E: kgthoma@clemson.edu

Campus Location

190 Collings St.

Hours

Monday - Friday:
8 a.m. - 4:30 p.m.