Patel, RushabhGuo, YanhuiAlhudhaif, AdiAlenezi, FayadhAlthubiti, Sara A.Polat, Kemal2023-09-272023-09-272022Patel, R., Guo, Y., Alhudhaif, A., Alenezi, F., Althubiti, S. A., & Polat, K. (2021). Graph-based link prediction between human phenotypes and genes. Mathematical Problems in Engineering, 2022.1024-123X1563-5147http://dx.doi.org/10.1155/2022/7111647https://hdl.handle.net/20.500.12491/11753Deep phenotyping is defined as learning about genotype-phenotype associations and the history of human illness by analyzing phenotypic anomalies. It is significant to investigate the association between phenotype and genotype. Machine learning approaches are good at predicting the associations between abnormal human phenotypes and genes. A novel framework based on machine learning is proposed to estimate the links between human phenotype ontology (HPO) and genes. The Orphanet's annotation parses the human phenotype-gene associations. An algorithm node2vec generates the embeddings for the nodes (HPO and genes). It performs node sampling on the graph using random walks and learns features on these sampled nodes for embedding. These embeddings were used downstream to predict the link between these nodes by supervised classifiers. Results show the gradient boosting decision tree model (LightGBM) has achieved an optimal AUROC of 0.904 and an AUCPR of 0.784, an optimal weighted F1 score of 0.87. LightGBM can detect more accurate interactions and links between human phenotypes and gene pairs.eninfo:eu-repo/semantics/openAccessLightGBMAUROCPhenotypesGraph-based link prediction between human phenotypes and genesArticle10.1155/2022/71116472022182-s2.0-85128235473Q2WOS:000807377100006N/A