中国物理B ›› 2022, Vol. 31 ›› Issue (11): 118901-118901.doi: 10.1088/1674-1056/ac7a17

• • 上一篇    

Biased random walk with restart for essential proteins prediction

Pengli Lu(卢鹏丽)1,†, Yuntian Chen(陈云天)1, Teng Zhang(张腾)1, and Yonggang Liao(廖永刚)2   

  1. 1 School of Computer and Communication, Lanzhou University of Technology, Lanzhou 730050, China;
    2 China Mobile Communications Group Gansu Co., Ltd., Lanzhou 730070, China
  • 收稿日期:2022-04-23 修回日期:2022-06-06 接受日期:2022-06-18 出版日期:2022-10-17 发布日期:2022-10-17
  • 通讯作者: Pengli Lu E-mail:lupengli88@163.com
  • 基金资助:
    Project supported by the National Natural Science Foundation of China (Grant Nos. 11861045 and 62162040).

Biased random walk with restart for essential proteins prediction

Pengli Lu(卢鹏丽)1,†, Yuntian Chen(陈云天)1, Teng Zhang(张腾)1, and Yonggang Liao(廖永刚)2   

  1. 1 School of Computer and Communication, Lanzhou University of Technology, Lanzhou 730050, China;
    2 China Mobile Communications Group Gansu Co., Ltd., Lanzhou 730070, China
  • Received:2022-04-23 Revised:2022-06-06 Accepted:2022-06-18 Online:2022-10-17 Published:2022-10-17
  • Contact: Pengli Lu E-mail:lupengli88@163.com
  • Supported by:
    Project supported by the National Natural Science Foundation of China (Grant Nos. 11861045 and 62162040).

摘要: Predicting essential proteins is crucial for discovering the process of cellular organization and viability. We propose biased random walk with restart algorithm for essential proteins prediction, called BRWR. Firstly, the common process of practice walk often sets the probability of particles transferring to adjacent nodes to be equal, neglecting the influence of the similarity structure on the transition probability. To address this problem, we redefine a novel transition probability matrix by integrating the gene express similarity and subcellular location similarity. The particles can obtain biased transferring probabilities to perform random walk so as to further exploit biological properties embedded in the network structure. Secondly, we use gene ontology (GO) terms score and subcellular score to calculate the initial probability vector of the random walk with restart. Finally, when the biased random walk with restart process reaches steady state, the protein importance score is obtained. In order to demonstrate superiority of BRWR, we conduct experiments on the YHQ, BioGRID, Krogan and Gavin PPI networks. The results show that the method BRWR is superior to other state-of-the-art methods in essential proteins recognition performance. Especially, compared with the contrast methods, the improvements of BRWR in terms of the ACC results range in 1.4%-5.7%, 1.3%-11.9%, 2.4%-8.8%, and 0.8%-14.2%, respectively. Therefore, BRWR is effective and reasonable.

关键词: PPI network, essential proteins, random walk with restart, gene expression

Abstract: Predicting essential proteins is crucial for discovering the process of cellular organization and viability. We propose biased random walk with restart algorithm for essential proteins prediction, called BRWR. Firstly, the common process of practice walk often sets the probability of particles transferring to adjacent nodes to be equal, neglecting the influence of the similarity structure on the transition probability. To address this problem, we redefine a novel transition probability matrix by integrating the gene express similarity and subcellular location similarity. The particles can obtain biased transferring probabilities to perform random walk so as to further exploit biological properties embedded in the network structure. Secondly, we use gene ontology (GO) terms score and subcellular score to calculate the initial probability vector of the random walk with restart. Finally, when the biased random walk with restart process reaches steady state, the protein importance score is obtained. In order to demonstrate superiority of BRWR, we conduct experiments on the YHQ, BioGRID, Krogan and Gavin PPI networks. The results show that the method BRWR is superior to other state-of-the-art methods in essential proteins recognition performance. Especially, compared with the contrast methods, the improvements of BRWR in terms of the ACC results range in 1.4%-5.7%, 1.3%-11.9%, 2.4%-8.8%, and 0.8%-14.2%, respectively. Therefore, BRWR is effective and reasonable.

Key words: PPI network, essential proteins, random walk with restart, gene expression

中图分类号:  (Complex systems)

  • 89.75.-k