中国物理B ›› 2024, Vol. 33 ›› Issue (1): 18903-18903.doi: 10.1088/1674-1056/acd7ca

• • 上一篇    下一篇

Essential proteins identification method based on four-order distances and subcellular localization information

Pengli Lu(卢鹏丽)1,†, Yu Zhong(钟雨)1,2,‡, and Peishi Yang(杨培实)1   

  1. 1 School of Computer and Communication, Lanzhou University of Technology, Lanzhou 730050, China;
    2 School of Tianmen Vocational College, Tianmen 431700, China
  • 收稿日期:2023-02-01 修回日期:2023-04-28 接受日期:2023-05-23 出版日期:2023-12-13 发布日期:2023-12-20
  • 通讯作者: Pengli Lu, Yu Zhong E-mail:lupengli88@163.com;13026326063@163.com
  • 基金资助:
    Project supported by the Gansu Province Industrial Support Plan (Grant No. 2023CYZC-25), the Natural Science Foundation of Gansu Province (Grant No. 23JRRA770), and the National Natural Science Foundation of China (Grant No. 62162040).

Essential proteins identification method based on four-order distances and subcellular localization information

Pengli Lu(卢鹏丽)1,†, Yu Zhong(钟雨)1,2,‡, and Peishi Yang(杨培实)1   

  1. 1 School of Computer and Communication, Lanzhou University of Technology, Lanzhou 730050, China;
    2 School of Tianmen Vocational College, Tianmen 431700, China
  • Received:2023-02-01 Revised:2023-04-28 Accepted:2023-05-23 Online:2023-12-13 Published:2023-12-20
  • Contact: Pengli Lu, Yu Zhong E-mail:lupengli88@163.com;13026326063@163.com
  • Supported by:
    Project supported by the Gansu Province Industrial Support Plan (Grant No. 2023CYZC-25), the Natural Science Foundation of Gansu Province (Grant No. 23JRRA770), and the National Natural Science Foundation of China (Grant No. 62162040).

摘要: Essential proteins are inseparable in cell growth and survival. The study of essential proteins is important for understanding cellular functions and biological mechanisms. Therefore, various computable methods have been proposed to identify essential proteins. Unfortunately, most methods based on network topology only consider the interactions between a protein and its neighboring proteins, and not the interactions with its higher-order distance proteins. In this paper, we propose the DSEP algorithm in which we integrated network topology properties and subcellular localization information in protein—protein interaction (PPI) networks based on four-order distances, and then used random walks to identify the essential proteins. We also propose a method to calculate the finite-order distance of the network, which can greatly reduce the time complexity of our algorithm. We conducted a comprehensive comparison of the DSEP algorithm with 11 existing classical algorithms to identify essential proteins with multiple evaluation methods. The results show that DSEP is superior to these 11 methods.

关键词: protein—protein interaction (PPI) network, essential proteins, four-order distances, subcellular localization information

Abstract: Essential proteins are inseparable in cell growth and survival. The study of essential proteins is important for understanding cellular functions and biological mechanisms. Therefore, various computable methods have been proposed to identify essential proteins. Unfortunately, most methods based on network topology only consider the interactions between a protein and its neighboring proteins, and not the interactions with its higher-order distance proteins. In this paper, we propose the DSEP algorithm in which we integrated network topology properties and subcellular localization information in protein—protein interaction (PPI) networks based on four-order distances, and then used random walks to identify the essential proteins. We also propose a method to calculate the finite-order distance of the network, which can greatly reduce the time complexity of our algorithm. We conducted a comprehensive comparison of the DSEP algorithm with 11 existing classical algorithms to identify essential proteins with multiple evaluation methods. The results show that DSEP is superior to these 11 methods.

Key words: protein—protein interaction (PPI) network, essential proteins, four-order distances, subcellular localization information

中图分类号:  (Complex systems)

  • 89.75.-k
87.23.Cc (Population dynamics and ecological pattern formation) 87.23.Ge (Dynamics of social systems)