INTERDISCIPLINARY PHYSICS AND RELATED AREAS OF SCIENCE AND TECHNOLOGY |
Prev
|
|
|
AG-GATCN: A novel method for predicting essential proteins |
Peishi Yang(杨培实), Pengli Lu(卢鹏丽)†, and Teng Zhang(张腾) |
School of Computer and Communication, Lanzhou University of Technology, Lanzhou 730050, China |
|
|
Abstract Essential proteins play an important role in disease diagnosis and drug development. Many methods have been devoted to the essential protein prediction by using some kinds of biological information. However, they either ignore the noise presented in the biological information itself or the noise generated during feature extraction. To overcome these problems, in this paper, we propose a novel method for predicting essential proteins called attention gate-graph attention network and temporal convolutional network (AG-GATCN). In AG-GATCN method, we use improved temporal convolutional network (TCN) to extract features from gene expression sequence. To address the noise in the gene expression sequence itself and the noise generated after the dilated causal convolution, we introduce attention mechanism and gating mechanism in TCN. In addition, we use graph attention network (GAT) to extract protein-protein interaction (PPI) network features, in which we construct the feature matrix by introducing node2vec technique and 7 centrality metrics, and to solve the GAT oversmoothing problem, we introduce gated tanh unit (GTU) in GAT. Finally, two types of features are integrated by us to predict essential proteins. Compared with the existing methods for predicting essential proteins, the experimental results show that AG-GATCN achieves better performance.
|
Received: 18 October 2022
Revised: 08 December 2022
Accepted manuscript online: 08 February 2023
|
PACS:
|
89.75.-k
|
(Complex systems)
|
|
Fund: Project supported by the National Natural Science Foundation of China (Grant Nos. 11861045, 11361033, and 62162040). |
Corresponding Authors:
Pengli Lu
E-mail: lupengli88@163.com
|
Cite this article:
Peishi Yang(杨培实), Pengli Lu(卢鹏丽), and Teng Zhang(张腾) AG-GATCN: A novel method for predicting essential proteins 2023 Chin. Phys. B 32 058902
|
[1] Kamath R S, Fraser A G, Dong Y, et al. 2003 Nature 421 231 [2] Clatworthy A E, Pierson E, Hung D T, et al. 2007 Nat. Chem. Biol. 3 541 [3] Giaever G, Chu A M, Ni L, et al. 2002 Nature 418 387 [4] Cullen L M and Arndt G M 2005 Immunol. 83 217 [5] Roemer T, Jiang B, Davison J, et al. 2003 Mol. Microbiol. 50 167 [6] Maple J and Moller S G 2007 Circadian Rhythms p. 207 [7] Zhu H and Snyder M 2003 Curr. Opin. Chem. Biol. 7 55 [8] Freeman L C 1978 Soc. Networks 1 215 [9] Joy M P, Brock A, Ingber D E, et al. 2005 Jour. Biom. Biot. 2005 96 [10] Wuchty S and Stadler P F 2003 J. Theor. Biol. 223 45 [11] Estrada E and Rodriguez-Velazquez J A 2005 Phys. Rev. E. 71 056103 [12] Wang J X, Li M, Wang H, et al. 2011 IEEE ACM Trans. Comput. Biol. Bioi. 9 1070 [13] Li M, Wang J X, Chen X, et al. 2011 Comput. Biol. Chem. 35 143 [14] Zhang X, Xiao W X, Acencio M L, et al. 2016 BMC Bioi. 17 322 [15] Zhang C L and Zhang S W 2013 Comput. Biol. Med. 43 568 [16] Xiao Q H, Wang J X and Peng X, et al. 2015 BMC Genom. 16 S1 [17] Tang X W, Wang J X, Zhong J C, et al. 2013 IEEE ACM Trans. Comput. Biol. Bioi. 11 407 [18] Yugandhar K and Gromiha M M 2014 Prot. Stru. Func. Bioi. 82 2088 [19] Luo J W and Qi Y 2015 PloS one 10 e0131418 [20] Li M, Lu Y, Niu Z B, et al. 2015 IEEE ACM Trans. Comput. Biol. Bioi. 14 370 [21] Li M, Zhang H H, Wang J X, et al. 2012 BMC Syst. Biol. 6 1 [22] Zhong J C, Tang C, Peng W, et al. 2021 BMC Bioi. 22 1 [23] Li M, Li W K, Wu F X, et al. 2018 J. Theor. Biol. 447 65 [24] Wu C Y, Lin B T, Shi K, et al. 2021 Curr. Bioi. 16 1161 [25] Wang N, Zeng M, Li Y M, et al. 2021 J. Comput. Biol. 28 687 [26] Ahmed N M, Chen L, Li B, et al. 2021 Soft Comput. 25 8883 [27] Lei X J, Yang X Q, Wu F X, et al. 2018 IEEE ACM Trans. Comput. Biol. Bioi. 17 495 [28] Zeng M, Li M, Wu F X, et al. 2019 BMC Bioi. 20 506 [29] Li Y M, Zeng M, Wu Y F, et al. 2021 IEEE ACM Trans. Comput. Biol. Bioi. 19 3263 [30] Kipf T N and Welling M 2016 arXiv: 1609.02907 [31] Veličković P, Cucurull G, Casanova A, et al. 2017 arXiv: 1710.10903 [32] Bai S J, Kolter J Z, Koltun V, et al. 2018 arXiv: 1803.01271 [33] Bahdanau D, Cho K, Bengio Y, et al. 2014 arXiv: 1409.0473 [34] Grover A and Leskovec J 2016 Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining pp. 855-864 [35] Dauphin Y N, Fan A, Auli M, et al. 2017 International conference on machine learning pp. 933-941 [36] Chatr-Aryamontri A, Oughtred R, Boucher L, et al. 2017 Nucleic Acids Res. 45 D369 [37] Mewes H W, Frishman D, Güldener U, et al. 2002 Nucleic Acids Res. 30 31 [38] Cherry J M, Adler C, Ball C, et al. 1998 Nucleic Acids Res. 26 73 [39] Zhang R and Lin Y 2009 Nucleic Acids Res. 37 D455 [40] Winzeler E A, Shoemaker D D, Astromoff A, et al. 1999 Science 285 901 [41] Tu B P, Kudlicki A, Rowicka M, et al. 2005 Science 310 1152 [42] Zahidi Y, El Younoussi Y, Azroumahli C, et al. 2019 2019 5th International Conference on Optimization and Applications pp. 1-10 [43] Pedregosa F, Varoquaux G, Gramfort A, et al. 2011 J. Mach. Learn. Res. 12 2825 [44] Zeng M, Li M, Fei Z H, et al. 2021 IEEE ACM Trans. Comput. Biol. Bioi. 18 296 |
No Suggested Reading articles found! |
|
|
Viewed |
|
|
|
Full text
|
|
|
|
|
Abstract
|
|
|
|
|
Cited |
|
|
|
|
Altmetric
|
blogs
Facebook pages
Wikipedia page
Google+ users
|
Online attention
Altmetric calculates a score based on the online attention an article receives. Each coloured thread in the circle represents a different type of online attention. The number in the centre is the Altmetric score. Social media and mainstream news media are the main sources that calculate the score. Reference managers such as Mendeley are also tracked but do not contribute to the score. Older articles often score higher because they have had more time to get noticed. To account for this, Altmetric has included the context data for other articles of a similar age.
View more on Altmetrics
|
|
|