中国物理B ›› 2010, Vol. 19 ›› Issue (11): 110502-110201.doi: 10.1088/1674-1056/19/11/110502
江凡1, 孙重华2
Sun Zhong-Hua(孙重华)a)b) and Jiang Fan(江凡)a)†
摘要: In this paper a new continuous variable called core-ratio is defined to describe the probability for a residue to be in a binding site, thereby replacing the previous binary description of the interface residue using 0 and 1. So we can use the support vector machine regression method to fit the core-ratio value and predict the protein binding sites. We also design a new group of physical and chemical descriptors to characterize the binding sites. The new descriptors are more effective, with an averaging procedure used. Our test shows that much better prediction results can be obtained by the support vector regression (SVR) method than by the support vector classification method.
中图分类号: (Proteins)