Int J Biol Sci 2013; 9(1):22-33. doi:10.7150/ijbs.5371 This issue Cite
1. College of Mathematics and Physics, Qingdao University of Science and Technology; Qingdao, Shandong, China;
2. College of Electromechanical Engineering, Qingdao University of Science and Technology; Qingdao, Shandong, China.
The increasing protein sequences from the genome project require theoretical methods to predict transmembrane helical segments (TMHs). So far, several prediction methods have been reported, but there are some deficiencies in prediction accuracy and adaptability in these methods. In this paper, a method based on discrete wavelet transform (DWT) has been developed to predict the number and location of TMHs in membrane proteins. PDB coded as 1KQG is chosen as an example to describe the prediction process by this method. 80 proteins with known 3D structure from Mptopo database are chosen at random as data sets (including 325 TMHs) and 80 sequences are divided into 13 groups according to their function and type. TMHs prediction is carried out for each group of membrane protein sequences and obtain satisfactory result. To verify the feasibility of this method, 80 membrane protein sequences are treated as test sets, 308 TMHs can be predicted and the prediction accuracy is 96.3%. Compared with the main prediction results of seven popular prediction methods, the obtained results indicate that the proposed method in this paper has higher prediction accuracy.
Keywords: Membrane protein, Transmembrane helical segments, Discrete wavelet transform, Hydrophobicity.