Home  |  Contact  |  Sitemap  |  Chinese  |  CAS

RESEARCH PROGRESS

Researchers Propose Computational Methods Predict Protein-protein Interaction

Editor: | Aug 21,2017

Protein interactions dominate almost all biochemical reactions in living cells, such as DNA synthesis, gene transcription and activation, protein translation, cell cycle regulation and signal transduction. Through the study of these interactions, people have provided a new perspective to understand the occurrence and development of disease, thus providing molecular level information support for the prevention, diagnosis and treatment of specific diseases. 

Due to limitations of experimental techniques, these methods are costly and time consuming and have high false positive rates and false negative prediction rates when dealing with large-scale experiments. Therefore, the development of reliable computational methods has important practical significance in promoting protein-protein interaction identification. 

Researchers  from Xinjiang Technical Institute of Physics and Chemistry of Chinese Academy of Science develop PCVMZM, a computational method based on a Probabilistic Classification Vector Machines (PCVM) model and Zernike moments (ZM) descriptor for predicting the Protein–protein interactions (PPIs) from protein amino acids sequences.The study was published in International Journal of Molecular Science.  

Vectorization representation of amino acid sequences is a prerequisite for the application of machine learning methods to biological problems. 

Researcher first used Position-Specific Scoring Matrix to represent each protein sequence, which retains the evolutionary information about proteins. And then, They employed the Zernike Moments method to generate feature vectors that be fed into classifier for implement prediction.   

This method was performed on Yeast and Helicobacter pylori datasets with five-fold cross-validation experimentswhich attained the average prediction accuracy of 94.48% and 91.25%, respectively.

The achieved AUC on Yeast was close to 97%.The results illustrate that the method can provide accurate, stable and high coverage prediction ability and provide a useful decision-making tool for genomics research.  

Based on the above research results, the researchers realized a more accurate and stable prediction system by adding a deep learning system. The experimental results display that this modified system can improve the accuracy of 2.2% and can realize cross species detection. The study was published in Molecular Biosystems.

This work was supported by the National Natural Science Foundation of China, National Science Foundation for excellent Young Scholars of China, and One Hundred Person Project of the Chinese Academy of Sciences. 

 

The flow chart of the PCVMZM(Image by XTIPC) 

Contact: 

Prof. YOU Zhuhong 

E-mail:zhuhongyou@ms.xjb.ac.cn 

Xinjiang Technical Institute of Physics & Chemistry, CAS 


附件下载:

(86) 991-3838931
lhskj@ms.xjb.ac.cn
(86)991-3838957
40-1 Beijing Road Urumqi, XinjiangChina