Lecture Notes in Computer Science, 2011, Volume 6674/2011, 320-331, DOI: 10.1007/978-3-642-21260-4_31

Query-Adaptive Ranking with Support Vector Machines for Protein Homology Prediction

Yan Fu, Rong Pan, Qiang Yang and Wen Gao

View Related Documents

Abstract

Protein homology prediction is a crucial step in template-based protein structure prediction. The functions that rank the proteins in a database according to their homologies to a query protein is the key to the success of protein structure prediction. In terms of information retrieval, such functions are called ranking functions, and are often constructed by machine learning approaches. Different from traditional machine learning problems, the feature vectors in the ranking-function learning problem are not identically and independently distributed, since they are calculated with regard to queries and may vary greatly in statistical characteristics from query to query. At present, few existing algorithms make use of the query-dependence to improve ranking performance. This paper proposes a query-adaptive ranking-function learning algorithm for protein homology prediction. Experiments with the support vector machine (SVM) used as the benchmark learner demonstrate that the proposed algorithm can significantly improve the ranking performance of SVMs in the protein homology prediction task.

Keywords  Protein homology prediction – information retrieval – ranking function – machine learning – support vector machine

This work was supported by the Research Initiation Funds for President Scholarship Winners of Chinese Academy of Sciences (CAS), the National Natural Science Foundation of China (30900262, 61003140 and 61033010), the CAS Knowledge Innovation Program (KGGX1-YW-13), and the Fundamental Research Funds for the Central Universities (09lgpy62).

Fulltext Preview

Image of the first page of the fulltext document