TY - JOUR
T1 - Identification of the ligand binding sites on the molecular surface of proteins
AU - Kinoshita, Kengo
AU - Nakamura, Haruki
PY - 2005/3
Y1 - 2005/3
N2 - Identification of protein biochemical functions based on their three-dimensional structures is now required in the post-genome-sequencing era. Ligand binding is one of the major biochemical functions of proteins, and thus the identification of ligands and their binding sites is the starting point for the function identification. Previously we reported our first trial on structure-based function prediction, based on the similarity searches of molecular surfaces against the functional site database. Here we describe the extension of our first trial by expanding the search database to whole heteroatom binding sites appearing within the Protein Data Bank (PDB) with the new analysis protocol. In addition, we have determined the similarity threshold line, by using 10 structure pairs with solved free and complex structures. Finally, we extensively applied our method to newly determined hypothetical proteins, including some without annotations, and evaluated the performance of our methods.
AB - Identification of protein biochemical functions based on their three-dimensional structures is now required in the post-genome-sequencing era. Ligand binding is one of the major biochemical functions of proteins, and thus the identification of ligands and their binding sites is the starting point for the function identification. Previously we reported our first trial on structure-based function prediction, based on the similarity searches of molecular surfaces against the functional site database. Here we describe the extension of our first trial by expanding the search database to whole heteroatom binding sites appearing within the Protein Data Bank (PDB) with the new analysis protocol. In addition, we have determined the similarity threshold line, by using 10 structure pairs with solved free and complex structures. Finally, we extensively applied our method to newly determined hypothetical proteins, including some without annotations, and evaluated the performance of our methods.
KW - Hypothetical proteins
KW - Protein three dimensional structure
KW - Structural genomics
KW - Structure-based function prediction
UR - http://www.scopus.com/inward/record.url?scp=14144254344&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=14144254344&partnerID=8YFLogxK
U2 - 10.1110/ps.041080105
DO - 10.1110/ps.041080105
M3 - Article
C2 - 15689509
AN - SCOPUS:14144254344
SN - 0961-8368
VL - 14
SP - 711
EP - 718
JO - Protein Science
JF - Protein Science
IS - 3
ER -