Overview

Motivation

Challenges

Methodologies

Achievements

Appendix

Overview

e-Protein is a BBSRC/DTI Grid pilot project entitled "A Distributed Pipeline for Structure-based Proteome Annotation using Grid Technology". The project involves seven groups at three locations -- Imperial College London (Profs Sternberg & Darlington), University College London (UCL) (Prof. Jones, Prof. Orengo and Dr Sorensen) and the European Bioinformatics Institute (EBI) (Prof. Thornton and Dr Birney).

The aim of e-Protein is to provide a structure-based annotation of all proteins in major sequenced genomes, linking computational and database resources at the three sites by Grid technology (Fig 1) to achieve this goal.

 

The major results of the project to date are:

  • The development and analysis of databases of proteome annotation at Imperial (3D-Genomics) and UCL (GTD and Gene3D).
  • The development of databases providing functional annotation of all proteins in Uniprot KnowledgeBase using structural information (EBI).
  • Adaptation of the Distributed Annotation System for use with protein annotations.
  • Provision of a web-based portal combining protein annotations from all project sites.
  • The development and demonstration of the JYDE software (UCL) for inter-site distributed computing, focusing on applications to proteome annotation.
  • The application of JYDE to annotate 32,000 unique protein sequences in the human genome within one day, using 500 computer processors at UCL and Imperial.
  • The development of the ICENI protocol (Imperial) to capture the workflow of proteome annotation pipelines and split the work between multiple Grid resources, providing the capability of true resource brokering.

Next >