Predicting PDZ Domain Mediated Protein Interactions from Structure

Shirley Hui, Xiang Xing, and Gary D. Bader

Website: http://webservice.baderlab.org/domains/POW/

Background

PDZ domains are peptide recognition domains that are involved in important biological processes and bind their targets through the recognition of simple linear motifs. The recent availability of high throughput PDZ domain peptide interaction data has prompted the development of sequence based predictors of PDZ domain peptide interactions. However, the performance of these predictors depends on how similar in sequence a given domain is to the training domains. On the other hand, domain structure features are known to play roles in determining PDZ domain binding specificity and can also be used for training. When used for proteome scanning, such a predictor may be able to predict more novel interactions and increase the coverage of PDZ domain mediated protein protein interactions that can be currently predicted.

Results

We developed a structure based predictor of PDZ domain peptide interactions. We use domain structure features for training which are known to facilitate protein folding and stability and protein interactions. We also computationally generate additional negative interactions for training and show that this reduces the number of potential false positives returned by the predictor. Through multiple cross validation strategies and a series of blind tests we show that the predictor is estimated to have improved generalization performance and can correctly predict interactions in different organisms. Through proteome scanning in human we show that the structure based predictions correspond to known PDZ domain peptide interactions and known protein protein interactions in curated databases. We also show that a large number of validated hits are only predicted by the structure-based predictor, representing a 43% increase in PDZ domain mediated PPIs that could be predicted before. A functional enrichment of our hits is used to create a map of PDZ domain biology. This map highlights PDZ domain involvement in diverse biological processes and some are only found by the structure-based predictor.

SVM Predictions

SVM predictions were validated using known interactions from PDZBase, a domain peptide interaction database and known protein-protein interactions (PPIs) from iRefIndex. iRefIndex is a PPI database which consolidates PPIs from different databases including BIND, BioGRID, CORUM, DIP, HPRD, IntAct, MINT.

The following are SVM proteome scanning structure-based and sequence-based predictions for human, fly and worm PDZ domains.

Organism

Structure-based

Sequence-based

Human

Human 218 (zip)

Human 241 (zip)

Fly

Fly 7 (zip)

Fly 6 (zip)

Worm

Worm 6 (zip)

Worm 6 (zip)

The format of the output files is:

Supplementary Information

Source Code

Team


CategoryHomepage

Data/StructurePDZProteomeScanning (last edited 2012-01-03 17:56:47 by ShirleyHui)

MoinMoin Appliance - Powered by TurnKey Linux