Abstract
Deciphering Protein–Protein Interactions. Part I. Experimental Techniques and Databases
Proteins interact with each other in a highly specific manner, and protein interactions play a key role in many cellular processes; in particular, the distortion of protein interfaces may lead to the development of many diseases. To understand the mechanisms of protein recognition at the molecular level and to unravel the global picture of protein interactions in the cell, different experimental techniques have been developed. Some methods characterize individual protein interactions while others are advanced for screening interactions on a genome-wide scale. In this review we describe different experimental techniques of protein interaction identification together with various databases which attempt to classify the large array of experimental data. We discuss the main promises and pitfalls of different methods and present several approaches to verify and validate the diverse experimental data produced by high-throughput techniques.
Introduction
It is now becoming clear that protein interactions determine the outcome of most cellular processes [1–4]. Therefore, identifying and characterizing protein–protein interactions and their networks is essential for understanding the mechanisms of biological processes on a molecular level. Despite the fact that protein interactions are remarkably diverse, all protein interfaces share certain common properties. Protein interactions can be classified into different types depending on their strength (permanent and transient), specificity (specific or nonspecific), the location of interacting partners within one or on two polypeptide chains, and the similarity between interacting subunits (homo- and hetero-oligomers). It has been shown that interface types are significantly different in amino acid composition so that it is possible to predict the type of interaction interface from amino acid composition alone [5]. Earlier structural analysis of interfaces showed that most interfaces consist of completely buried cores surrounded by partially accessible rims [6,7] with an overall size of about 1600 ± 400 Å² (a “standard size” patch) [8]. It has been found that certain amino acids are preferred on protein interfaces and that the amino acid composition of the core differs considerably from the rim [6,7,9,10]. More recent models suggested that the protein binding site consists of a few independent highly packed regions, so-called “hot spots,” which contribute significantly to the free energy of binding [11–13]. Hot spots were found to be structurally conserved [14], and the energetics of interactions at the hot spots have been analyzed in several studies [15–18].
In many cellular processes, proteins recognize specific targets and bind them in a highly regular manner. The specificity of interactions in these cases is determined by structural and physico–chemical properties of the two interacting proteins. As a result, there should be a certain degree of conservation in the interaction patterns between similar proteins and domains. Indeed, it has been found that close homologs almost always interact in the same way and protein–protein interactions place certain evolutionary constraints on protein sequence and structural divergence [19–24]. Recent studies confirm that the total number of interaction types or modes is limited and rather small [25–27]. On the other hand, remotely related proteins/domains can have different interaction modes [21,26,28]; and the conservation of such protein interfaces is similar to the average conservation of the rest of the protein [29–32].
In this review and its companion review in the April issue [33], we attempt to classify and systemize the array of experimental and theoretical data on the identification and prediction of protein interactions. In this review we focus on the generic experimental techniques for identifying protein interactions and the databases storing the information obtained from these experiments. In the second review, we present different methods to predict protein and domain interactions and discuss various challenges faced in this field with respect to limited prediction accuracy.
Experimental Methods for Identifying and Characterizing Protein Interactions
Protein interactions can be analyzed by different genetic, biochemical, and physical methods, which are listed in Table 1 and shown in Figure 1. Some techniques enable screening of a large number of proteins in a cell, such as yeast two-hybrid (Y2H), tandem affinity purification (TAP), mass spectroscopy (MS), DNA and protein microarrays, synthetic lethality, and phage display. Other methods focus on monitoring and characterizing specific biochemical and physico–chemical properties of a protein complex.
Table 1
Yeast two-hybrid method.
The development of the Y2H technique has considerably accelerated the screening of protein interactions in vivo. Y2H is based on the fact that many eukaryotic transcription activators have at least two distinct domains, one that directs binding to a promoter DNA sequence (BD) and another that activates transcription (AD) (Figure 1A). It was demonstrated that splitting BD and AD inactivates transcription, but transcription can be restored if a DNA-binding domain is physically (not necessarily covalently) associated with an activating domain [34]. According to the Y2H method, a protein of interest is fused to BD (bait). This chimeric protein is cloned in an expression plasmid, which is then transfected into a yeast cell. A similar procedure creates a chimeric sequence of another protein fused to AD (prey). If the two proteins physically interact, the reporter gene is activated. The most broadly used Y2H systems are GAL4/LexA-based, where the GAL4 protein controls the expression in yeast of the LacZ gene encoding beta-galactosidase. Numerous variations of Y2H have been developed including systems with several reporter genes, one-hybrid and three-hybrid systems for identifying protein interactions with DNA and RNA [35–38], systems for detecting interactions in mammalian and prokaryotic cells, and systems for screening the interactions between membrane proteins [39–43].
For screening entire genomes, the Y2H method has been advanced into two main approaches [44–46]: matrix-based and library-based.
In the matrix approach, a matrix of prey clones is created where each clone expresses a particular prey protein in one well of a plate. Then each bait strain is mated with an array of prey strains and those diploids where two chimeric proteins interact are selected based on the expression of a reporter gene and the position on a plate.
In the library approach, each bait is screened against an undefined prey library containing random cDNA fragments or open reading frames (ORFs). Diploid positives are selected based on their ability to grow on specific substrates; and interacting proteins are determined by DNA sequencing. The first two genome-wide analyses of the yeast “interactome” revealed 692 and 841 putative interactions, respectively [47,48]. The overlap between these two experimental studies was quite small; both methods shared only 141 interactions, about 20% of the interaction data [48]. Recently, Y2H has been used to identify interactions in worm [2], fly [1], and human [49,50].
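The overlap between two screens can be checked with a few lines of set arithmetic. The sketch below is only an illustration: it assumes each interaction is an unordered pair of protein identifiers, the toy screens and the names `P1` to `P8` are hypothetical, and only the counts 141, 692, and 841 come from the text.

```python
def canonical(pairs):
    """Treat each interaction as an unordered pair so (A, B) equals (B, A)."""
    return {frozenset(p) for p in pairs}


def overlap_fraction(screen_a, screen_b):
    """Return (shared count, fraction of A that is shared, fraction of B that is shared)."""
    a, b = canonical(screen_a), canonical(screen_b)
    shared = len(a & b)
    return shared, shared / len(a), shared / len(b)


# Hypothetical toy screens; the protein names are placeholders.
screen_1 = [("P1", "P2"), ("P2", "P3"), ("P4", "P5")]
screen_2 = [("P2", "P1"), ("P4", "P5"), ("P6", "P7"), ("P3", "P8")]
shared, frac_1, frac_2 = overlap_fraction(screen_1, screen_2)

# The same arithmetic applied to the counts reported in the text.
reported_shared, size_1, size_2 = 141, 692, 841
reported_frac_1 = reported_shared / size_1  # about 0.20
reported_frac_2 = reported_shared / size_2  # about 0.17
```

Normalizing by the smaller dataset gives the roughly 20% figure quoted above; normalizing by the larger one gives a somewhat lower value, so it is worth stating which denominator is used when reporting overlap.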
The small overlap between Y2H experiments can be explained by different factors, among them: differences in protein interaction sampling, Y2H bias towards nonspecific interactions [51], and limitations of the Y2H method itself. For example, proteins initiating transcription by themselves cannot be targeted in Y2H experiments; and the use of sequence chimeras can impose difficulties since fusion can change the structure of a target protein. In addition, protein folding and posttranslational modifications can differ between yeast and other organisms. This makes it difficult to use Y2H to screen proteins from mammalian and prokaryotic cells, as well as cytoplasmic and membrane proteins. To validate the quality of Y2H protein interactions in vivo, different in vitro techniques can be used.
Mass spectroscopy.
MS is a powerful method of studying macromolecular interactions in vitro. The principle of the MS method is to produce ions which can be detected based on their mass-to-charge ratios, thereby allowing the identification of polypeptide sequences [36,52,53] (Figure 1B). The problem of converting protein/peptide molecules from the condensed phase into ions in the gas phase is solved by using Electrospray Ionization (ESI) [54] and Matrix Assisted Laser Desorption Ionization (MALDI) [55,56]. Different algorithms have been developed to analyze mass spectra and to identify proteins by their sequence [57–60]. Some of them find correlations between theoretical and experimental spectra while others use de novo algorithms to infer peptide sequences from theoretical interpretation of the mass spectra. Despite the usefulness of MS for the characterization of interacting proteins, purification of protein complexes turns out to be the limiting step of their identification. To address this, TAP has been developed.
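As a small illustration of how a theoretical mass-to-charge ratio is obtained for comparison with an observed peak, the following sketch computes m/z for a peptide sequence. It assumes positive-mode, fully protonated ions and standard monoisotopic residue masses; the function names and the matching tolerance are illustrative choices and do not correspond to any of the cited algorithms.

```python
# Standard monoisotopic residue masses in daltons, rounded to five decimals.
RESIDUE_MASS = {
    "G": 57.02146, "A": 71.03711, "S": 87.03203, "P": 97.05276,
    "V": 99.06841, "T": 101.04768, "C": 103.00919, "L": 113.08406,
    "I": 113.08406, "N": 114.04293, "D": 115.02694, "Q": 128.05858,
    "K": 128.09496, "E": 129.04259, "M": 131.04049, "H": 137.05891,
    "F": 147.06841, "R": 156.10111, "Y": 163.06333, "W": 186.07931,
}
WATER = 18.01056   # one water is added for the free N- and C-termini
PROTON = 1.00728   # mass of the charge carrier assumed here


def peptide_mass(sequence):
    """Neutral monoisotopic mass of an unmodified linear peptide."""
    return sum(RESIDUE_MASS[aa] for aa in sequence) + WATER


def mass_to_charge(sequence, charge):
    """Theoretical m/z of the peptide carrying `charge` protons."""
    if charge < 1:
        raise ValueError("charge must be a positive integer")
    return (peptide_mass(sequence) + charge * PROTON) / charge


def matches(sequence, observed_mz, charge, tolerance=0.01):
    """True if the theoretical m/z lies within `tolerance` of the observed value."""
    return abs(mass_to_charge(sequence, charge) - observed_mz) <= tolerance
```

Database-search approaches compare many such theoretical values (for fragment ions as well as intact peptides) against an experimental spectrum; this sketch covers only the intact-peptide case.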
TAP method of complex purification.
A TAP tag consists of two IgG binding domains of Staphylococcus protein A and a calmodulin binding peptide separated by the tobacco etch virus protease cleavage site [61,62] (Figure 1C). A target protein open reading frame (ORF) is fused with the DNA sequences encoding the TAP tag and is expressed in yeast where it can form native complexes with other proteins. At the first step of the TAP purification, protein A binds tightly to an IgG matrix; and after washing out the contaminants, the protease cleaves the link between protein A and IgG matrix. The eluate of this first step is then incubated with calmodulin-coated beads in the presence of calcium. After washing, the target protein complex is released. The components of each complex are screened by polyacrylamide gel electrophoresis, cleaved by proteases, and the fragments are identified by MS. Comparing Y2H and TAP–MS, it should be noted that both methods generate a lot of false positives and miss a lot of known interactions. Y2H has the advantages of being an in vivo technique and of detecting transient interactions. In contrast, TAP–MS can report on higher-order interactions beyond binary and, therefore, provides direct information on protein complexes.
Several large-scale studies of protein complexes have been performed using TAP–MS and Y2H methods [2,4,63,64]. For example, Krogan et al. showed that 7,123 protein interactions identified with high confidence in yeast can be clustered into 547 protein complexes [3].
Gene co-expression.
Since the function of a protein complex depends on the functionality of all subunits, subunits should be present in stoichiometric amounts and gene expression levels of subunits in a complex should be related. Gene expression profiles can be provided, for example, from cell cycle experiments and expression levels of a gene under different conditions. Expression profile similarity can be calculated as a correlation coefficient between relative expression levels of two genes/proteins or the normalized difference between their absolute expression levels or calculated using other methods [65–69] (Figure 1D). The distribution of these quantities for target proteins then can be compared with the distributions for random noninteracting protein pairs. It was shown that the most obvious coexpression comes from permanent complexes such as ribosome and proteasome [65]. Several studies have tackled the problem of gene co-expression and demonstrated that interacting proteins in yeast are more likely to have their genes coexpressed compared with noninteracting proteins [65,70–77]. Moreover, it was shown that expression levels of physically interacting proteins coevolve, and coevolution of gene expression can be a better predictor of protein interactions than coevolution of amino acid sequences [78]. To infer the interactions between the genes, the DNA microarray methodology can be successfully used in conjunction with the synthetic lethality method.
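One of the similarity measures mentioned above, the correlation coefficient between two expression profiles, can be sketched as follows. This is a minimal illustration that assumes the Pearson coefficient and profiles measured under the same ordered set of conditions; the gene names and values are made up.

```python
import math


def pearson(x, y):
    """Pearson correlation between two equally long expression profiles."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("profiles must have the same length (at least 2)")
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant profile")
    return cov / math.sqrt(var_x * var_y)


# Hypothetical relative expression levels across four conditions.
gene_a = [1.0, 2.0, 3.0, 4.0]
gene_b = [2.1, 3.9, 6.2, 8.0]   # rises with gene_a
gene_c = [4.0, 3.0, 2.0, 1.0]   # falls as gene_a rises

r_ab = pearson(gene_a, gene_b)
r_ac = pearson(gene_a, gene_c)
```

Computing this value for pairs of interest and for random noninteracting pairs yields the two distributions whose comparison is described in the text.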
Synthetic lethality method.
It is not very well-understood how genetic variation influences phenotype and how genes interact with each other producing different phenotypes in different strains of the same species [77,78]. These problems can be addressed by using various genetic interaction methods, the most common of which is the synthetic lethality method (Figure 1F). The synthetic lethality method produces mutations or deletions of two separate genes which are viable alone but cause lethality when combined together in a cell under certain conditions [78–83]. Since these mutations are lethal, they cannot be isolated directly and should be synthetically constructed. Synthetic interaction can point to the possible physical interaction between two gene products, their participation in a single pathway, or a similar function. For example, synthetic lethality experiments enabled the prediction of the unknown function of the YLL049W gene as belonging to the dynein–dynactin pathway, and the bridging together of the two pathways of the parallel mitotic exit network and the Cdc14 early anaphase release pathway [83].
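The defining condition of a synthetic lethal pair (each single mutant viable, the double mutant not) can be written down directly. The sketch below assumes viability has already been scored as true or false; the data layout and the gene names are hypothetical.

```python
from itertools import combinations


def synthetic_lethal_pairs(single_viable, double_viable):
    """Return gene pairs whose single mutants are viable but whose double mutant is not.

    single_viable: dict mapping gene -> bool
    double_viable: dict mapping frozenset({gene1, gene2}) -> bool
    """
    hits = []
    for g1, g2 in combinations(sorted(single_viable), 2):
        pair = frozenset((g1, g2))
        if pair not in double_viable:
            continue  # this combination was not tested
        if single_viable[g1] and single_viable[g2] and not double_viable[pair]:
            hits.append((g1, g2))
    return hits


# Hypothetical viability scores.
singles = {"geneA": True, "geneB": True, "geneC": False}
doubles = {
    frozenset(("geneA", "geneB")): False,  # viable alone, lethal together
    frozenset(("geneA", "geneC")): False,  # geneC is already lethal on its own
}
pairs = synthetic_lethal_pairs(singles, doubles)
```

Excluding genes that are lethal on their own keeps essential genes from being reported as synthetic interactions.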
Monitoring specific protein interactions.
The most detailed information about protein interaction interfaces at the atomic level can be provided by X-ray crystallography and NMR spectroscopy, but the number of solved protein complexes remains low [84]. At the same time, the real-time characterization of interacting proteins in vivo can be achieved with various spectroscopic techniques requiring the attachment of a spectroscopic label to a target protein [87,88] (Table 1). A powerful technique in this respect is fluorescence resonance energy transfer (FRET), which can occur only if two fluorophores are located close to each other [89]. Another effective method, surface plasmon resonance (SPR), does not require spectroscopic labeling and can detect interactions between soluble ligands and immobilized receptors [90,91]; while the isothermal titration calorimetry (ITC) technique allows for direct measurement of the enthalpy of binding [92]. Recently, new methods have been developed to analyze protein interactions at the single-molecule level. For example, atomic force microscopy can fairly accurately measure interaction forces [93] while fluorescence techniques can characterize conformational changes in proteins upon binding [94].
Protein interaction networks derived from experiments.
The fast development of experimental techniques for protein interactions has enabled the construction and systematic analysis of interaction networks [1,2,95]. Interaction maps obtained for one species can be used to predict interaction networks in other species, to identify functions of unknown proteins, and to get insight into the evolution of protein interaction patterns. The interaction map analyses and comparisons are based on the observation that many interactions are conserved among species (“interologs”) [46]. Sequence-based searches for “interologs” were able to identify 16%–31% of true “interologs” (tested using Y2H system) even between remotely related species such as yeast and worm [96]. Analysis of conservation in the networks produced by gene co-expression data revealed that interologs correspond to the functionally related genes responsible for core biological processes [77]. Moreover, a multiple-species network has been constructed by identifying pairs of genes with correlated expression in different organisms. A multiple-species network has been shown to perform better than a single-species network in linking together functionally related genes.
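The idea of transferring interactions between species through "interologs" can be sketched in a few lines. The example assumes an ortholog mapping is already available (how orthologs are assigned is outside the sketch), and the identifiers `Y1`, `W1`, and so on are placeholders rather than real gene names.

```python
def predict_interologs(source_interactions, orthologs):
    """Map interactions from a source species onto a target species.

    source_interactions: iterable of (protein_a, protein_b) in the source species
    orthologs: dict mapping a source protein -> set of target-species orthologs
    Returns a set of unordered candidate pairs in the target species.
    """
    predicted = set()
    for a, b in source_interactions:
        for target_a in orthologs.get(a, ()):
            for target_b in orthologs.get(b, ()):
                predicted.add(frozenset((target_a, target_b)))
    return predicted


# Hypothetical source-species interactions and ortholog assignments.
source = [("Y1", "Y2"), ("Y2", "Y3")]
ortholog_map = {"Y1": {"W1"}, "Y2": {"W2a", "W2b"}}  # Y3 has no ortholog
candidates = predict_interologs(source, ortholog_map)
```

The output is a list of candidates only; as the 16%–31% figure in the text indicates, such predictions still need experimental testing.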
Verification of protein interactions.
Validation of protein interaction data is difficult; except for small datasets on protein interactions provided by the Protein Data Bank (PDB) [84] and the Munich Information Center for Protein Sequences (MIPS) [97], there is no comprehensive gold standard interaction set. Several methods have been proposed for verification of protein interaction data [66,67,76,98,99], and some of them are described here.
Expression profile reliability method (EPR) [66] is based on the observation that interacting proteins are coexpressed. Two distributions of expression distances are defined for noninteracting and reliably interacting proteins. The distribution of expression distances for a protein set of interest is assumed to be a linear combination of two predefined distributions with the linear coefficient that characterizes the accuracy of a given dataset.
Paralogous verification method (PVM) [66] is based on the observation that if two proteins interact, their paralogs most likely interact. It gives more reliability to the interaction of two families that contain a greater number of interactions between paralogous proteins. This method identified ~40% true interactions at a 1% error rate.
Protein localization method (PLM) [98] defines true positives as interacting proteins that are localized in the same cellular compartment and/or interacting proteins that are annotated to have a common cellular role. PLM showed that the accuracy of experimental data strongly depends on the method with up to 50% true positives detected in Y2H experiments and up to 100% true positives detected in immunoprecipitation experiments [100].
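The linear-combination idea behind the EPR method described above can be illustrated numerically. In the sketch below the three distributions are represented as histograms over the same bins, and the coefficient is obtained by ordinary least squares; the fitting procedure, the function name, and the histogram values are assumptions made for illustration and are not taken from [66].

```python
def estimate_mixing_coefficient(observed, interacting, noninteracting):
    """Least-squares alpha in: observed ~ alpha * interacting + (1 - alpha) * noninteracting.

    All three arguments are histograms (lists of bin frequencies) over the same bins.
    """
    if not (len(observed) == len(interacting) == len(noninteracting)):
        raise ValueError("histograms must use the same bins")
    numerator = sum(
        (o - n) * (i - n) for o, i, n in zip(observed, interacting, noninteracting)
    )
    denominator = sum((i - n) ** 2 for i, n in zip(interacting, noninteracting))
    if denominator == 0:
        raise ValueError("reference distributions are identical")
    alpha = numerator / denominator
    return min(1.0, max(0.0, alpha))  # keep the estimate inside [0, 1]


# Hypothetical reference histograms of expression distance.
interacting_ref = [0.50, 0.30, 0.15, 0.05]
noninteracting_ref = [0.10, 0.20, 0.30, 0.40]

# A synthetic dataset built with 30% reliable interactions.
dataset = [0.3 * i + 0.7 * n for i, n in zip(interacting_ref, noninteracting_ref)]
alpha = estimate_mixing_coefficient(dataset, interacting_ref, noninteracting_ref)
```

Here the recovered coefficient plays the role of the dataset accuracy: the closer it is to 1, the more the dataset resembles the reliably interacting reference set.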
Protein and domain interaction databases.
A large variety of databases exists to study binary protein interactions and the higher order interactions in protein complexes. A summary of some available databases is given in Tables 2 and 3. Different databases contain interactions obtained by direct submission from experimentalists and by mining literature and other data sources; in some cases the data is verified using automated algorithms or manual curation. In addition to direct detection of physical protein interactions, indirect methods can be used to predict the functional association between proteins or to predict the location of the interaction interface itself. There is indeed a wide range of detail characterizing the interactions available from different databases. For example, Y2H data gives the identity of interacting proteins, electron microscopy provides relative positional information of interacting proteins, and crystallography provides full atomic detail of interaction surfaces. In addition, interacting proteins can be studied either as complete units or by domains used as the units of interaction. Consequently, in this review we group all databases into protein and domain-related databases.
Table 2
Table 3
In spite of the interaction data diversity, there exist considerable overlaps in the datasets contained in the databases, making it difficult to recommend a single resource for a particular type of information. In one effort to deal with this redundancy, the International Molecular Exchange Consortium (IMEx) has been formed in which databases agree to share their data in a consistent and timely fashion (Table 2). In addition, a standard data model has been proposed for the representation and exchange of protein interaction data [101]. A few example databases from Table 2 will now be highlighted to illustrate different types of interaction data available.
Protein Interaction Databases
Database of Interacting Proteins.
The Database of Interacting Proteins (DIP) contains experimentally determined protein interactions and includes a core subset of interactions that have passed a quality assessment [102]. Interaction data are obtained from the literature; PDB; and high-throughput methods such as Y2H, DNA and protein microarrays; and TAP–MS analysis of protein complexes. Several methods are employed to assess the quality of interaction data and are offered as a service for query interactions. DIP has links to a couple of related databases including LiveDIP, which records information about the state of a biological interaction, such as covalently modified, conformational, or cellular location states [103]. Another database related to DIP is Prolinks, which brings together four methods of linking proteins: phylogenetic profiles, Rosetta Stone, gene neighbors, and gene clusters [104]. The database includes a Proteome Navigator tool to browse the linkages and view accompanying data.
Biomolecular Interaction Network Database.
The Biomolecular Interaction Network Database (BIND) includes high-throughput experimental datasets and protein complexes from PDB [105,106]. It contains a variety of curated experimental data. A generalized data specification handles not only various types of protein interaction data, but also protein–small molecule interactions and protein–nucleic acid interactions. An interaction viewer is provided to browse the interaction space. BIND also can distinguish different functional types of interactions.
Munich MPact/MIPS database.
MPact is a resource to access MIPS, which contains a manually curated yeast protein interaction dataset [97] collected by curators from the literature. The resource also includes high-throughput results for yeast, but keeps this data separate. MIPS is often used as a standard of truth database for evaluating the quality of data and the accuracy of interaction prediction methods.
Domain Interaction Databases
PIBASE database.
PIBASE is a database of domain interactions from the protein structure data [107]. It uses SCOP and CATH domain definitions to find putative domain interactions. Several methods are employed to remove redundancy in structural data; for example, structural comparisons of interfaces are made between domains within one structure. The database combines physicochemical properties of protein binding sites and has a link to MODBASE [108], containing models of three-dimensional structures that allow use of PIBASE for modeling of putative domain interfaces.
3did database.
3did allows one to explore the details of domain interactions from protein structure data (yeast interactions are also included) [109]. For each domain, an overview is given of all its interactions with other domains, showing different interaction types. In some cases, dot plots of structural comparisons between interaction interfaces show the variance of the interactions between pairs of domain families. Database entries are also supplied with the GO-based functional annotations. InterPreTS is a Web-based service associated with 3did that predicts domain interactions based on sequence homology of query proteins to a database of interacting domains (DBID) [21].
Conserved Binding Mode database.
The Conserved Binding Mode (CBM) database is a collection of domain interactions from the structure data where domains are defined by the Conserved Domain Database [110]. Unlike other structure-based databases, domain interactions are grouped by geometry into conserved interaction modes for each pair of domain families across all PDB structures [26]. Structural superpositions are used to infer CBMs from different members of interacting domain families docking in the same way. Such domain interactions with recurring structural themes have greater significance to be biologically relevant, unlike spurious crystal packing interactions. CBMs can also assist in analyzing protein interaction network topology by emphasizing connections made in a biological context. Finally, the CBM database can be used to categorize the specific interaction surfaces that have evolved from conserved domains and thereby allows for the homology modeling of protein interaction interfaces. A similar approach for grouping interaction patterns for SCOP domains was recently undertaken with the SCOPPI database [111].
Domain Interaction Map database.
The Domain Interaction Map (DIMA) database is a domain interaction map derived from phylogenetic profiling of Pfam domains [97]. Instead of looking at entire protein sequences, the algorithm compares the occurrences of domains across genomes and associates similar patterns of occurrences with functional associations. The method works well for domains with moderate information content that have distinct phylogenetic profiles.
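Comparing domain occurrence patterns across genomes can be illustrated with binary presence/absence profiles. The sketch below uses the Jaccard index as the similarity measure; this choice, the domain names, and the six-genome profiles are assumptions for illustration and are not necessarily the scoring used by DIMA.

```python
def jaccard(profile_a, profile_b):
    """Jaccard similarity of two presence/absence profiles over the same genomes."""
    if len(profile_a) != len(profile_b):
        raise ValueError("profiles must cover the same genomes")
    both = sum(1 for a, b in zip(profile_a, profile_b) if a and b)
    either = sum(1 for a, b in zip(profile_a, profile_b) if a or b)
    return both / either if either else 0.0


# Hypothetical profiles: 1 means the domain occurs in that genome.
profiles = {
    "domX": [1, 1, 0, 1, 0, 0],
    "domY": [1, 1, 0, 1, 0, 1],
    "domZ": [0, 0, 1, 0, 1, 0],
}

sim_xy = jaccard(profiles["domX"], profiles["domY"])  # similar occurrence pattern
sim_xz = jaccard(profiles["domX"], profiles["domZ"])  # never occur together
```

A domain present in every genome, or in almost none, gives a profile that matches many others by chance, which is consistent with the remark above that the method works best for domains with moderate information content.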
In this paper we have reviewed a wide spectrum of experimental techniques for identifying and characterizing protein interactions; each technique can provide a piece in the puzzle of mechanisms of protein recognition [112]. Despite enormous efforts in this field, the overall picture is still incomplete, which is not surprising given the enormous complexity of a cell. Indeed, proteins can behave differently in different parts of the cell, and many proteins form transient complexes that are difficult to identify. Moreover, evolutionarily conserved proteins have much better coverage in experiments than the proteins restricted to a certain organism. The low coverage together with the small overlap between different experimental methods calls for the development of theoretical approaches for interaction data verification and prediction, the topic we address in our companion review [33].
Acknowledgments
The authors thank Lewis Geer for helpful discussions and Robert Yates for graphic design of the figures. This work was supported by the Intramural Research Program of the National Library of Medicine at the National Institutes of Health of the US Department of Health and Human Services.
Abbreviations
AD | domain that activates transcription |
BD | domain that directs binding to a promoter DNA sequence |
BIND | Biomolecular Interaction Network Database |
CBM | Conserved Binding Mode database |
DIP | Database of Interacting Proteins |
MS | mass spectroscopy |
TAP | tandem affinity purification |
Y2H | yeast two-hybrid |
Footnotes
Benjamin A. Shoemaker and Anna R. Panchenko are with the Computational Biology Branch of the National Center for Biotechnology Information in Bethesda, Maryland, United States of America.
Competing interests. The authors have declared that no competing interests exist.
Funding. The authors received no specific funding for this article.
Author contributions. BAS and ARP analyzed the data and wrote the paper.
References
- Giot L, Bader JS, Brouwer C, Chaudhuri A, Kuang B, et al. A protein interaction map of Drosophila melanogaster . Science. 2003;302:1727–1736. [Abstract] [Google Scholar]
- Li S, Armstrong CM, Bertin N, Ge H, Milstein S, et al. A map of the interactome network of the metazoan C. elegans . Science. 2004;303:540–543. [Europe PMC free article] [Abstract] [Google Scholar]
- Krogan NJ, Cagney G, Yu H, Zhong G, Guo X, et al. Global landscape of protein complexes in the yeast Saccharomyces cerevisiae . Nature. 2006;440:637–643. [Abstract] [Google Scholar]
- Gavin AC, Aloy P, Grandi P, Krause R, Boesche M, et al. Proteome survey reveals modularity of the yeast cell machinery. Nature. 2006;440:631–636. [Abstract] [Google Scholar]
- Ofran Y, Rost B. Analysing six types of protein–protein interfaces. J Mol Biol. 2003;325:377–387. [Abstract] [Google Scholar]
- Bogan AA, Thorn KS. Anatomy of hot spots in protein interfaces. J Mol Biol. 1998;280:1–9. [Abstract] [Google Scholar]
- Chakrabarti P, Janin J. Dissecting protein–protein recognition sites. Proteins. 2002;47:334–343. [Abstract] [Google Scholar]
- Lo Conte L, Chothia C, Janin J. The atomic structure of protein–protein recognition sites. J Mol Biol. 1999;285:2177–2198. [Abstract] [Google Scholar]
- Jones S, Thornton JM. Analysis of protein–protein interaction sites using surface patches. J Mol Biol. 1997;272:121–132. [Abstract] [Google Scholar]
- Guharoy M, Chakrabarti P. Conservation and relative importance of residues across protein–protein interfaces. Proc Natl Acad Sci U S A. 2005;102:15447–15452. [Europe PMC free article] [Abstract] [Google Scholar]
- DeLano WL. Unraveling hot spots in binding interfaces: Progress and challenges. Curr Opin Struct Biol. 2002;12:14–20. [Abstract] [Google Scholar]
- Keskin O, Ma B, Rogale K, Gunasekaran K, Nussinov R. Protein–protein interactions: Organization, cooperativity and mapping in a bottom-up Systems Biology approach. Phys Biol. 2005;2:S24–S35. [Abstract] [Google Scholar]
- Res I, Lichtarge O. Character and evolution of protein–protein interfaces. Phys Biol. 2005;2:S36–S43. [Abstract] [Google Scholar]
- Ma B, Elkayam T, Wolfson H, Nussinov R. Protein–protein interactions: Structurally conserved residues distinguish between binding sites and exposed protein surfaces. Proc Natl Acad Sci U S A. 2003;100:5772–5777. [Europe PMC free article] [Abstract] [Google Scholar]
- Sheinerman FB, Honig B. On the role of electrostatic interactions in the design of protein–protein interfaces. J Mol Biol. 2002;318:161–177. [Abstract] [Google Scholar]
- Kortemme T, Baker D. A simple physical model for binding energy hot spots in protein–protein complexes. Proc Natl Acad Sci U S A. 2002;99:14116–14121. [Europe PMC free article] [Abstract] [Google Scholar]
- Fernandez A, Scheraga HA. Insufficiently dehydrated hydrogen bonds as determinants of protein interactions. Proc Natl Acad Sci U S A. 2003;100:113–118. [Europe PMC free article] [Abstract] [Google Scholar]
- Kundrotas PJ, Alexov E. Electrostatic properties of protein–protein complexes. Biophys J. 2006;91:1724–1736. [Europe PMC free article] [Abstract] [Google Scholar]
- Valdar WS, Thornton JM. Protein–protein interfaces: Analysis of amino acid conservation in homodimers. Proteins. 2001;42:108–124. [Abstract] [Google Scholar]
- Nooren IM, Thornton JM. Structural characterisation and functional significance of transient protein–protein interactions. J Mol Biol. 2003;325:991–1018. [Abstract] [Google Scholar]
- Aloy P, Ceulemans H, Stark A, Russell RB. The relationship between sequence and interaction divergence in proteins. J Mol Biol. 2003;332:989–998. [Abstract] [Google Scholar]
- Littler SJ, Hubbard SJ. Conservation of orientation and sequence in protein domain–domain interactions. J Mol Biol. 2005;345:1265–1279. [Abstract] [Google Scholar]
- Teichmann SA. The constraints protein–protein interactions place on sequence divergence. J Mol Biol. 2002;324:399–407. [Abstract] [Google Scholar]
- Panchenko AR, Wolf YI, Panchenko LA, Madej T. Evolutionary plasticity of protein families: Coupling between sequence and structure variation. Proteins. 2005;61:535–544. [Europe PMC free article] [Abstract] [Google Scholar]
- Aloy P, Russell RB. Ten thousand interactions for the molecular biologist. Nat Biotechnol. 2004;22:1317–1321. [Abstract] [Google Scholar]
- Shoemaker BA, Panchenko AR, Bryant SH. Finding biologically relevant protein domain interactions: Conserved binding mode analysis. Protein Sci. 2006;15:352–361. [Europe PMC free article] [Abstract] [Google Scholar]
- Kim WK, Henschel A, Winter C, Schroeder M. The many faces of protein–protein interactions: A compendium of interface geometry. PLoS Comput Biol. 2006;2:e124. [Europe PMC free article] [Abstract] [Google Scholar]
- Keskin O, Tsai CJ, Wolfson H, Nussinov R. A new, structurally nonredundant, diverse data set of protein–protein interfaces and its implications. Protein Sci. 2004;13:1043–1055. [Europe PMC free article] [Abstract] [Google Scholar]
- Grishin NV, Phillips MA. The subunit interfaces of oligomeric enzymes are conserved to a similar extent to the overall protein sequences. Protein Sci. 1994;3:2455–2458. [Europe PMC free article] [Abstract] [Google Scholar]
- Caffrey DR, Somaroo S, Hughes JD, Mintseris J, Huang ES. Are protein–protein interfaces more conserved in sequence than the rest of the protein surface? Protein Sci. 2004;13:190–202. [Europe PMC free article] [Abstract] [Google Scholar]
- Korkin D, Davis FP, Sali A. Localization of protein-binding sites within families of proteins. Protein Sci. 2005;14:2350–2360. [Europe PMC free article] [Abstract] [Google Scholar]
- Panchenko AR, Kondrashov F, Bryant S. Prediction of functional sites by analysis of sequence and structure conservation. Protein Sci. 2004;13:884–892. [Europe PMC free article] [Abstract] [Google Scholar]
- Shoemaker BA, Panchenko AR. Deciphering protein–protein interactions. Part II. Computational methods to predict protein and domain interaction partners. PLoS Comput Biol. 2007;3:e43. [Europe PMC free article] [Abstract] [Google Scholar]
- Fields S, Song O. A novel genetic system to detect protein–protein interactions. Nature. 1989;340:245–246. [Abstract] [Google Scholar]
- Fashena SJ, Serebriiskii I, Golemis EA. The continued evolution of two-hybrid screening approaches in yeast: How to outwit different preys with different baits. Gene. 2000;250:1–14. [Abstract] [Google Scholar]
- Causier B. Studying the interactome with the yeast two-hybrid system and mass spectrometry. Mass Spectrom Rev. 2004;23:350–367. [Abstract] [Google Scholar]
- Auerbach D, Thaminy S, Hottiger MO, Stagljar I. The post-genomic era of interactive proteomics: Facts and perspectives. Proteomics. 2002;2:611–623. [Abstract] [Google Scholar]
- Van Criekinge W, Beyaert R. Yeast two-hybrid: State of the art. Biol Proced Online. 1999;2:1–38. [Europe PMC free article] [Abstract] [Google Scholar]
- Toby GG, Golemis EA. Using the yeast interaction trap and other two-hybrid–based approaches to study protein–protein interactions. Methods. 2001;24:201–217. [Abstract] [Google Scholar]
- Lee JW, Lee SK. Mammalian two-hybrid assay for detecting protein–protein interactions in vivo. Methods Mol Biol. 2004;261:327–336. [Abstract] [Google Scholar]
- Walhout AJ, Vidal M. High-throughput yeast two-hybrid assays for large-scale protein interaction mapping. Methods. 2001;24:297–306. [Abstract] [Google Scholar]
- Aronheim A, Zandi E, Hennemann H, Elledge SJ, Karin M. Isolation of an AP-1 repressor by a novel method for detecting protein–protein interactions. Mol Cell Biol. 1997;17:3094–3102. [Europe PMC free article] [Abstract] [Google Scholar]
- Mohler WA, Blau HM. Gene expression and cell fusion analyzed by lacZ complementation in mammalian cells. Proc Natl Acad Sci U S A. 1996;93:12423–12427. [Europe PMC free article] [Abstract] [Google Scholar]
- Bartel PL, Roecklein JA, SenGupta D, Fields S. A protein linkage map of Escherichia coli bacteriophage T7. Nat Genet. 1996;12:72–77. [Abstract] [Google Scholar]
- Finley RL, Jr, Brent R. Interaction mating reveals binary and ternary connections between Drosophila cell cycle regulators. Proc Natl Acad Sci U S A. 1994;91:12980–12984. [Europe PMC free article] [Abstract] [Google Scholar]
- Walhout AJ, Sordella R, Lu X, Hartley JL, Temple GF, et al. Protein interaction mapping in C. elegans using proteins involved in vulval development. Science. 2000;287:116–122. [Abstract] [Google Scholar]
- Uetz P, Giot L, Cagney G, Mansfield TA, Judson RS, et al. A comprehensive analysis of protein–protein interactions in Saccharomyces cerevisiae. Nature. 2000;403:623–627. [Abstract] [Google Scholar]
- Ito T, Chiba T, Ozawa R, Yoshida M, Hattori M, et al. A comprehensive two-hybrid analysis to explore the yeast protein interactome. Proc Natl Acad Sci U S A. 2001;98:4569–4574. [Europe PMC free article] [Abstract] [Google Scholar]
- Ghavidel A, Cagney G, Emili A. A skeleton of the human protein interactome. Cell. 2005;122:830–832. [Abstract] [Google Scholar]
- Rual JF, Venkatesan K, Hao T, Hirozane-Kishikawa T, Dricot A, et al. Towards a proteome-scale map of the human protein–protein interaction network. Nature. 2005;437:1173–1178. [Abstract] [Google Scholar]
- Deeds EJ, Ashenberg O, Shakhnovich EI. A simple physical model for scaling in protein–protein interaction networks. Proc Natl Acad Sci U S A. 2006;103:311–316. [Europe PMC free article] [Abstract] [Google Scholar]
- Di Tullio A, Reale S, De Angelis F. Molecular recognition by mass spectrometry. J Mass Spectrom. 2005;40:845–865. [Abstract] [Google Scholar]
- Aebersold R, Mann M. Mass spectrometry-based proteomics. Nature. 2003;422:198–207. [Abstract] [Google Scholar]
- Whitehouse CM, Dreyer RN, Yamashita M, Fenn JB. Electrospray interface for liquid chromatographs and mass spectrometers. Anal Chem. 1985;57:675–679. [Abstract] [Google Scholar]
- Pieles U, Zurcher W, Schar M, Moser HE. Matrix-assisted laser desorption ionization time-of-flight mass spectrometry: A powerful tool for the mass and sequence analysis of natural and modified oligonucleotides. Nucleic Acids Res. 1993;21:3191–3196. [Europe PMC free article] [Abstract] [Google Scholar]
- Karas M, Hillenkamp F. Laser desorption ionization of proteins with molecular masses exceeding 10,000 daltons. Anal Chem. 1988;60:2299–2301. [Abstract] [Google Scholar]
- Yates JR, III, Eng JK, McCormack AL, Schieltz D. Method to correlate tandem mass spectra of modified peptides to amino acid sequences in the protein database. Anal Chem. 1995;67:1426–1436. [Abstract] [Google Scholar]
- Taylor JA, Johnson RS. Sequence database searches via de novo peptide sequencing by tandem mass spectrometry. Rapid Commun Mass Spectrom. 1997;11:1067–1075. [Abstract] [Google Scholar]
- Pevzner PA, Dancik V, Tang CL. Mutation-tolerant protein identification by mass spectrometry. J Comput Biol. 2000;7:777–787. [Abstract] [Google Scholar]
- Geer LY, Markey SP, Kowalak JA, Wagner L, Xu M, et al. Open mass spectrometry search algorithm. J Proteome Res. 2004;3:958–964. [Abstract] [Google Scholar]
- Rigaut G, Shevchenko A, Rutz B, Wilm M, Mann M, et al. A generic protein purification method for protein complex characterization and proteome exploration. Nat Biotechnol. 1999;17:1030–1032. [Abstract] [Google Scholar]
- Puig O, Caspary F, Rigaut G, Rutz B, Bouveret E, et al. The tandem affinity purification (TAP) method: A general procedure of protein complex purification. Methods. 2001;24:218–229. [Abstract] [Google Scholar]
- Gavin AC, Bosche M, Krause R, Grandi P, Marzioch M, et al. Functional organization of the yeast proteome by systematic analysis of protein complexes. Nature. 2002;415:141–147. [Abstract] [Google Scholar]
- Ho Y, Gruhler A, Heilbut A, Bader GD, Moore L, et al. Systematic identification of protein complexes in Saccharomyces cerevisiae by mass spectrometry. Nature. 2002;415:180–183. [Abstract] [Google Scholar]
- Jansen R, Greenbaum D, Gerstein M. Relating whole-genome expression data with protein–protein interactions. Genome Res. 2002;12:37–46. [Europe PMC free article] [Abstract] [Google Scholar]
- Deane CM, Salwinski L, Xenarios I, Eisenberg D. Protein interactions: Two methods for assessment of the reliability of high throughput observations. Mol Cell Proteomics. 2002;1:349–356. [Abstract] [Google Scholar]
- Kemmeren P, van Berkum NL, Vilo J, Bijma T, Donders R, et al. Protein interaction verification and functional annotation by integrated analysis of genome-scale data. Mol Cell. 2002;9:1133–1143. [Abstract] [Google Scholar]
- Kim SK, Lund J, Kiraly M, Duke K, Jiang M, et al. A gene expression map for Caenorhabditis elegans. Science. 2001;293:2087–2092. [Abstract] [Google Scholar]
- Troyanskaya OG, Garber ME, Brown PO, Botstein D, Altman RB. Nonparametric methods for identifying differentially expressed genes in microarray data. Bioinformatics. 2002;18:1454–1461. [Abstract] [Google Scholar]
- Troyanskaya OG. Putting microarrays in a context: Integrated analysis of diverse biological data. Brief Bioinform. 2005;6:34–43. [Abstract] [Google Scholar]
- Bhardwaj N, Lu H. Correlation between gene expression profiles and protein–protein interactions within and across genomes. Bioinformatics. 2005;21:2730–2738. [Abstract] [Google Scholar]
- Tornow S, Mewes HW. Functional modules by relating protein interaction networks and gene expression. Nucleic Acids Res. 2003;31:6283–6289. [Europe PMC free article] [Abstract] [Google Scholar]
- Teichmann SA, Babu MM. Conservation of gene co-regulation in prokaryotes and eukaryotes. Trends Biotechnol. 2002;20:407–410. Discussion on p. 410. [Abstract] [Google Scholar]
- Ge H, Liu Z, Church GM, Vidal M. Correlation between transcriptome and interactome mapping data from Saccharomyces cerevisiae. Nat Genet. 2001;29:482–486. [Abstract] [Google Scholar]
- Grigoriev A. A relationship between gene expression and protein interactions on the proteome scale: Analysis of the bacteriophage T7 and the yeast Saccharomyces cerevisiae. Nucleic Acids Res. 2001;29:3513–3519. [Europe PMC free article] [Abstract] [Google Scholar]
- Mrowka R, Patzak A, Herzel H. Is there a bias in proteome research? Genome Res. 2001;11:1971–1973. [Abstract] [Google Scholar]
- Stuart JM, Segal E, Koller D, Kim SK. A gene-coexpression network for global discovery of conserved genetic modules. Science. 2003;302:249–255. [Abstract] [Google Scholar]
- Fraser HB, Hirsh AE, Wall DP, Eisen MB. Coevolution of gene expression among interacting proteins. Proc Natl Acad Sci U S A. 2004;101:9033–9038. [Europe PMC free article] [Abstract] [Google Scholar]
- Rutherford SL. From genotype to phenotype: Buffering mechanisms and the storage of genetic information. Bioessays. 2000;22:1095–1105. [Abstract] [Google Scholar]
- Hartman J, Garvik B, Hartwell L. Principles for the buffering of genetic variation. Science. 2001;291:1001–1004. [Abstract] [Google Scholar]
- Bender A, Pringle JR. Use of a screen for synthetic lethal and multicopy suppressee mutants to identify two new genes involved in morphogenesis in Saccharomyces cerevisiae. Mol Cell Biol. 1991;11:1295–1305. [Europe PMC free article] [Abstract] [Google Scholar]
- Ooi SL, Pan X, Peyser BD, Ye P, Meluh PB, et al. Global synthetic-lethality analysis and yeast functional profiling. Trends Genet. 2006;22:56–63. [Abstract] [Google Scholar]
- Brown JA, Sherlock G, Myers CL, Burrows NM, Deng C, et al. Global analysis of gene function in yeast by quantitative phenotypic profiling. Mol Syst Biol. 2006;2:0001. [Europe PMC free article] [Abstract] [Google Scholar]
- Tong AH, Evangelista M, Parsons AB, Xu H, Bader GD, et al. Systematic genetic analysis with ordered arrays of yeast deletion mutants. Science. 2001;294:2364–2368. [Abstract] [Google Scholar]
- Ye P, Peyser BD, Pan X, Boeke JD, Spencer FA, et al. Gene function prediction from congruent synthetic lethal interactions in yeast. Mol Syst Biol. 2005;1:0026. [Europe PMC free article] [Abstract] [Google Scholar]
- Berman HM, Bhat TN, Bourne PE, Feng Z, Gilliland G, et al. The Protein Data Bank and the challenge of structural genomics. Nat Struct Biol. 2000;7(Supplement):957–959. [Abstract] [Google Scholar]
- Lippincott-Schwartz J, Patterson GH. Development and use of fluorescent protein markers in living cells. Science. 2003;300:87–91. [Abstract] [Google Scholar]
- Piehler J. New methodologies for measuring protein interactions in vivo and in vitro. Curr Opin Struct Biol. 2005;15:4–14. [Abstract] [Google Scholar]
- Yan Y, Marriott G. Analysis of protein interactions using fluorescence technologies. Curr Opin Chem Biol. 2003;7:635–640. [Abstract] [Google Scholar]
- Karlsson R. SPR for molecular interaction analysis: A review of emerging application areas. J Mol Recognit. 2004;17:151–161. [Abstract] [Google Scholar]
- Cooper MA. Label-free screening of bio-molecular interactions. Anal Bioanal Chem. 2003;377:834–842. [Abstract] [Google Scholar]
- Velazquez Campoy A, Freire E. ITC in the post-genomic era...? Priceless. Biophys Chem. 2005;115:115–124. [Abstract] [Google Scholar]
- Yang Y, Wang H, Erie DA. Quantitative characterization of biomolecular assemblies and interactions using atomic force microscopy. Methods. 2003;29:175–187. [Abstract] [Google Scholar]
- Margittai M, Widengren J, Schweinberger E, Schroder GF, Felekyan S, et al. Single-molecule fluorescence resonance energy transfer reveals a dynamic equilibrium between closed and open conformations of syntaxin 1. Proc Natl Acad Sci U S A. 2003;100:15516–15521. [Europe PMC free article] [Abstract] [Google Scholar]
- Butland G, Peregrin-Alvarez JM, Li J, Yang W, Yang X, et al. Interaction network containing conserved and essential protein complexes in Escherichia coli. Nature. 2005;433:531–537. [Abstract] [Google Scholar]
- Matthews LR, Vaglio P, Reboul J, Ge H, Davis BP, et al. Identification of potential interaction networks using sequence-based searches for conserved protein–protein interactions or “interologs.” Genome Res. 2001;11:2120–2126. [Europe PMC free article] [Abstract] [Google Scholar]
- Guldener U, Munsterkotter M, Oesterheld M, Pagel P, Ruepp A, et al. MPact: The MIPS protein interaction resource on yeast. Nucleic Acids Res. 2006;34:D436–D441. [Europe PMC free article] [Abstract] [Google Scholar]
- Sprinzak E, Sattath S, Margalit H. How reliable are experimental protein–protein interaction data? J Mol Biol. 2003;327:919–923. [Abstract] [Google Scholar]
- Bader JS, Chaudhuri A, Rothberg JM, Chant J. Gaining confidence in high-throughput protein interaction networks. Nat Biotechnol. 2004;22:78–85. [Abstract] [Google Scholar]
- Masters SC. Co-immunoprecipitation from transfected cells. Methods Mol Biol. 2004;261:337–350. [Abstract] [Google Scholar]
- Hermjakob H, Montecchi-Palazzi L, Bader G, Wojcik J, Salwinski L, et al. The HUPO PSI's molecular interaction format—A community standard for the representation of protein interaction data. Nat Biotechnol. 2004;22:177–183. [Abstract] [Google Scholar]
- Salwinski L, Miller CS, Smith AJ, Pettit FK, Bowie JU, et al. The Database of Interacting Proteins: 2004 update. Nucleic Acids Res. 2004;32:D449–D451. [Europe PMC free article] [Abstract] [Google Scholar]
- Duan XJ, Xenarios I, Eisenberg D. Describing biological protein interactions in terms of protein states and state transitions: The LiveDIP database. Mol Cell Proteomics. 2002;1:104–116. [Abstract] [Google Scholar]
- Bowers PM, Pellegrini M, Thompson MJ, Fierro J, Yeates TO, et al. Prolinks: A database of protein functional linkages derived from coevolution. Genome Biol. 2004;5:R35. [Europe PMC free article] [Abstract] [Google Scholar]
- Alfarano C, Andrade CE, Anthony K, Bahroos N, Bajec M, et al. The Biomolecular Interaction Network Database and related tools 2005 update. Nucleic Acids Res. 2005;33:D418–D424. [Europe PMC free article] [Abstract] [Google Scholar]
- Bader GD, Hogue CW. BIND—A data specification for storing and describing biomolecular interactions, molecular complexes and pathways. Bioinformatics. 2000;16:465–477. [Abstract] [Google Scholar]
- Davis FP, Sali A. PIBASE: A comprehensive database of structurally defined protein interfaces. Bioinformatics. 2005;21:1901–1907. [Abstract] [Google Scholar]
- Pieper U, Eswar N, Davis FP, Braberg H, Madhusudhan MS, et al. MODBASE: A database of annotated comparative protein structure models and associated resources. Nucleic Acids Res. 2006;34:D291–D295. [Europe PMC free article] [Abstract] [Google Scholar]
- Stein A, Russell RB, Aloy P. 3did: Interacting protein domains of known three-dimensional structure. Nucleic Acids Res. 2005;33:D413–D417. [Europe PMC free article] [Abstract] [Google Scholar]
- Marchler-Bauer A, Panchenko AR, Shoemaker BA, Thiessen PA, Geer LY, et al. CDD: A database of conserved domain alignments with links to domain three-dimensional structure. Nucleic Acids Res. 2002;30:281–283. [Europe PMC free article] [Abstract] [Google Scholar]
- Winter C, Henschel A, Kim WK, Schroeder M. SCOPPI: A structural classification of protein–protein interfaces. Nucleic Acids Res. 2006;34:D310–D314. [Europe PMC free article] [Abstract] [Google Scholar]
- Panchenko AR, Shoemaker BA. ISMB tutorial 2006: Protein–protein interactions: Structure and systems approaches to analyze diverse genomic data. 2006. Available: http://www.ncbi.nlm.nih.gov/CBBresearch/Panchenko/ismb_tutorial2006.ppt. Accessed 16 February 2007.
- Eisen MB, Spellman PT, Brown PO, Botstein D. Cluster analysis and display of genome-wide expression patterns. Proc Natl Acad Sci U S A. 1998;95:14863–14868. [Europe PMC free article] [Abstract] [Google Scholar]
- MacBeath G, Schreiber SL. Printing proteins as microarrays for high-throughput function determination. Science. 2000;289:1760–1763. [Abstract] [Google Scholar]
- Zhu H, Bilgin M, Bangham R, Hall D, Casamayor A, et al. Global analysis of protein activities using proteome chips. Science. 2001;293:2101–2105. [Abstract] [Google Scholar]
- Jones RB, Gordus A, Krall JA, MacBeath G. A quantitative protein interaction network for the ErbB receptors using protein microarrays. Nature. 2006;439:168–174. [Abstract] [Google Scholar]
- Smith GP. Filamentous fusion phage: Novel expression vectors that display cloned antigens on the virion surface. Science. 1985;228:1315–1317. [Abstract] [Google Scholar]
- Baumeister W, Grimm R, Walz J. Electron tomography of molecules and cells. Trends Cell Biol. 1999;9:81–85. [Abstract] [Google Scholar]
- von Mering C, Jensen LJ, Snel B, Hooper SD, Krupp M, et al. STRING: Known and predicted protein–protein associations, integrated and transferred across organisms. Nucleic Acids Res. 2005;33:D433–D437. [Europe PMC free article] [Abstract] [Google Scholar]
- Zanzoni A, Montecchi-Palazzi L, Quondam M, Ausiello G, Helmer-Citterich M, et al. MINT: A molecular INTeraction database. FEBS Lett. 2002;513:135–140. [Abstract] [Google Scholar]
- Hermjakob H, Montecchi-Palazzi L, Lewington C, Mudali S, Kerrien S, et al. IntAct: An open source molecular interaction database. Nucleic Acids Res. 2004;32:D452–D455. [Europe PMC free article] [Abstract] [Google Scholar]
- Stark C, Breitkreutz BJ, Reguly T, Boucher L, Breitkreutz A, et al. BioGRID: A general repository for interaction datasets. Nucleic Acids Res. 2006;34:D535–D539. [Europe PMC free article] [Abstract] [Google Scholar]
- Peri S, Navarro JD, Amanchy R, Kristiansen TZ, Jonnalagadda CK, et al. Development of human protein reference database as an initial platform for approaching systems biology in humans. Genome Res. 2003;13:2363–2371. [Europe PMC free article] [Abstract] [Google Scholar]
- Kundrotas PJ, Alexov E. PROTCOM: Searchable database of protein complexes enhanced with domain–domain structures. Nucleic Acids Res. 2006;34(Supplement 1):D247–D251. [Europe PMC free article] [Abstract] [Google Scholar]
- Aloy P, Russell RB. InterPreTS: Protein interaction prediction through tertiary structure. Bioinformatics. 2003;19:161–162. [Abstract] [Google Scholar]
- Pieper U, Eswar N, Braberg H, Madhusudhan MS, Davis FP, et al. MODBASE, a database of annotated comparative protein structure models, and associated resources. Nucleic Acids Res. 2004;32:D217–D222. [Europe PMC free article] [Abstract] [Google Scholar]
- Finn RD, Marshall M, Bateman A. iPfam: Visualization of protein–protein interactions in PDB at domain and amino acid resolutions. Bioinformatics. 2005;21:410–412. [Abstract] [Google Scholar]
- Ng SK, Zhang Z, Tan SH, Lin K. InterDom: A database of putative interacting protein domains for validating predicted protein interactions and complexes. Nucleic Acids Res. 2003;31:251–254. [Europe PMC free article] [Abstract] [Google Scholar]
- Pagel P, Oesterheld M, Stumpflen V, Frishman D. The DIMA web resource—Exploring the protein domain network. Bioinformatics. 2006;22:997–998. [Abstract] [Google Scholar]
Articles from PLOS Computational Biology are provided here courtesy of PLOS
Full text links
Read article at publisher's site: https://doi.org/10.1371/journal.pcbi.0030042
Read article for free, from open access legal sources, via Unpaywall: https://journals.plos.org/ploscompbiol/article/file?id=10.1371/journal.pcbi.0030042&type=printable