PROTEINS: Structure, Function, and Genetics 11, 95-110 (1991)

Expert System for Predicting Protein Localization Sites in Gram-Negative Bacteria

K. Nakai & M. Kanehisa

Institute for Chemical Research, Kyoto University, Japan.

We have developed an expert system that makes use of various kinds of knowledge organized as ``if-then'' rules for predicting protein localization sites in Gram-negative bacteria, given the amino acid sequence information alone. We considered four localization sites: the cytoplasm, the inner (cytoplasmic) membrane, the periplasm, and the outer membrane. Most rules were derived from experimental observations. For example, the rule to recognize an inner membrane protein is the presence of either a hydrophobic stretch in the predicted mature protein or an uncleavable N-terminal signal sequence. Lipoproteins are first recognized by a consensus pattern and then assumed present at either the inner or outer membrane. These two possibilities are further discriminated by examining an acidic residue in the mature N-terminal portion. Furthermore, we found an empirical rule that periplasmic and outer membrane proteins were successfully discriminated by their different amino acid composition. Overall, our system could predict 83% of the localization sites of proteins in our database.

Last update October 10, 1994