Analysis and prediction of leucine-rich nuclear export signals.

Research output: Contribution to journalJournal articleResearchpeer-review

  • Tanja la Cour
  • Lars Kiemer
  • Anne Mølgaard
  • Ramneek Gupta
  • Skriver, Karen
  • Søren Brunak
We present a thorough analysis of nuclear export signals and a prediction server, which we have made publicly available. The machine learning prediction method is a significant improvement over the generally used consensus patterns. Nuclear export signals (NESs) are extremely important regulators of the subcellular location of proteins. This regulation has an impact on transcription and other nuclear processes, which are fundamental to the viability of the cell. NESs are studied in relation to cancer, the cell cycle, cell differentiation and other important aspects of molecular biology. Our conclusion from this analysis is that the most important properties of NESs are accessibility and flexibility allowing relevant proteins to interact with the signal. Furthermore, we show that not only the known hydrophobic residues are important in defining a nuclear export signals. We employ both neural networks and hidden Markov models in the prediction algorithm and verify the method on the most recently discovered NESs. The NES predictor (NetNES) is made available for general use at https://www.cbs.dtu.dk/.
Original languageEnglish
JournalProtein Engineering Design and Selection (Print Edition)
Volume17
Issue number6
Pages (from-to)527-36
Number of pages9
ISSN1741-0126
DOIs
Publication statusPublished - 2004

Bibliographical note

Keywords: Active Transport, Cell Nucleus; Algorithms; Artificial Intelligence; Aspartic Acid; Computational Biology; Computing Methodologies; Consensus Sequence; Databases, Protein; Glutamic Acid; Hydrophobicity; Internet; Isoelectric Point; Leucine; Markov Chains; Models, Molecular; Neural Networks (Computer); Nuclear Proteins; Protein Sorting Signals; Protein Structure, Secondary; Protein Structure, Tertiary; ROC Curve; Reproducibility of Results; Sequence Alignment; Serine; Structural Homology, Protein

ID: 2812830