@brazovayeye

Evolving Regular Expression-based Sequence Classifiers for Protein Nuclear Localisation

, , и . Applications of Evolutionary Computing, EvoWorkshops2004: EvoBIO, EvoCOMNET, EvoHOT, EvoIASP, EvoMUSART, EvoSTOC, том 3005 из LNCS, стр. 31--40. Coimbra, Portugal, Springer Verlag, (5-7 April 2004)

Аннотация

A number of bioinformatics tools use regular expression (RE) matching to locate protein or DNA sequence motifs that have been discovered by researchers in the laboratory. For example, patterns representing nuclear localisation signals (NLSs) are used to predict nuclear localisation. NLSs are not yet well understood, and so the set of currently known NLSs may be incomplete. Here we use genetic programming (GP) to generate RE-based classifiers for nuclear localisation. While the approach is a supervised one (with respect to protein location), it is unsupervised with respect to already known NLSs. It therefore has the potential to discover new NLS motifs. We apply both tree based and linear GP to the problem. The inclusion of predicted secondary structure in the input does not improve performance. Benchmarking shows that our majority classifiers are competitive with existing tools. The evolved REs are usually "NLS like" and work is underway to analyse these for novelty.

Линки и ресурсы

тэги

сообщество

  • @brazovayeye
  • @dblp
@brazovayeye- тэги данного пользователя выделены