A FAST TECHNIQUE FOR DERIVING FREQUENT STRUCTURED PATTERNS FROM BIOLOGICAL DATA SETS
In the last years, the completion of the human genome sequencing showed a wide range of new challenging issues involving raw data analysis. In particular, the discovery of information implicitly encoded in biological sequences is assuming a prominent role in identifying genetic diseases and in deciphering biological mechanisms. This information is usually represented by patterns frequently occurring in the sequences. Because of biological observations, a specific class of patterns is becoming particularly interesting: frequent structured patterns. In this respect, it is biologically meaningful to look at both "exact" and "approximate" repetitions of pattens within the available sequences. This paper gives a contribution in this setting by providing algorithms which allow to discover frequent structured patterns, both in "exact" and "approximate" form, present in a collection of input biological sequences.
Year of publication: |
2005
|
---|---|
Authors: | TERRACINA, GIORGIO |
Published in: |
New Mathematics and Natural Computation (NMNC). - World Scientific Publishing Co. Pte. Ltd., ISSN 1793-7027. - Vol. 01.2005, 02, p. 305-327
|
Publisher: |
World Scientific Publishing Co. Pte. Ltd. |
Subject: | Sequence analysis | pattern discovery | frequent structured patterns | motifs |
Saved in:
Saved in favorites
Similar items by subject
-
A PRUNING APPROACH TO PATTERN DISCOVERY
WANG, HSIAO-FAN, (2008)
-
Pattern Discovery and Computational Mechanics
Shalizi, Cosma Rohilla, (2000)
-
Pattern Discovery and Detection: A Unified Statistical Methodology
Hand, David, (2004)
- More ...