|
DRUM >
College of Computer, Mathematical & Physical Sciences >
Computer Science >
Computer Science Research Works >
Please use this identifier to cite or link to this item:
http://hdl.handle.net/1903/8008
|
| Title: | A clustering method for repeat analysis in DNA sequences. |
| Authors: | Volfovsky, Natalia Haas, Brian J. Salzberg, Steven L. |
| Type: | Article |
| Keywords: | genomic sequences suffix tree multi fasta genomes RepeatFinder |
| Issue Date: | 1-Aug-2001 |
| Publisher: | Genome Biology |
| Citation: | A clustering method for repeat analysis in DNA sequences. N. Volfovsky, B.J. Haas, and S.L. Salzberg. Genome Biology 2:8 (2001), research0027:1-11. |
| Abstract: | Background: A computational system for analysis of the repetitive structure of genomic
sequences is described. The method uses suffix trees to organize and search the input sequences;
this data structure has been used previously for efficient computation of exact and degenerate
repeats.
Results: The resulting software tool collects all repeat classes and outputs summary statistics as
well as a file containing multiple sequences (multi fasta), that can be used as the target of searches.
Its use is demonstrated here on several complete microbial genomes, the entire Arabidopsis
thaliana genome, and a large collection of rice bacterial artificial chromosome end sequences.
Conclusions: We propose a new clustering method for analysis of the repeat data captured in suffix
trees. This method has been incorporated into a system that can find repeats in individual genome
sequences or sets of sequences, and that can organize those repeats into classes. It quickly and
accurately creates repeat databases from small and large genomes. The associated software
(RepeatFinder), should prove helpful in the analysis of repeat structure for both complete and partial
genome sequences. |
| URI: | http://hdl.handle.net/1903/8008 |
| Appears in Collections: | Computer Science Research Works
|
All items in DRUM are protected by copyright, with all rights reserved.
|