Concept of strings and Trees in Bioinformatics
No Thumbnail Available
Date
2008
Journal Title
Journal ISSN
Volume Title
Publisher
ABACUS. Published by Mathematics Association of Nigeria.
Abstract
Strings have been found to be the most general medium for the representation of information. Considering the large amount of data stored in data dictionaries, databases or the massive data in genomic databases, text remains the main form of exchanging information. The representation of information from real-world problem may involve the use of many interlinked data structures.
In this paper, the suffix tree data structure was used to solve the Pattern Matching problem. We present the suffix tree as an efficient data structure that provides efficient access to all substrings of a string. It is able to rapidly align sequences containing millions of nucleotides and give sufficient biological information.