Genome Subsequences Assembly Using Approximate Matching Techniques in Hadoop

Sequencing DNA will provide valuable insights into several aspects of human life. The major requirement of this domain is for a faster and more accurate sequencing mechanism. The process becomes difficult due to the huge size of DNA. This paper presents an effective genome assembly technique in Hadoop architecture using MapReduce. The fragment assembly is based on initially matching the subsequences and then depending on the matching levels, the final complete matching subsequences are filtered. The consensus alignment and recalibration are performed using Greedy approximate matching techniques. The experimental results show that our approach is more accurate and exhibits better coverage; however, the processing time is found to be high. In future, our contributions will be based on reducing the processing time. Discussions about these techniques are also presented in this paper.

MoreLess

Year of publication:	2017
Authors:	Raja, Govindan ; Reddy, U. Srinivasulu
Published in:	International Journal of Knowledge Discovery in Bioinformatics (IJKDB). - IGI Global, ISSN 1947-9123, ZDB-ID 2703506-2. - Vol. 7.2017, 2 (01.07.), p. 83-97
Publisher:	IGI Global
Subject:	Genome Assembly \| Greedy Approximate Matching \| Hadoop \| Map Reduce \| Sequence Alignment

More details

Type of publication:	Article
Language:	English
Other identifiers:	10.4018/IJKDB.2017070105 [DOI]
Source:	Other ZBW resources

Persistent link: https://www.econbiz.de/10012046212