Genome Subsequences Assembly Using Approximate Matching Techniques in Hadoop
Sequencing DNA will provide valuable insights into several aspects of human life. The major requirement of this domain is for a faster and more accurate sequencing mechanism. The process becomes difficult due to the huge size of DNA. This paper presents an effective genome assembly technique in Hadoop architecture using MapReduce. The fragment assembly is based on initially matching the subsequences and then depending on the matching levels, the final complete matching subsequences are filtered. The consensus alignment and recalibration are performed using Greedy approximate matching techniques. The experimental results show that our approach is more accurate and exhibits better coverage; however, the processing time is found to be high. In future, our contributions will be based on reducing the processing time. Discussions about these techniques are also presented in this paper.
Year of publication: |
2017
|
---|---|
Authors: | Raja, Govindan ; Reddy, U. Srinivasulu |
Published in: |
International Journal of Knowledge Discovery in Bioinformatics (IJKDB). - IGI Global, ISSN 1947-9123, ZDB-ID 2703506-2. - Vol. 7.2017, 2 (01.07.), p. 83-97
|
Publisher: |
IGI Global |
Subject: | Genome Assembly | Greedy Approximate Matching | Hadoop | Map Reduce | Sequence Alignment |
Saved in:
Online Resource
Saved in favorites
Similar items by subject
-
Hadoop ecosystem as enterprise big data platform : perspectives and practices
Mazumder, Sourav, (2018)
-
Frequent Itemset Mining in Large Datasets a Survey
Kumar, Manish, (2017)
-
Enterprise big data analysis using SVM classifier and lexicon dictionary
Radha, S., (2020)
- More ...