Patil, Nagamma; Toshniwal, Durga; Garg, Kumkum - In: International Journal of Data Analysis Techniques and … 5 (2013) 2, pp. 122-147
for genome identification based on exact matching of n-grams. However, in most real world biological problems, it may not … be feasible to have an exact match, so approximate matching may be desired. The problem in using n-grams is that the … on approximate matching. Generally genome sequences are very long, so we sample the data into 10,000 base pairs. Given a …