An improved consensus algorithm for approximate string matching
Main Article Content
Abstract
One of the fundamental tasks in bioinformatics consists in searching for patterns, in a protein or DNA sequence, that are sufficiently similar to a given motif. This problem is known as approximate string matching (ASM) and has several applications besides bioinformatics. The similarity between strings of symbols is typically evaluated by metrics such as the Hamming distance, the Levensthein distance, or correlation or consensus techniques. In this paper, a refinement of a recently introduced consensus algorithm is proposed and evaluated with real protein sequences from plants. Preliminary tests with real protein sequences from plants show that the proposed refinement can significantly increase the localization accuracy by up to 95%, while further reducing the number of false positives by around 80%. Thus, the proposed algorithm could be a useful tool in many biological applications.
Article Details
DERECHOS DE AUTOR Y DERECHOS CONEXOS, las MEMORIAS CONGRESO NACIONAL DE INGENÍERIA BIOMÉDICA es una publicación editada por la Sociedad Mexicana de Ingeniería Biomédica A.C., Plaza Buenavista, núm. 2, Col. Buenavista, Delegación Cuauhtémoc, C.P. 06350, México, D.F., Tel. +52 (555) 574-4505, www.somib.org.mx, correo-e: secretariado@somib.org.mx. Editor responsable: Elliot Vernet Saavedra. Reserva de Derechos al Uso Exclusivo No. 04-2015-011313082200-01, ISSN: 2395-8928, ambos otorgados por el Instituto Nacional de Derechos de Autor.