Abstract
The Hidden Markov Models (HMMs) are widely used for biological sequence analysis because of their ability to incorporate biological information in their structure. An automatic means of optimising the structure of HMMs would be highly desirable. To maintain biologically interpretable blocks inside the HMM, we used a Genetic Algorithm (GA) that has HMM blocks in its coding representation. We developed special genetics operations that maintain the useful HMM blocks. To prevent over-fitting a separate data set is used for comparing the performance of the HMMs to that used for the Baum-Welch training. The performance of this algorithm is applied to finding HMM structures for the promoter and coding region of C. jejuni. The GA-HMM was capable of finding a superior HMM to a hand-coded HMM designed for the same task which has been published in the literature.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Durbin, R., Eddy, S., Krogh, A., Mitchison, G.: Biological sequence analysis. Cambridge University Press, Cambridge (1998)
Petersen, L., Larsen, T.S., Ussery, D.W., On, S.L.W., Krogh, A.: Rpod promoters in Campylobacter jejuni exhibit a strong periodic signal instead of a -35 box. Journal of Molecular Biology 326(5), 1361–1372 (2003)
Krogh, A., Larsson, B., von Heijne, G., Sonnhammer, E.: Predicting transmembrane protein topology with a Hidden Markov Model: Application to complete genomes. Journal of Molecular Biology 305(3), 567–580 (2003)
Goldberg, D.E.: Genetic Algorithms in Search, Optimization & Machine Learning. Addison-Wesley, Reading, Mass (1989)
Yada, T., Ishikawa, M., Tanaka, H., Asai, K.: DNA Sequence Analysis Using Hidden Markov Model and Genetic Algorithm. Genome Informatics 5, 178–179 (1994)
Baker, J.E.: Reducing bias and inefficiency in the selection algorithm. In: Proceedings of the Second International Conference on Genetic Algorithms, Lawrence Erlbaum Associates (Hillsdale) (1987)
PrĂ¼gel-Bennett, A., Shapiro, J.L.: An analysis of genetic algorithms using statistical mechanics. Physical Review Letters 72(9), 1305–1309 (1994)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2004 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Won, KJ., PrĂ¼gel-Bennett, A., Krogh, A. (2004). The Block Hidden Markov Model for Biological Sequence Analysis. In: Negoita, M.G., Howlett, R.J., Jain, L.C. (eds) Knowledge-Based Intelligent Information and Engineering Systems. KES 2004. Lecture Notes in Computer Science(), vol 3213. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-30132-5_13
Download citation
DOI: https://doi.org/10.1007/978-3-540-30132-5_13
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-23318-3
Online ISBN: 978-3-540-30132-5
eBook Packages: Springer Book Archive