G. Tzanis, I. Kavakiotis, I. Vlahavas, “Polyadenylation Site Prediction Using Interesting Emerging Pattern”, 8th IEEE International Conference on Bioinformatics and Bioengineering, IEEE, Athens, Greece, 2008.
Author(s): G. Tzanis, I. Kavakiotis, I. Vlahavas
Appeared In: 8th IEEE International Conference on Bioinformatics and Bioengineering, IEEE, Athens, Greece, 2008.
Keywords: prediction, polyadenylation, mRNA, messanger RNA, emerging patterns, data mining, Arabidopsis thaliana
Abstract: This paper presents a study on polyadenylation site prediction in mRNA sequences. We describe a method, called PolyA-EP, that we developed for predicting polyadenylation sites and we present a systematic study of the problem of recognizing mRNA 3΄ ends which contain a polyadenylation site using the proposed method. PolyA-EP exploits the advantages of emerging patterns, namely high understandability and discriminating power and can be used for both descriptive and predictive analysis. In particular, PolyA-EP is a parameterizable tool that can be used in order to extract interesting emerging patterns for describing or predicting polyadenylation sites. Moreover, the extracted emerging patterns can span across many elements around the polyadenylation site. We discuss the results of the experiments we conducted with Arabidopsis thaliana sequences drawing important conclusions and finally we propose a framework that improves the accuracy of polyadenylation site prediction.