Getting poster data...
Alex Lomsadze, Mark Borodovsky (Georgia Institute of Technology, Atlanta, GA, USA)Gene finding plays a fundamental role in genome sequencing projects. Characterization of protein function, manufacturing DNA microarrays, reconstruction of cellular networks rely on genes and proteins identified by computational gene finding tools. A list of predicted gene locations produced by ab initio gene finding algorithm is a result of computation of the maximum likelihood sequence parse into coding and non-coding regions; the computation takes into account multiple frequency patterns of nucleotide ordering in regions that carry or not carry genetic code as well as frequency patterns in the site signals located at borders of protein-coding regions. The frequency patterns could be used to compute posterior probabilities of presence of particular types of regions and sites in a particular sequence; termed as “coding potential” posterior probabilities of protein coding region in a given frame were naturally visualized in the graphical output of GeneMark (Borodovsky, McIninch 1993), the pioneer gene finding algorithm that has been steadily improved and diversified over the years.