Computational prediction of over-annotated protein-coding genes in the genome of Agrobacterium tumefaciens strain C58
Yu Jia-Feng†a),b), Sui Tian-Xianga),d), Wang Hong-Meic), Wang Chun-Lingc), Jing Lic), Wang Ji-Huaa),c)
       
Purine/pyrimidine disparity at the three codon positions. The over-annotated genes occupy a region that is clearly separated from that of the genuine protein-coding genes. Purine bases predominate at the first codon position in protein-coding genes, whereas the purine/pyrimidine disparity at the first codon position of the over-annotated genes is below zero.
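As an illustration of the quantity plotted, the following minimal sketch computes a purine/pyrimidine disparity for each codon position of an open reading frame. It assumes the disparity is defined as the frequency of purines (A, G) minus the frequency of pyrimidines (C, T) at that position, which may differ in normalization from the definition used in this work. The function name `purine_pyrimidine_disparity` is hypothetical.

```python
from typing import Tuple


def purine_pyrimidine_disparity(orf: str) -> Tuple[float, float, float]:
    """Return the purine/pyrimidine disparity at codon positions 1, 2 and 3.

    Assumed definition (not confirmed by the source):
    disparity = freq(A) + freq(G) - freq(C) - freq(T) at each position.
    """
    seq = orf.upper()
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    disparities = []
    for pos in range(3):
        # Every third base starting at this codon position; skip ambiguous symbols.
        bases = [b for b in seq[pos:usable:3] if b in "ACGT"]
        if not bases:
            disparities.append(0.0)
            continue
        purines = sum(b in "AG" for b in bases)
        pyrimidines = len(bases) - purines
        disparities.append((purines - pyrimidines) / len(bases))
    return (disparities[0], disparities[1], disparities[2])


if __name__ == "__main__":
    # Toy sequence for illustration only, not data from the study.
    print(purine_pyrimidine_disparity("ATGGCTAAA"))
```

Under this assumed definition, a positive first value means purines outnumber pyrimidines at the first codon position, as described for the protein-coding genes, and a negative first value corresponds to the pattern described for the over-annotated genes.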