Computational prediction of over-annotated protein-coding genes in the genome of Agrobacterium tumefaciens strain C58
Yu Jia-Feng†a),b), Sui Tian-Xianga),d), Wang Hong-Meic), Wang Chun-Lingc), Jing Lic), Wang Ji-Huaa),c)
       
The relative G + C content at the three codon positions. As can be seen, the locating regions of the predicted non-coding genes are obviously different from those of the other genes. Careful observation indicates that the G + C content at the second and third codon positions of the protein-coding genes exhibit high usage bias. On the contrary, the values of relative G + C contents at the three codon positions of the predicted non-coding genes are about 1, respectively, which indicate that they are likely to be random sequences.