Description | The tendency of genes involved in the same process to be in the same vicinity on the DNA molecule is a measurable property of genomic organization. The metric applied here is limited to genes found in evidence of other Genome Properties whose states are "YES". An arbitrary distance of 3000 nucleotides is used to determine that genes are within a cluster. In the case on non-TIGR genomes, where both TIGR and non-TIGR gene calls are present in the CMR, whichever set gives a more complete representation of the required components is used in these calculations, defaulting to the non-TIGR gene calls in case of a tie. The value assigned for this property is the percentage of genes found in clusters compared to the total number of genes in evidence for that property. Clearly, this method will not result in fully unbiased data as each genome may contain varying numbers of YES-state properties which themselves may have varying propensities for having genes organized into clusters. In particular, a property such as CRISPR elements, which contains large clusters of genes by its very definition is problematic and has been excluded from this calculation. |