Table 1. Comparison of Plasmodium falciparum, Saccharomyces cerevisiae, Arabidopsis thaliana and Homo sapiens genomic statistics

|  | Plasmodium falciparum | Saccharomyces cerevisiae | Arabidopsis thaliana | Homo sapiens |
|---|---|---|---|---|
| Genome general statistics |  |  |  |  |
| No. of chromosomes | 14 | 16 | 5 | 22 + X/Y |
| Size (bp) | 22,853,764 | 12,495,682 | 115,409,949 | 3,272,187,692 |
| Average (A+T) % | 80.6 | 61.7 | 65.1 | 59.0 |
| Estimated number of genes | 5,268 | 5,770 | 25,498 | 31,778 |
| Average gene length | 2,283 | 1,424 | 1,310 | 1,340 |
| % of coding genome | 53 | 66 | 29 | 9 |
| Initial annotation based on sequence similarity (BLAST or *Smith-Waterman E-values) |  |  |  |  |
| Proportion of predicted protein sequences: |  |  |  |  |
| - having a detectable similarity to sequences, in other organisms, of known function at the initial genome release date. | 34 % | 75 % | 69 % | 59 %* |
| - without any detectable similarity to sequences in other organisms at the initial genome release date, i.e. "no BLASTP match to known proteins" (estimates based on published data and local BLAST searches). | 61 % | < 8 % | < 20 % | 15 %* |
| - of totally unknown function (hypothetical proteins = with similarity to sequences of unknown function + without any detectable similarity to sequences in other organisms). | 66 % | 16 % | 31 % | 41 %* |
| Average characteristics of open reading frames |  |  |  |  |
| Exons: |  |  |  |  |
| No. per gene | 2.39 | 1.05 | 5.18 | 12.1 |
| (A+T) % | 76.3 | 60 | 55 | 52 |
| Average length | 949 | 1,356 | 253 | 111 |
| Introns: |  |  |  |  |
| (A+T) % | 86.5 | 64 | 66 | 60 |
| Intergenic regions: |  |  |  |  |
| (A+T) % | 86.4 | 64 | 66 | 60 |

The data presented are compiled from [22] for Plasmodium falciparum, [190] for yeast (supplemented with statistics from the Comprehensive Yeast Genome Database website [191]), the Arabidopsis Genome Initiative [192] for Arabidopsis, and the International Human Genome Sequencing Consortium [193] and [194] for human (supplemented with statistics from Ensembl [195]). The statistics reflect each genome at its complete release date; they have been continuously updated since then.

Birkholtz et al. Malaria Journal 2006 5:110 doi:10.1186/1475-2875-5-110
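
As a rough consistency check on the "Genome general statistics" rows of Table 1, the sketch below assumes (the source does not state this) that the coding percentage can be approximated as estimated number of genes × average gene length / genome size. Under that assumption the result rounds to the tabulated 53, 66 and 29 % for the first three organisms, but comes out at only about 1.3 % for Homo sapiens against the tabulated 9 %, so the human figure presumably rests on a different definition. The names `TABLE_1`, `coding_percent` and `genes_per_mb` are illustrative and do not come from the source.

```python
# Values copied from Table 1:
# (estimated number of genes, average gene length in bp, genome size in bp).
TABLE_1 = {
    "Plasmodium falciparum": (5_268, 2_283, 22_853_764),
    "Saccharomyces cerevisiae": (5_770, 1_424, 12_495_682),
    "Arabidopsis thaliana": (25_498, 1_310, 115_409_949),
    "Homo sapiens": (31_778, 1_340, 3_272_187_692),
}


def coding_percent(n_genes, avg_gene_length_bp, genome_size_bp):
    """Approximate coding share of the genome, in percent.

    ASSUMPTION: coding % ~ genes x average gene length / genome size.
    The source table does not say how its values were computed.
    """
    return 100.0 * n_genes * avg_gene_length_bp / genome_size_bp


def genes_per_mb(n_genes, genome_size_bp):
    """Gene density in genes per megabase (a derived, illustrative statistic)."""
    return n_genes / (genome_size_bp / 1_000_000)


if __name__ == "__main__":
    for organism, (genes, length, size) in TABLE_1.items():
        print(
            f"{organism}: coding ~ {coding_percent(genes, length, size):.1f} %, "
            f"density ~ {genes_per_mb(genes, size):.1f} genes/Mb"
        )
```

Comparing the printed coding percentages with the "% of coding genome" row gives a quick way to spot which tabulated values follow this simple relation and which do not.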