DPDB Home Page Search Analysis Help Statistics Links Contact us


STATISTICS
Tue May 18 13:43:17 2010

Contents of the Secondary Database
  • Total number of Polymorphic sets analyzed: 2319
  • Total number of Analysis units: 4471 [Phylogeny]
    (average number of analysis units per polymorphic set: 1.9)
    • CDS: 2184
    • exon: 1167
    • intron: 1040
    • 5ŽUTR: 31
    • 3ŽUTR: 33
    • promoter: 16

    •  

QUALITY OF THE ALIGNMENTS:

  • Number of Analysis units according to the "Number of sequences":
    • Low number (2-5 =  ! ): 1828 (40.9%)
    • Medium number (6-10 =    K ): 690 (15.4%)
    • High number (>10 =   J): 1953 (43.7%)
       
  • Number of Analysis units according to the "Percentage of gaps or ambiguous bases":
    • High (≥30% =  ! , low quality): 42 (0.9%)
    • Medium (≥10%-<30% =    K , medium quality): 159 (3.6%)
    • Low (<10% =   J , high quality): 4270 (95.5%)
       
  • Number of Analysis units according to the "Percentage of difference in size between the longest and the shortest sequences":
    • High (≥30% =  ! , low quality): 424 (9.5%)
    • Medium (≥10%-<30% =    K , medium quality): 292 (6.5%)
    • Low (<10% =   J , high quality): 3755 (84.0%)
       

SET CONFIDENCE:

  • Existence in NCBI PopSet*: 28.6% (1279/4471)
  • Consecutive GenBank accession numbers**: 40.1% (1791/4471)
  • One or more shared references**: 57.1% (2552/4471)
  • Journals* (Genetics, Mol.Biol.Evol., J.Mol.Evol., Mol.Phylogenet.Evol.): 58.6% (2620/4471)

* At least one sequence in the set
** All the sequences in the set


AVERAGES BY GENE REGIONS:

  #Aligns #Polym. sets Avg. #Seqs Avg. Align. length (bases) Avg. #Analyzed sites (bases) Avg. θ Avg. π Avg. %G+C
3ŽUTR 333013.3609.3514.30.006620.0045538.98
5ŽUTR 312815.7530.3375.90.007650.0058542.46
CDS 2184202011.81365.51048.30.006920.0065053.91
exon 116650615.2589.5497.80.005870.0054952.58
intron 104056215.2382.7335.70.012000.0108737.52
promoter 161138.8603.6324.80.007940.0076124.14


Get your own lists and graphics from the Graphical Search or the Comparative SearchTools

 

Contents of the Primary the Database

  • Total number of Sequences: 57650
  • Total number of References 3695
    • Published: 1361
    • Submitted: 1692
    • In press: 85
    • Published Only in Database: 10
    • Unpublished: 399
    • Thesis: 18
    • Number of different Journals: 105

 




DGM UAB