genbank release 189.0 is out (21.Apr.2012)
ftp://ftp.ncbi.nih.gov/genbank/gbrel.txt
4.84 GB of virus-records in 20 files when uncompressed
1162210 virus sequences, 200831(17.3%
of these are influenza
~1340M virus-nucleotides , 297734743 of these in influenza
1482 nucleotides per influenza sequence in average
the 100 most sequenced viruses are:
439744,Human immunodeficiency virus
190649,Influenza A virus
117618,Hepatitis C virus
40356,Hepatitis B virus
22960,Simian immunodeficiency virus
13573,Rabies virus
13126,Human herpesvirus
12727,Porcine reproductive and respiratory syndrome virus
11305,Dengue virus
10036,Norovirus
9427,Influenza B virus
9119,Human papillomavirus
8854,Rotavirus
7648,Human echovirus
7621,Human enterovirus
7090,Human rotavirus
6701,Measles virus
6127,Human coxsackievirus
5515,Human rhinovirus
5514,Newcastle disease virus
5171,Hepatitis A virus
5124,Human respiratory syncytial virus
5096,Torque teno virus
4639,Foot-and-mouth disease virus
4568,Hepatitis E virus
4195,Human metapneumovirus
4037,GB virus
3645,JC polyomavirus
3628,Human adenovirus
3610,Human T-lymphotropic virus
3118,Human poliovirus
3072,Infectious bronchitis virus
2764,Bovine viral diarrhea virus
2480,Infectious bursal disease virus
2442,Tomato spotted wilt virus
2394,Feline immunodeficiency virus
2331,Citrus tristeza virus
2302,West Nile virus
2046,Porcine circovirus
1903,BK polyomavirus
1749,Equine infectious anemia virus
1730,Japanese encephalitis virus
1718,Human bocavirus
1693,Human astrovirus
1638,Human parechovirus
1637,Small ruminant lentivirus
1521,Potato virus
1435,Bluetongue virus
1430,Classical swine fever virus
1414,Human parvovirus
1345,African swine fever virus
1334,Cucumber mosaic virus
1327,Hepatitis delta virus
1290,Chikungunya virus
1264,Feline coronavirus
1227,Ovine progressive pneumonia virus
1144,Sapovirus
1142,Tick-borne encephalitis virus
1084,Bovine rotavirus
1083,Grapevine leafroll-associated virus
1057,Turnip mosaic virus
985,Human endogenous retrovirus
980,Gallid herpesvirus
973,Rubella virus
968,Human coronavirus
959,Infectious salmon anemia virus
956,Canine distemper virus
946,Canine parvovirus
938,Mumps virus
933,Tomato yellow leaf curl virus
933,Beet necrotic yellow vein virus
864,Buggy Creek virus
862,Equid herpesvirus
843,Feline leukemia virus
842,Equine arteritis virus
838,Eastern equine encephalitis virus
835,Simian foamy virus
793,Porcine rotavirus
765,Swine hepatitis E virus
755,Influenza C virus
714,Viral hemorrhagic septicemia virus
700,Rift Valley fever virus
688,Simian T-lymphotropic virus
671,Crimean-Congo hemorrhagic fever virus
665,Fowl adenovirus
657,Seoul virus
646,Bovine leukemia virus
642,Puumala virus
618,Torque teno sus virus
615,Vesicular stomatitis virus
598,Cowpox virus
589,Banana bunchy top virus
585,Rice stripe virus
576,Plum pox virus
574,Zucchini yellow mosaic virus
556,Barley yellow dwarf virus
548,SARS coronavirus
534,Porcine epidemic diarrhea virus
525,Beak and feather disease virus
522,Emiliania huxleyi virus
------------------------------------------------
all flu : 200852
A:190649
B:9427
C:755
unidentified:21
---------------------------
Flu-A segments:
percent of nucleotides in viral genbank sequences:
all,Newcastle,"Avian","Tomato","Potato"
A:32.0,29.8,31.6,28.6,30.4
C:20.4,22.9,20.2,19.0,19.9
G:23.3,23.6,24.2,21.4,23.9
T:24.2,23.7,24.0,29.9,25.7
ftp://ftp.ncbi.nih.gov/genbank/gbrel.txt
Code:
number,code,files,sequences,nucleotides,name --------------------------------------------- 12 ,EST, 461 , 72644911 , 40395013270 ,expressed sequence tags 16 ,HTG, 136 , 146372 , 24371740442 ,high throughput genomic 15 ,GSS, 255 , 33921273 , 21587774061 ,genome survey 13 ,PAT, 178 , 23988262 , 11941467633 ,patent 7 ,BCT, 85 , 823012 , 8011103134 ,bacterial 1 ,PRI, 45 , 663665 , 6277006564 ,primate 6 ,PLN, 55 , 2424604 , 5291469882 ,plant fungal algal 2 ,ROD, 29 , 439271 , 4425470108 ,rodent 19 ,TSA, 70 , 6024687 , 4038324975 ,??? 18 ,ENV, 53 , 4685451 , 3215023639 ,Environmental sampling 4 ,VRT, 26 , 1039878 , 2831711043 ,other vertebrate 5 ,INV, 30 , 1549678 , 2418282443 ,invertebrate 8 ,VRL, 20 , 1162210 , 1341502602 ,viral 10 ,SYN, 7 , 122994 , 925854875 ,synthetic 3 ,MAM, 8 , 307782 , 847879863 ,other mammalian 14 ,STS, 20 , 1322634 , 636256682 ,sequence tagged sites 17 ,HTC, 15 , 551007 , 634340429 ,high throughput cDNA 9 ,PHG, 1 , 6491 , 76133941 ,bacteriophage 11 ,UNA, 1 , 239 , 125812 ,unannotated 20 ,CON 0, 0, 0,constructed
4.84 GB of virus-records in 20 files when uncompressed
1162210 virus sequences, 200831(17.3%
~1340M virus-nucleotides , 297734743 of these in influenza
1482 nucleotides per influenza sequence in average
the 100 most sequenced viruses are:
439744,Human immunodeficiency virus
190649,Influenza A virus
117618,Hepatitis C virus
40356,Hepatitis B virus
22960,Simian immunodeficiency virus
13573,Rabies virus
13126,Human herpesvirus
12727,Porcine reproductive and respiratory syndrome virus
11305,Dengue virus
10036,Norovirus
9427,Influenza B virus
9119,Human papillomavirus
8854,Rotavirus
7648,Human echovirus
7621,Human enterovirus
7090,Human rotavirus
6701,Measles virus
6127,Human coxsackievirus
5515,Human rhinovirus
5514,Newcastle disease virus
5171,Hepatitis A virus
5124,Human respiratory syncytial virus
5096,Torque teno virus
4639,Foot-and-mouth disease virus
4568,Hepatitis E virus
4195,Human metapneumovirus
4037,GB virus
3645,JC polyomavirus
3628,Human adenovirus
3610,Human T-lymphotropic virus
3118,Human poliovirus
3072,Infectious bronchitis virus
2764,Bovine viral diarrhea virus
2480,Infectious bursal disease virus
2442,Tomato spotted wilt virus
2394,Feline immunodeficiency virus
2331,Citrus tristeza virus
2302,West Nile virus
2046,Porcine circovirus
1903,BK polyomavirus
1749,Equine infectious anemia virus
1730,Japanese encephalitis virus
1718,Human bocavirus
1693,Human astrovirus
1638,Human parechovirus
1637,Small ruminant lentivirus
1521,Potato virus
1435,Bluetongue virus
1430,Classical swine fever virus
1414,Human parvovirus
1345,African swine fever virus
1334,Cucumber mosaic virus
1327,Hepatitis delta virus
1290,Chikungunya virus
1264,Feline coronavirus
1227,Ovine progressive pneumonia virus
1144,Sapovirus
1142,Tick-borne encephalitis virus
1084,Bovine rotavirus
1083,Grapevine leafroll-associated virus
1057,Turnip mosaic virus
985,Human endogenous retrovirus
980,Gallid herpesvirus
973,Rubella virus
968,Human coronavirus
959,Infectious salmon anemia virus
956,Canine distemper virus
946,Canine parvovirus
938,Mumps virus
933,Tomato yellow leaf curl virus
933,Beet necrotic yellow vein virus
864,Buggy Creek virus
862,Equid herpesvirus
843,Feline leukemia virus
842,Equine arteritis virus
838,Eastern equine encephalitis virus
835,Simian foamy virus
793,Porcine rotavirus
765,Swine hepatitis E virus
755,Influenza C virus
714,Viral hemorrhagic septicemia virus
700,Rift Valley fever virus
688,Simian T-lymphotropic virus
671,Crimean-Congo hemorrhagic fever virus
665,Fowl adenovirus
657,Seoul virus
646,Bovine leukemia virus
642,Puumala virus
618,Torque teno sus virus
615,Vesicular stomatitis virus
598,Cowpox virus
589,Banana bunchy top virus
585,Rice stripe virus
576,Plum pox virus
574,Zucchini yellow mosaic virus
556,Barley yellow dwarf virus
548,SARS coronavirus
534,Porcine epidemic diarrhea virus
525,Beak and feather disease virus
522,Emiliania huxleyi virus
------------------------------------------------
all flu : 200852
A:190649
B:9427
C:755
unidentified:21
---------------------------
Flu-A segments:
Code:
18194,18265,18007,47478,18279,27837,23476,19113 1 ,5018, 320,7955, 4, 12, 263,1692, 12, 11, 54,2748, 105 2 ,1438, 321, 9,7667, 409,3597,4612, 15, 197 3 ,1576,3422, 307,3438,7418, 12, 215,1297, 274, 15, 33 4*,17195,366,18029,767,5342,1267,1160,102,2500,292,219,113,77,10,13,26 5 ,5488, 8, 209, 65, 152,4810, 27,7520 6*,14995,9306,563,145,188,1019,317,1029,275 7 , 834,8335, 41, 17,3891,10328, 30 8*,17346,1767 ---------------------------------------- 41,11573,4898,517, 205, 2 42, 143, 12, 3, 42, 11, 16, 3, 128, 8 43,13462,407, 269, 427, 1,3463 44, 571, 15, 166, 15 45, 30, 134, 442, 4, 11, 48, 69, 147, 20,4419, 18 46, 928, 192, 16, 5, 18, 12, 96 47, 411, 15, 24, 24, 10, 676 48, 102 49, 7, 424,1730, 32, 34, 114, 124, 7, 11, 17 4A, 212, 1, 3, 8, 66, 2 4B, 54, 13, 152 4C, 91, 22 4D, 10, 34, 33 4E, 9, 1 4F, 13 4G, 5, 16, 5 61,3149, 728, 9, 18, 488,7276, 145, 175, 33,2929, 30, 15 62,6184, 642, 151, 725, 322, 185, 904, 3, 190 63, 292, 244, 10, 13, 2, 2 64, 106, 22, 17 65, 1, 51, 10, 126 66, 567, 47, 388, 8, 9 67, 3, 6, 12, 74, 14, 182, 26 68, 700, 205, 124 69, 260, 15 81,4856, 10, 68,8571,2084,1753, 4 82, 403,1335, 15, 10, 4 ---------------------------------------------------
all,Newcastle,"Avian","Tomato","Potato"
A:32.0,29.8,31.6,28.6,30.4
C:20.4,22.9,20.2,19.0,19.9
G:23.3,23.6,24.2,21.4,23.9
T:24.2,23.7,24.0,29.9,25.7
Comment