AgaP_AGAP003633 cDNA ORF clone, Anopheles gambiae str. PEST

The following AgaP_AGAP003633 cDNA ORF clone sequences were retrieved from the NCBI Reference Sequence Database (RefSeq). Each sequence represents the protein-coding region (open reading frame, ORF) of the AgaP_AGAP003633 transcript. ORF sequences can be delivered in our standard vector, pcDNA3.1+/C-(K)DYK, or in the vector of your choice, as an expression/transfection-ready ORF clone.

***CloneID: OAh03639
Accession No.: XM_313397.5 (latest version)
Definition: Anopheles gambiae str. PEST AGAP003633-RA (AgaP_AGAP003633), mRNA.
**Vector: pcDNA3.1+/C-(K)DYK or customized vector
*Turnaround time: 20
Price (USD): $713.30 (regular price $1019.00)

ORF Online Only Promotion

Next-day shipping ORF clones (in default vector with tag):
1 clone: 30% OFF
2-4 clones: 40% OFF
5 or more clones: 50% OFF
All other ORF clones: 30% OFF

*Business days.

** You may select a custom vector to replace pcDNA3.1+/C-(K)DYK after the clone is added to the cart.

** GenScript guarantees 100% sequence accuracy of all synthetic DNA constructs we deliver, but we do not guarantee protein expression in your experimental system. Protein expression is influenced by many factors that may vary between experiments or laboratories. In addition, please check the target ORF for a signal peptide, propeptide, or transit peptide, which may affect the choice of vector (N- or C-terminal tag vector).

*** One clone ID may correspond to multiple accession numbers that share the same CDS sequence.

  • Reference Sequences (RefSeq)
    CloneID: OAh03639
    Related accession (same CDS sequence): XM_313397.5
    Accession version: XM_313397.5 (latest version)
    Sequence information: ORF nucleotide sequence (length: 3975 bp)
    Vector: pcDNA3.1+/C-(K)DYK or customized vector
    Tag on pcDNA3.1+/C-(K)DYK: C-terminal DYKDDDDK tag
    ORF insert method: CloneEZ™ seamless cloning technology
    Insert structure: linear
    Update date: 2018-04-25
    Organism: Anopheles gambiae str. PEST
    Product: AGAP003633-PA
    Comment: PROVISIONAL REFSEQ: This record has not yet been subject to final NCBI review. This record is derived from an annotated genomic sequence (NT_078266). On Sep 28, 2011 this sequence version replaced XM_313397.4.

       1 ATGGATCTTG AAGAAGCAAA TGATGCAATT AAGCCCGGTG ACAATGTACC GAGATCAAAG
      61 ACGGAGATAG AATTTGAGGA GCATGTACTC GATACAAAGG TTAGGCTGTC CGATGGCACA
     121 GAAGCGTGCG TGTTCAGCCG CACGCCGCGC AACAAAGATC TGCTGTTCGG GTACATGTGC
     181 CATATCTGCG GCGTGGCCTG TCTGTATGGC GAGCGCATGC TGCAAATTCA TATCGCCGGG
     241 CGCAAGCATC AGGCCCGATT GAACGTGACC GTGTTTGATG CAGAGCAGTA CCGTGCTTCG
     301 CTGGTGGCGA AACCGACGAA AGCAACTTCC GGATCGAACG AGAACAACAC TTCCACCGGA
     361 AACAACGGTG GTGGCGGTCA GGAAAATGAC TCTTCCTCAC CGTCCTTGAT GAAACCGCAG
     421 GCTCGGTTGC AAAACGTGCT TGACGGCTAT CGCGATGGCC CATTGGTCGG GCTGGAATAC
     481 ATCGTGGAGC TGATCGATTC AGCGAACGGT GCCGATCCCG TCTACAGCTG CGTGCTGTGC
     541 AATCTGCACA ACAACAACGA GAGCGGCATC ACGACGCACA TGATCGGGAT GGGGCATCGG
     601 TTCAAGTTCC TCGAAAAGCA CTATCCGACC GTGCGCAAGA TGCTGTCCCC GTATCGCAAC
     661 AACAATCAGG AGAACGGCGG CCAGGTGTTT TTCCGCGTGG TGCAAACGGT GTGCGAAGCG
     721 ATCGAGGACC ATCACGGCCG GCTGACGCCG CAGGTGTACG AGGCGGCCGA GTTCGAGCGT
     781 AACCGGGTGA AGTTTGGCCA GCAGATCTCG TTCGGCACTC ACTTCGACGA GCGCACGGGA
     841 CCGAAGTTTG TCGAGGTGAT CGACGCGAAG GTGCTGCAGG ATTTGGTCAC CGCCGAGGAC
     901 TGTGGCGGTG GGCAGGTTCA GGGTGCGCCG AAACGCTACC GACGCAACGA GCGCCGTAGC
     961 AGTTTGGATT CCATTTCGAG CGTCAGCAGT GCTAACTCGA TGCTAACGAT CTCGTCCAGC
    1021 GGCGACGATG AGAATCGCGG ACGAACGCGA AAGCGAAGCC CGCTCAATGG GCAGGGCAAT
    1081 AGACTGAGTG GTGGTGGTGG CCGTGGTGAC GTGTCCTACC AGCGGCAGCA ACGGGCAGCA
    1141 CAGCCGGCAA ATTCGTCGGC GAAGCAGATG CTACCAACCC CCCGAGAGCT GTCGATACAG
    1201 TCGGCCGCCA TTGCACACGA GCGCTACAAG TGGGAGAAGT ATCGCTGTAC GGTCGATATA
    1261 GCGGTGGAAA AGCTGGAGAA GCAGCTGAAG GATCACGAAA AGAACCCGGA GAAGCACCCG
    1321 CTGTATTCGG AGGAGTGGAA AAAGTTCTGG AACCGGCGGT ACAAGGAGCT GCAGGCAGAG
    1381 AAGAAGGACC CAGCCAAGCA CAACTTCAAG CCGGAATGGA TCGAGTTTTG GACGAAGCGC
    1441 ATGAAGGAGC TGCACGAAGA GGAGGTCGCG CGAAAGAAGG AAGAGATCCG CACCAAGATG
    1501 AACCTGCCGG CGGAGGACGA AGAACGGACG GGCGAGCTGC GGGAGCAGTA CGCACTGCGT
    1561 GTCCCCACGG CGGGCAAGCG GGCCCGTTCG AGCGAGCGCA AAGCGGGCAC GGCCGCGGCC
    1621 CCGATCCTGA TCGATGTGAA CTCGGACGAG GAGGAGGATG ACTATAAGGG CGGACAGCCG
    1681 ATGCGGTCCA AGAATCGGTC GTCGCCGCGC TCATCGCGCC GCGGTGAGTG GTCGGATTCG
    1741 GGAAGGAAAC GGTCCACCTC CCGTTCCCAC CGGTCCCGCA GTCCCATCAG CGACGATAAC
    1801 GGGTACGAGG GCGGCTACTC GCGAAGTCGC TCGCAGCGTG CCCCACCGAT GGACTACCGG
    1861 GGCGAATGGC CCGGTGGCAA GCATCGAGGT GGACCAGTGG CGGAGCGGGT GGATTACGAC
    1921 GAGTGGGCGA AGAATTACTA CGGCCCGAAC AAGAAGGTGT TCGTGCGTAC GGAGTTCGAT
    1981 GCGGACAGCT CGACACCGCT CAACTTTGTC GCGGTCTGTC GGCTGTTGAC CGCGTTCGAG
    2041 GAGTATCTGG GCAGTTTGGG GCCAAAGGTG AACGATTTGC TGGCCAAAGC GCTCGCGCTG
    2101 GAAAAGGTGA AGGCGAACTC GGCCGACGAT TTGCTGCTGA ACGAGGACAA CTGCATGTTT
    2161 CTGGAGACGG TGAAGGAAAA GCTGAAAGGC CACATGATGG CGGAAACGAT CGATGCAAAC
    2221 CGAATGGTAC CGATCAAGAA GGCGGTGCGT AACATTGCCC GGCTGCTGCA CGAGGCATCG
    2281 AAACGCGAGC CTGTTAAGCC ACCGGAGGAG GAGCTGTCCC GCGATTCGGT GGCGGTGAGC
    2341 GAAGCGGCTG TCCCGAGTGC TGTTCCTACT GCCGCGGCTG CCACCATCAG CGCATCGGCC
    2401 ACTGGGGCCA GCATTGACAA GATTGCCGTC GCAGAGCAGC TTGCCAAATC GCTTGTTGCG
    2461 CAGGGAAAGA CCGACTTTAC CACCGAAGAG CTGGAACAGC TGATCAACGT GTACGTCACC
    2521 ATGGCCCAGA TGTCTCGGGA GAAGAACTCG CTCGTTACGG CGAAAGCGTA CATGGCCACA
    2581 CTGCCGCCCT CCGTCGTGCC GGTAGCACCG AAGGAGATTG TTCCCGCCCC GGTAGCTGCG
    2641 CAGGCCGCTG GACCGTCGAA GCTGCCGGAA CGTGTGCGCC CGGAAGCGGC TGGCTCGGCT
    2701 GGCGGTGCGG CACGCTGGAA CGACAAAAAG ATGGATGTTT CGGTGCGTTC CGGTAACTTC
    2761 GGCAGCATGG CGGACAACGA TGAGGACGAG CCGTCGTCGA GCATTTTGGA GAATCTAACC
    2821 GACTCCGATC TGCAGACGCT GCTGCAGAGC TTCAAGGAGC TTTCGAATGA TGAGCAGATG
    2881 CATCTCATCT CCTATCTGCG CAAGCTGGAA CGCACCGAGC CGGATCGTGT CGAGCGATTG
    2941 CGCCGTTATG TGAACTTTGA CTCATGGAGC AATCCCAAGG CAGGTGGTAA CGATGGCGGT
    3001 ACTGGTGGAC GGCGGGATGC TGCTAGCCGC CTGTCCGATG AAGACGAGCG TAATTATTTG
    3061 GAGCAAGACG ACGACTCCAC ATTCGATATG TTCCAACCGT CGCCAGGACC GTCGATGGGA
    3121 GGTCGTAAAC CGGTACCGAA CCCAGCGTCT AGCAACAGTG GGCCAAAACG GCCAGCAGTG
    3181 CCAGCGCAAC GCGCACAACA GCAGCAACAA CAGCGACAGA AAGACATGCA TCAACATCAA
    3241 CTGCAGCAGC AGCAGCAACA GCATCAGCAT CAGCAAAAGC CTCAACAGCA AAAAGGGAAC
    3301 CCGATGATTA TGGATTCGGA AGACGAGGAC GATTATTCGT ACGACGATAT TTTGCGCGCT
    3361 GCATCCAAGA ACGTGTCGAA CGTACCGCTG CTTTCCGGCT CGAAAATGCC GGCCGATGCG
    3421 TACGACACGA GCTCGAACCA TTCGCATCGA ACGAGCGGCA ACAACGATGC GGGCAGTGGT
    3481 GGGGGCAACA GTTCTGCCGG AATTTCGCTG TCCGACACGC AGAATTTGAT TGCCAACTTG
    3541 ATGGAATCGC TACAGAAATC CGTGTCCGAC GCCAACAGCC ATAGTACAGC GGGAGCGAGC
    3601 AGTTCCGCCG GCAGCAATAG CTACGCGAAT CCGATCACGA CGATTGGCGG GACAGCTGGT
    3661 AGTGGTGGTG GGGAAAGTGC TGTCGGTAAA ACAATTCCCG GCAAGATAAC ATCGGCCACG
    3721 CCGAATGCCT CCTCTATGTA TCCGGGCTTC CCTCAGCAAC AGCAGCAGCA TGGTCAATCC
    3781 ATGGGAATGG GCATGCCCTT CGGTGGACAG CAGGGACAGA TGCCGATGGG ACAAATGTCC
    3841 GGGCCGCAAG GGCAGATGCA ACAGCAATTT GGCAATCCGT ACATGTACCA GGGCCAGCAG
    3901 ATGTTCGGAG CGAATGGTCC CGGCCCGCAG CAGATGCAAC AGTTTGGCGG TTACGGTAAC
    3961 TACGGATACT ATTAG

    The stop codon will be deleted if the pcDNA3.1+/C-(K)DYK vector is selected.
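    With a C-terminal DYKDDDDK tag, a stop codon at the end of the ORF would end translation before the tag is reached, which is presumably why it is removed for this vector. The following is a minimal Python sketch of that step, not GenScript's actual procedure; the function name `strip_stop_codon` and the toy sequence are assumptions made for illustration.

```python
# Standard stop codons (DNA alphabet).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def strip_stop_codon(orf: str) -> str:
    """Return the ORF without its terminal stop codon.

    Whitespace (such as the 10-base display groups) is ignored.
    Raises ValueError if the input does not look like a complete ORF.
    """
    seq = "".join(orf.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("ORF length is not a multiple of 3")
    if not seq.startswith("ATG"):
        raise ValueError("ORF does not start with ATG")
    if seq[-3:] not in STOP_CODONS:
        raise ValueError("ORF does not end with a stop codon")
    return seq[:-3]


# Toy input (hypothetical): a start codon followed by the last
# 15 bases of the listing above, which end in the stop codon TAG.
toy_orf = "ATG TACGGATACT ATTAG"
tag_ready = strip_stop_codon(toy_orf)
print(tag_ready)
```

    The same function could be applied to the full 3975 bp listing after removing the position numbers; the checks fail early if the pasted sequence is truncated or out of frame.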

    RefSeq XP_313397.5
    CDS 743..4717
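    The CDS coordinates are 1-based and inclusive, so the span 743..4717 covers 4717 - 743 + 1 = 3975 nucleotides, which agrees with the ORF length stated for this clone. A small sketch of that arithmetic follows; the helper name `cds_length` is hypothetical.

```python
def cds_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate range such as 743..4717."""
    if start < 1 or end < start:
        raise ValueError("invalid coordinates")
    return end - start + 1


length = cds_length(743, 4717)
codons = length // 3      # codon count, including the terminal stop codon
residues = codons - 1     # amino acids encoded before the stop codon
print(length, codons, residues)
```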

    Target ORF information:

    RefSeq version: XM_313397.5
    Organism: Anopheles gambiae str. PEST
    Definition: Anopheles gambiae str. PEST AGAP003633-RA (AgaP_AGAP003633), mRNA.
    Epitope: DYKDDDDK
    Bacterial selection: AmpR
    Mammalian selection: NeoR
    Vector: pcDNA3.1+/C-(K)DYK

    ORF Insert Sequence: identical to the ORF nucleotide sequence listed above (3975 bp).

  • PubMed

    The Anopheles gambiae genome: an update.
    Trends in Parasitology 20(2):49-52 (2004 Feb)
    Mongin E,Louis C,Holt RA,Birney E,Collins FH


    The genome sequence of the malaria mosquito Anopheles gambiae.
    Science (New York, N.Y.) 298(5591):129-49 (2002 Oct)
    Holt RA,Subramanian GM,Halpern A,Sutton GG,Charlab R,Nusskern DR,Wincker P,Clark AG,Ribeiro JM,Wides R,Salzberg SL,Loftus B,Yandell M,Majoros WH,Rusch DB,Lai Z,Kraft CL,Abril JF,Anthouard V,Arensburger P,Atkinson PW,Baden H,de Berardinis V,Baldwin D,Benes V,Biedler J,Blass C,Bolanos R,Boscus D,Barnstead M,Cai S,Center A,Chaturverdi K,Christophides GK,Chrystal MA,Clamp M,Cravchik A,Curwen V,Dana A,Delcher A,Dew I,Evans CA,Flanigan M,Grundschober-Freimoser A,Friedli L,Gu Z,Guan P,Guigo R,Hillenmeyer ME,Hladun SL,Hogan JR,Hong YS,Hoover J,Jaillon O,Ke Z,Kodira C,Kokoza E,Koutsos A,Letunic I,Levitsky A,Liang Y,Lin JJ,Lobo NF,Lopez JR,Malek JA,McIntosh TC,Meister S,Miller J,Mobarry C,Mongin E,Murphy SD,O'Brochta DA,Pfannkoch C,Qi R,Regier MA,Remington K,Shao H,Sharakhova MV,Sitter CD,Shetty J,Smith TJ,Strong R,Sun J,Thomasova D,Ton LQ,Topalis P,Tu Z,Unger MF,Walenz B,Wang A,Wang J,Wang M,Wang X,Woodford KJ,Wortman JR,Wu M,Yao A,Zdobnov EM,Zhang H,Zhao Q,Zhao S,Zhu SC,Zhimulev I,Coluzzi M,della Torre A,Roth CW,Louis C,Kalush F,Mural RJ,Myers EW,Adams MD,Smith HO,Broder S,Gardner MJ,Fraser CM,Birney E,Bork P,Brey PT,Venter JC,Weissenbach J,Kafatos FC,Collins FH,Hoffman SL