AgaP_AGAP005165 cDNA ORF clone, Anopheles gambiae str. PEST

The following AgaP_AGAP005165 gene cDNA ORF clone sequences were retrieved from the NCBI Reference Sequence Database (RefSeq). These sequences represent the protein-coding region, i.e. the open reading frame (ORF), of the AgaP_AGAP005165 cDNA. ORF sequences can be delivered in our standard vector, pcDNA3.1+/C-(K)DYK, or in the vector of your choice, as an expression/transfection-ready ORF clone.

***CloneID: OAh03935
Accession No.: XM_314061.4 (latest version)
Definition: Anopheles gambiae str. PEST AGAP005165-RA (AgaP_AGAP005165), partial mRNA.
**Vector: pcDNA3.1-C-(k)DYK or customized vector
*Turnaround time: 25
Price (USD): $797.30 (shown alongside $1139.00)

ORF Online Only Promotion

Next-day Shipping ORF Clones (in default vector with tag)
1 clone: 30% OFF
2-4 clones: 40% OFF
5 or more clones: 50% OFF
All Other ORF Clones
30% OFF
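As a rough check on the prices listed above, the sketch below applies a percentage discount to a price. It assumes, since the page does not say so outright, that $1139.00 is the undiscounted price and that the 30% tier applies to this clone; the function name `discounted_price` is hypothetical.

```python
from decimal import Decimal, ROUND_HALF_UP

def discounted_price(list_price: str, percent_off: int) -> Decimal:
    """Return the price after a percentage discount, rounded to cents."""
    factor = Decimal(100 - percent_off) / Decimal(100)
    price = Decimal(list_price) * factor
    return price.quantize(Decimal("0.01"), rounding=ROUND_HALF_UP)

# Assumption: 1139.00 is the undiscounted price and the 30% tier applies.
print(discounted_price("1139.00", 30))  # 797.30
```

Under those assumptions the result matches the $797.30 shown for this clone.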

*Business Day

** You may select a custom vector to replace pcDNA3.1+/C-(K)DYK after the clone is added to the cart.

** GenScript guarantees 100% sequence accuracy of all synthetic DNA constructs we deliver, but we do not guarantee protein expression in your experimental system. Protein expression is influenced by many factors that may vary between experiments or laboratories. In addition, please pay attention to any signal peptide, propeptide or transit peptide in the target ORF, which may affect the choice of vector (N- or C-terminal tag vector).

***One clone ID might be correlated to multiple accession numbers, which share the same CDS sequence.

  • Reference Sequences (Refseq)
    CloneID: OAh03935
    Clone ID Related Accession (Same CDS sequence): XM_314061.4
    Accession Version: XM_314061.4 (latest version)
    Sequence Information: ORF Nucleotide Sequence (Length: 4038 bp)
    Vector: pcDNA3.1-C-(k)DYK or customized vector
    Tag on pcDNA3.1+/C-(K)DYK: C-terminal DYKDDDDK tag
    ORF Insert Method: CloneEZ™ Seamless cloning technology
    Insert Structure: linear
    Update Date: 2018-04-25
    Organism: Anopheles gambiae str. PEST
    Product: AGAP005165-PA
    Comment: PROVISIONAL REFSEQ: This record has not yet been subject to final NCBI review. This record is derived from an annotated genomic sequence (NT_078265). On Oct 15, 2007 this sequence version replaced XM_314061.3. COMPLETENESS: incomplete on the 3' end.

    ATGTGGCTGT TTGGATTTAT GTTGATCGCC ATCGTGTGTG CCGCCAAGTG CACGGAGGCA 
    GACATTTCGG TGGAGGCAGC ATTTCAAGGA CGTTGTGGCC ACGGGAGTCC TTGTGAGCAA
    TTGTGCTACG AGCTGCACGA TGGCATGTAC GAGTGCGACT GCTCGGAGGG ATTCGAGCTG
    AACAAAAATG GCTATAGTTG CCAAGAAATC AATTCTACCT CCAGCGACGA TAACGCATCG
    AACGGTGAGG AGGATGTCCT TTATCAGCGT GGTGCATCGT TCAGCGCCAA ACTCGACATA
    TCCGATCTGC ACATATCGAT GAAATCGTTC GACGATGACG ATGGAGCCGA AGAAGCGAAA
    ACGACGTCGG GCGGTAGCGG AGGCCGGCAG GGAGCGGTCA CCAAAGCGCG ACCCCCATCG
    AGCGCGAACC CGAACAGCAT CAACGCGAAG AGCGATGAAT TAGCGCGAAG AAAGGCAAAC
    CGTAATTGGA AACTCGAACC GTCCTCGAAG CAATCGGCGA ATGCGAATGA GGGCCGCGCC
    CGCTCGTCAA GCCGCAAGGA AAAATCCATC CAAAATGATG GCCTGCCCGC GCTAGTGACG
    GCGTACGGTG ATGGCATGCG GCATGATCAG CGGCAGCAGC AGCACCAGCA GCAGCATCCG
    AAGACTTCCG CTTCCGGCAG TGACCGGCGA AAGTTTAACG CAACATCCGC TGGCAGAAAG
    GGACCGGGTA CGGGCGTTGA TGATACTGGC ACTGGCAGCA GCAATATGCC ACTGGCGAGT
    GATATGGAAA TCAGTGGCAT ATCATTACAC TCGCCCCTTT CCGGAAGACG AGGGAATGCA
    GCAGACACAG CGAGAAGGGA AATTACCTCC GGAGGCGATG GGGCGCGGTG TCGTGACTCA
    TTAAAAAACA GAATTTCCGC CCAAGTGGAA ACATTCACAA AATCAACTGA AGAAACAGAC
    AATCTAATTT GTTTTATTTG GTGTAACTTA AACGACTCCT TCTTTTTCTC TTTTAATCGC
    TTCAGCCAAC CGTGCACACT TGATTGCGGC TCTGAAGGTA CCTGTTACAT TTCGCCGGAA
    GGATCGTCGT CACTGATGCG TTGTCTGTGT GCTTTCGGCA AGACAGGGCA GCGCTGCGAA
    GAAGATGCAA AAGTGAACAC TCCAAGATTT AGTAAACATT CGTGGGTTGC ATTTCCAGCG
    CTCAGAGGAG CTTACAAACA TGTTCAGCTA CACATCGAGT TTCGACCGGA ATCGTTTGAT
    GGAATTTTAC TACTGACCGG CGAGCGAGAC GATTTGACGG GCGACTTTAT GGCTCTGCTG
    CTGCACCAAG GGTTCGTCGA GTTCTGGTTC GATTGTGGAT CTGGAATGGG TCGCGTCAAG
    TCAGAGGAAA CGATCGTCCT TAACCAGTGG AACACTATTA CCATCTATCG GCACCGGTGG
    GACGCGTGGC TCGTACTGAA CCAAGGCAAC CGGGTGCAGG GACGCTCCAA GGGCCTGTTT
    TCGCGTATCA CTTTCCGTGA GCCAGTTTTC CTCGGCGGGT ACGGCAACAT CACTGGCCTG
    GACCGGAAGC TTCCCGTCAG CACCGGCTTC ACCGGATGTA TACGAAAGTT TGTCGCTAAT
    GATCACGATT ACAACTTCCA GCAGGGCTCA CTCGGCGACG TGTCACATGG ATTCGATATC
    CAGGAATGCA TAACCGATCG GTGCAGTCGC TATCCCTGCC AACACGGCGG TAAATGTCTG
    CCGTCGGATG ACGGTGCCAT CTGTCTCTGT CCGCTGGGAT TTGGCGGTGA TTTATGCGAG
    ATGCGACTGG ATTTACAGGT GCCTTCATTC AACGGATCTT CTTATCTACG ATATGCTCCA
    CTGGGCGATA GCTGCATTAT TTGGTTCGAG CTTAAGATCA TCATCAAGCC GCTGCTAGAG
    GACGGGCTGC TGCTGTACAG CGGGCACCAC GAGTATGGTG ATTACATTTC GCTCTGCCTC
    AACATGGGCC ACGTCGAGTT CACGTACGAT CTGGGCAGCG GGCCAGCAAC CGTCAGGAGC
    GAGTTCCCGC TCTCCATGGG ACAGTGGCAT ACGATAAAAG TGTCCCGAAC GTCACGGTTG
    GCCGTGCTGA AGATCGATCA GCTGCCGGAA GTGATGACGG TGTCGCCGAA CGGGTTCTGG
    CATCTTTCCC TGCCACACAG CCTTTATCTT GGCGGCATCC ACAATGTACA CACGCTGCCA
    ACGAGCCTGC GGGATAAAGG ATCATTCGCC GGGTGCATTC AGAAGGTCGA TATCAACGAT
    CGCACTATTG CTATCATTTC GGAGGCACTC GGTGGATCGA ACGTGGAAAA CTGTCCGCAT
    GCGTGCGTTG CCCGACCGTG CGGTCCACTT GCTAAGTGCG TCCCGAATCT CGACACCTAC
    GAATGCCAAT GCAACCCGCA GAACCGCCAG TGCAACAAAG CCGAGGAGCT GCCATCGGAA
    GTAATTGAAA AGCAGCAGCG GCTGCTGAAG CGCAAACAGC AAGAACAGGC AGTCGCACCG
    CCATCCTCCG GGAACGGTTA CAATCAACCG AGGAATCGTC GCAGATGGAA CCGGTATCAA
    TCGGCTGCGG GTGAAATAAA AACTGCTCCA CCAGCAGCAT CCACGGCTAA TGAGGCTGAT
    AAACAACAGT CGCACGCGAA CGGAGCAGCA GAAACATCCT CGACGGTAGC TAGCCCGGCT
    GCATCCACCA CGGTGAAGGA TCCGGCGAAC ATTGCCCATC GGTTGCGGCG GCCGGAAGCA
    ACGCCGGAGA CGTTGCCGGC CAGCAATAAG GATACGGGCA GCGATGAACC GTCGTCATCT
    TTCGCTGACA GCGAAGACCA CCACGACGAC GACAATGACG ACAGTAATAG CGCTGACGAT
    GTAGCTGCTT ACGATGGCGA TGGGTACGGT AAAGATTACG ACCAGCATGG CGACCGGATA
    CCGTACCGTC CCGATTTTCT GCTTCACCGA AAGCAGGGAA GCTCCTGGCG CAGTCGGGGG
    AACAGACAGA ACCGGCCGGG ACAGCAGGGA AAGCAAACCG AGCCCGAGAT CGATGAGACA
    CTGATCGAGG AGATGAACCG GATTATGAAA AATCATAATG ACAACGACAA CGACGCGGAT
    GAGACACTGG ATGCCAGCTT TTGGGGTGTC GGACCATCGT CAGACGGGTA TGGTGAGCCG
    TCCCTTACGC AGCGCGAGTA TGACCTGACC GAGACGGACG CGGAGGAGAC TCACGACGAG
    GGTGCACATC ACCATCAACA TCACGACCAC CGTGCTCCGA GCCATTACCG CCGTCGGAAT
    GGGGTTGAGC AGGAGGACGA TGGCGACCAG CACGAATCAA AGGACACCGA TCACCGTCAG
    CGCCAGCAAC AGCAGCAGCA ACAGCATCAA CAGCAGCCAC AGAAACCACA GGCAGACCCG
    CGAAATGACG AGGATGACAT CCTTCGCAGC ATCAAGTACA AAAACAAATA CTTCCGCAAG
    TATCAGGGCG CGTGTTTTAC CGGCACGGAC AGCTACTTTC ACTACAGCGA CGCCGAAACG
    ATGCGGCGTG TCATTAGCTA CGAGATTGAT TTGAATCTTC GCTTTAAAAC GCACTCGGCG
    AACGGGCTAA TCCTCTGGAC CGGTCGGCAC AGTGCCCTAG AGGGTGACGA CTTTTTGTCG
    CTGGGCATCG AAAATGGATA TCTACATCTT CGATACAACC TCGGGTCGGG TGAAATTAAC
    ATCAAATATA ACTCGACCAA AGTCAGTGAC GGGCTTTGGC ATCGTGTGCG GGCGTTAAGG
    AACTCGCAGG ACGGTACGCT GAAGGTTGAC GGTGGTAAGC CAATCACCAG AAGATCTCCC
    GGCAAGCTAA GGCAGCTCAA CACGGACACT GGGCTCTATG TCGGTGGACT GCCGGCGGCG
    GCTCATTACA CCCGCCAGAG GTACCGGACC GGCATGGTGG GCTGCATCTC GGAGCTCATC
    CTGGCCGGCG AGCTGCGGTT GAACTTTGAC GCGACGATAC TGGGCACCGC ACACAACGTC
    GAGCCGGGCG CTCCGTGA

    The stop codons will be deleted if pcDNA3.1+/C-(K)DYK vector is selected.
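To illustrate the note above, the following is a minimal sketch of how a terminal stop codon could be trimmed from an ORF before fusing it in frame to a C-terminal tag. It is not GenScript's procedure; the helper names `clean_sequence` and `strip_terminal_stop` are hypothetical, and only the last line of the listed sequence is used as input.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}

def clean_sequence(block: str) -> str:
    """Keep only nucleotide letters, dropping spaces, digits and newlines."""
    return "".join(ch for ch in block.upper() if ch in "ACGTN")

def strip_terminal_stop(orf: str) -> str:
    """Remove a single trailing stop codon, if one is present."""
    if len(orf) % 3 != 0:
        raise ValueError("ORF length is not a multiple of 3")
    if orf[-3:] in STOP_CODONS:
        return orf[:-3]
    return orf

# Last line of the ORF as printed on this page (18 nt, ending in TGA).
tail = clean_sequence("GAGCCGGGCG CTCCGTGA")
print(strip_terminal_stop(tail))  # GAGCCGGGCGCTCCG
```

Applied to the full 4038 nt sequence, the same step would leave 4035 nt of coding sequence ahead of the tag.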

    RefSeq: XP_314061.4
    CDS: 231..4268
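The CDS coordinates can be cross-checked against the stated ORF length. The sketch below assumes the coordinates are 1-based and inclusive, as in GenBank feature notation, and that the final codon is a stop codon; `cds_length` is a hypothetical helper.

```python
def cds_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates."""
    return end - start + 1

length = cds_length(231, 4268)
print(length)           # 4038, matching the stated ORF length
print(length % 3)       # 0, so the CDS is a whole number of codons
print(length // 3 - 1)  # 1345 residues, assuming the last codon is a stop
```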

    Target ORF information:

    RefSeq Version: XM_314061.4
    Organism: Anopheles gambiae str. PEST
    Definition: Anopheles gambiae str. PEST AGAP005165-RA (AgaP_AGAP005165), partial mRNA.

    Target ORF information:

    Epitope: DYKDDDDK
    Bacterial selection: AMPR
    Mammalian selection: NeoR
    Vector: pcDNA3.1+/C-(K)DYK

    ORF Insert Sequence: identical to the ORF nucleotide sequence listed above (4038 bp).

  • PubMed

    The Anopheles gambiae genome: an update.
    Trends in Parasitology 20(2):49-52 (2004 Feb)
    Mongin E,Louis C,Holt RA,Birney E,Collins FH


    The genome sequence of the malaria mosquito Anopheles gambiae.
    Science (New York, N.Y.) 298(5591):129-49 (2002 Oct)
    Holt RA,Subramanian GM,Halpern A,Sutton GG,Charlab R,Nusskern DR,Wincker P,Clark AG,Ribeiro JM,Wides R,Salzberg SL,Loftus B,Yandell M,Majoros WH,Rusch DB,Lai Z,Kraft CL,Abril JF,Anthouard V,Arensburger P,Atkinson PW,Baden H,de Berardinis V,Baldwin D,Benes V,Biedler J,Blass C,Bolanos R,Boscus D,Barnstead M,Cai S,Center A,Chaturverdi K,Christophides GK,Chrystal MA,Clamp M,Cravchik A,Curwen V,Dana A,Delcher A,Dew I,Evans CA,Flanigan M,Grundschober-Freimoser A,Friedli L,Gu Z,Guan P,Guigo R,Hillenmeyer ME,Hladun SL,Hogan JR,Hong YS,Hoover J,Jaillon O,Ke Z,Kodira C,Kokoza E,Koutsos A,Letunic I,Levitsky A,Liang Y,Lin JJ,Lobo NF,Lopez JR,Malek JA,McIntosh TC,Meister S,Miller J,Mobarry C,Mongin E,Murphy SD,O'Brochta DA,Pfannkoch C,Qi R,Regier MA,Remington K,Shao H,Sharakhova MV,Sitter CD,Shetty J,Smith TJ,Strong R,Sun J,Thomasova D,Ton LQ,Topalis P,Tu Z,Unger MF,Walenz B,Wang A,Wang J,Wang M,Wang X,Woodford KJ,Wortman JR,Wu M,Yao A,Zdobnov EM,Zhang H,Zhao Q,Zhao S,Zhu SC,Zhimulev I,Coluzzi M,della Torre A,Roth CW,Louis C,Kalush F,Mural RJ,Myers EW,Adams MD,Smith HO,Broder S,Gardner MJ,Fraser CM,Birney E,Bork P,Brey PT,Venter JC,Weissenbach J,Kafatos FC,Collins FH,Hoffman SL