AgaP_AGAP000585 cDNA ORF clone, Anopheles gambiae str. PEST

The following AgaP_AGAP000585 cDNA ORF clone sequences were retrieved from the NCBI Reference Sequence Database (RefSeq). Each sequence represents the protein-coding region, that is, the open reading frame (ORF), of the AgaP_AGAP000585 transcript. ORF sequences can be delivered in our standard vector, pcDNA3.1+/C-(K)DYK, or in a vector of your choice, as expression/transfection-ready ORF clones.

***CloneID: OAh09399
Accession No.: XM_001688236.2 (latest version)
Definition: Anopheles gambiae str. PEST AGAP000585-RA (AgaP_AGAP000585), partial mRNA.
**Vector: pcDNA3.1+/C-(K)DYK or customized vector
*Turnaround time: TBD
Price (USD): $713.30 (list price $1019.00)

ORF Online Only Promotion

Next-day Shipping ORF Clones (in default vector with tag)
1 clone: 30% OFF
2-4 clones: 40% OFF
5 or more clones: 50% OFF
All Other ORF Clones
30% OFF
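
As a rough illustration of how the promotion tiers above map onto a price, here is a minimal Python sketch. The function names `discount_rate` and `discounted_price` are hypothetical, and the sketch assumes the tier is chosen by the number of clones in one order and that the discount applies to the list price; the listing does not spell out either rule.

```python
from decimal import Decimal, ROUND_HALF_UP


def discount_rate(n_clones: int, next_day: bool) -> Decimal:
    """Return the fractional discount for an order (assumed tier logic)."""
    if n_clones < 1:
        raise ValueError("n_clones must be at least 1")
    if not next_day:
        # "All Other ORF Clones": flat 30% off
        return Decimal("0.30")
    if n_clones >= 5:
        return Decimal("0.50")
    if n_clones >= 2:
        return Decimal("0.40")
    return Decimal("0.30")


def discounted_price(list_price: Decimal, n_clones: int, next_day: bool) -> Decimal:
    """Apply the tier discount to a per-clone list price, rounded to cents."""
    rate = discount_rate(n_clones, next_day)
    price = list_price * (1 - rate)
    return price.quantize(Decimal("0.01"), rounding=ROUND_HALF_UP)


# The listed clone: list price $1019.00 at the flat 30% tier.
print(discounted_price(Decimal("1019.00"), 1, next_day=False))
```

Under these assumptions the flat 30% tier applied to $1019.00 reproduces the listed price of $713.30.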

* Turnaround time is given in business days.

** You may select a custom vector to replace pcDNA3.1+/C-(K)DYK after the clone is added to the cart.

** GenScript guarantees 100% sequence accuracy of all synthetic DNA constructs we deliver, but we do not guarantee protein expression in your experimental system. Protein expression is influenced by many factors that may vary between experiments or laboratories. In addition, please pay attention to any signal peptide, propeptide, or transit peptide in the target ORF, which may affect the choice of vector (N- or C-terminal tag vector).

*** One clone ID may correspond to multiple accession numbers that share the same CDS sequence.

  • Reference Sequences (RefSeq)
    CloneID: OAh09399
    Clone ID Related Accession (Same CDS sequence): XM_001688236.2
    Accession Version: XM_001688236.2 (latest version)
    Vector: pcDNA3.1+/C-(K)DYK or customized vector
    Tag on pcDNA3.1+/C-(K)DYK: C-terminal DYKDDDDK tag
    ORF Insert Method: CloneEZ™ Seamless cloning technology
    Insert Structure: linear
    Update Date: 2018-04-25
    Organism: Anopheles gambiae str. PEST
    Product: AGAP000585-PA
    Comment: PROVISIONAL REFSEQ: This record has not yet been subject to final NCBI review. This record is derived from an annotated genomic sequence (NC_004818). On Sep 28, 2011 this sequence version replaced XM_001688236.1. COMPLETENESS: incomplete on the 5' end.
    Sequence Information: ORF Nucleotide Sequence (Length: 3891 bp)

    ATGGGCAGTG ATCATTACTG CTTGAGGTGG AACAATCATC AATCCAACCT TCTCGGGGTG 
    TTCAGCCAAT TGCTCCAGGA CGAGAGTCTG GTGGACGTCA CACTCGCTTG CTCAGAGGGC
    GCCTCCATCC GAGCCCACAA GGTTGTACTC TCGGCCTGCT CCTCCTACTT CCAGACGCTG
    TTTCTCGACC ATCCGGCCCG GCACCCGATC GTCATACTGA AGGACGTTCG CTTTGCCGAG
    CTGCGCACGC TGATCGAATT CATGTACAAG GGCGAGGTAA ATGTCGAATA CTGTCAGCTT
    TCGGCACTGC TAAAGACGGC AAAATCGCTC AAGGTCAAAG GACTGACGGA GATGACAAAT
    CAGAGCAGCC CGCTGACGGA ACCGAAGCTA GAGCCGGAGC GACTCCGACC CCACAGCAGC
    CATGGTGCGG GCGGTGGCGT CGGCGCGACC GGCGGCAGCA ACAGTGGCAG CGGGAGCGGC
    GGCACCACCC TGGCCGCCTC GGAAGCGATC CTGCCGACCA ACCTCTCCAC GACGCTCCCG
    ACGGCAACGC TCGCGGGCGG CGGTGGCGGC CCGTATCTCA CCTCAACGAC CACCACCGCC
    ACGCCGGGCG CTGCGTCGAC GACGGTCAGC ACCACCGCCA CGACTGCTAC GACCACCGGC
    AGCAGAAGCA GCACCACCAC TACTACTACT ACTACGAGCG AACCGGACGG TGCCGACAGA
    CCGGCAGCGG CGGAGCAGCA GCAGCAACAT CACCAGCAAC ACTCGCAACA GCATCACTAC
    ACGACGCAGC AGCAACGTTT GAGTTCCTCC GGCATTGGCA GTGCCAGCGT CACCTCGGCA
    AGCTCCCTCA GCAGCAGCAG CAGCAGCAGC AGTAGCACGG CCACCGCCGG CAACGTCACC
    GATTTGTCCA CCACGCACGG GAAGGACGCG GCAATGGTGT GCGAAGCGGA CGCGTCGACA
    GCCGCCGCCC CGAGCGCCAC CACGACCACT ACGCCCCTCT CGCTGCTCAC CCGAAAGCAT
    CGCGACGCGC CGATCGACAT CGCAACCGCC ACCACAGCCT TATCCTCAGC GAGTGCCGCC
    GCCGTAACGT CGTCGGCCCG CACAGCTGCA CGGTGGTGGC GGTGGTGCCT AGCAGCAGCA
    GCAGCACCAC CACCACCCAC CGTCATCCAG CGGCCGGCGT CGGGCGATGG CATTCTCTCT
    TCCCAGCTGG CCGCGTCGGC AGTCAGCAGC GACGGCAGAC GTAGCGCCAC ACCCCTGCAC
    GACGAGCCGC CCAGCCGGGA GGAGGAAGAA CCCGAGGAGA TGGAGCAGGA GGTAAACGAT
    CGCCATCCGC AGGATGCGCG GACTACACCG ACAATCGGCG CCGCGCTAGA CGACGAACGG
    GACACTACTA CAACCGCCAC CACCACCACC ACCACGGTCA GCGATAAGGC AGACGCGCAC
    CAAAGTGCTC ACCAGGAGCG GGGAGCAAGG ATAGGGACGG CGTCGCCCCC GGCCGGCAGC
    GGGGGCGATA CGGACGAGTG CGCCGCAAGC GACGAACCCG CCGCCGCAGG CGACCCAGTA
    ACCGCGTGCT GCTCCACCTC GATCGCGGAG CGGCAGCCAT CGCCCGCACC GGCCGCCCTG
    ATCCCCGACA GCAGCCTATC GCGCAGCAGC CCGCTCGGCG AGCTGGTGGA CGAGGTGCGG
    CGGCCGGCAA GCAGTGCCGC TGCTGCGGCC GCCGCCGCCG CCGCTGCCGC CCTGATGGCA
    ACCGATCTGC GCACGGACAC AGACCATCAT CACGCGATGC TGCTGCGGGA CGTGACGATG
    GGAACCGCCG ACTACGACGA TGAGGAGGAA GAGGAGGAGG AGGACGAGGA CGAGGGGGAG
    GACGTGGACG ATGGGGCAGC GGAGATGCTG CTGGCAGCGG CCGCCGTCGC ACAGGTTGCA
    CGGGAGGCAC GCGATGAAGG GACGGACGCC ATCGTCATCG GCACTAGCGA GGAGATGCGG
    GCAGCACGGG CAGCGGCGGC AGCGGCGGCG CTCCAGCAGC AGCTGCACCA GCACCATCAG
    CAGCAGCTGG CGGCCGCGGA GCTGCTGCGG ACGGCCGAGC AGCAGCAGCG GGTGCGCCAG
    CAGCAGCAGC ACCAGTCCCG TCGCCGCCGC TCGTCCGAGG ACGGTCCGGC CCGGCCAAGT
    GGCACCACCA CAAACAACAA CAATAACAGC AGCAGCGCCA CCTACAACAA CAACTCCGAC
    GACGAGTATC AGCCACCGAT GAAGATCAAA AACGAAGTCG ACTTTTCAGC GTTCGCCGGT
    CTGAACATGA CGATCAATGC GGCGGCAAGC GCCAACGCCA ACCTGTCCGC CGCACTGAAC
    GCGGCGTCGG CCGGTCCGGG CCCGATCCTG CTCGCGACCG ATCTGCGGCC GGCATCGCGC
    GGCGGCAGCA CCACCACCAC AACCACCTCG GCCGAGGCCG CCGCCCAGAG CGGGAACAAC
    GGCAGCAAGA ACGGCGAGCA GCAGCAGCAG CGCGAGCGTG GTTCGCCCCA GCACCAGCAT
    CTGCAGCACC ATCACCATCA GCGGGCGGCG GCGGCCACCG CAGCCGCGCT GGCCGTCTTT
    AACGCGAGCG GCCTCTCCAG CCCCCTGGCC GAACCGATCG CGGGACCGTC CGGCATCATG
    GGCCCGGTCC AGCAAGTGCC TCTGTCGCTA AAGAAGGAGA TCGACTGGGG CGACGACAAG
    TCTTCGGGCG AAAGCTCACT AGACTTCCGT CACGCGCACG ACTCGGAAGC GTCGGAAGGG
    CTGCATGGAA GCGTTGGTGG TGGTGTTGGT GGTGTTGGTG TCGGCGGCAG TGGCACGACC
    GGCGGTGTGA GTGCTAGTGG CCACGGCAGT GCGAACAGTG CAGGCAGCGG CGGCAGTGGC
    AGCAACAGTG GCGGCATCCT CGGGCTCTGT ACCACCGCGC ACCACGGGAG CGGTGACGCC
    CACAGTGGAC GCAGGTCGGC GGGAGGCAGT GTGGGCAGTG GTGGCGGCGG CGGCACCAGT
    GGGAGGGAAA GCTCCGGTCC GGCCACTGCG ACGGCGGGCG GCAGTGGGTC GGCGGCCCCG
    GCAGCCTCGG GGAGAAGTAG TAGTGGTGGT GTTAGTGGTG GTGGTGGAAG TGGCGGTGGC
    GGTGGCGGCG CCGTCGCCGG CCTCCCGGTG CACACCTGCA TGTACTGTGG CGTGACGTTT
    CACAACCAGA ACAAGCTGAC CCGCCACATC CTCTCCCACT CGCTCGAGTC GCTCAAGTTC
    CGGGAGGCGA CGCACCTGCT CCATCCGCAC CTCGCCCTGC GGCACGAGCT GACGCAGGTG
    GCGCTCGCCG CCCAGCAGGC CCAGTCGACG TCGCCGTCCC AGCTGCCGTC GCCCCACTAC
    GGCCACCATC GGGGCAGCGG GGTGGGGGGC GGGCCGACGC TCGAGCAGAT GCTCGATCCG
    GCGGAAGCGA TGGCCGAGCT GGAGCTGGCG GCCGTGCTCG CGTCCCAGTC GGGCGGCCTC
    GGCCTCGGCC CGGTGGGCGG CGTCAATCCG CTCGCGGGCG GCGGGGGCGG CGGCGGCGGT
    GGTGCCGGCG GGCTCGGCCC CGACCAGGGC GGCAACAGCG TGGTGCTGTG CAAGTTCTGC
    GGCAAGAGCT TCCCGGACGT GACGTCGCTG ATCACGCACC TGCCGGTGCA CACGGGCGAC
    CGGCCGTTCA AGTGCGAGTT CTGCGGCAAG GCGTTCAAGC TGCGGCACCA CATGAAGGAC
    CACTGCCGAG TGCACACAGG CGAGCGGCCG TTTCGGTGCG GCATGTGCGG CAAGACGTTC
    TCGCGCTCGA CCATACTGAA GGCGCACGAG AAGACGCACT ATCCAAAGTA CGCGCGAAAG
    TTCCTGGCGT CCCCACCGCC GATCGACGGG AAGGACGATT CGCCCCAATA G

    The stop codons will be deleted if pcDNA3.1+/C-(K)DYK vector is selected.
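
The note above implies that the ORF is fused in frame to the C-terminal DYKDDDDK tag once its terminal stop codon is removed. A minimal Python sketch of that step follows. The helper names are hypothetical, and `DYK_TAG_DNA` is only one possible DNA encoding of DYKDDDDK: the vector's actual tag, linker, and downstream stop codon sequences are not given in this listing, so treat them as assumptions.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}

# Assumption: one valid encoding of D-Y-K-D-D-D-D-K, not the vector's
# documented tag sequence.
DYK_TAG_DNA = "GACTACAAGGACGACGATGACAAG"


def strip_terminal_stop(orf: str) -> str:
    """Remove whitespace, then drop any stop codon(s) at the 3' end."""
    seq = "".join(orf.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("ORF length is not a multiple of 3")
    while seq[-3:] in STOP_CODONS:
        seq = seq[:-3]
    return seq


def with_c_terminal_tag(orf: str, tag: str = DYK_TAG_DNA) -> str:
    """Fuse the stop-less ORF in frame to a C-terminal tag sequence."""
    return strip_terminal_stop(orf) + tag


# Last line of the ORF listed above (positions 3841..3891), ending in TAG.
tail = "TTCCTGGCGT CCCCACCGCC GATCGACGGG AAGGACGATT CGCCCCAATA G"
print(strip_terminal_stop(tail))
print(with_c_terminal_tag(tail))
```

Only the terminal stop codon is touched here, so the reading frame of the insert is preserved. The sketch leaves out whatever follows the tag in the real vector.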

    RefSeq XP_001688288.2
    CDS 1..3891
    Translation
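
The CDS spans positions 1..3891, which is 1297 codons including the terminal stop. The protein translation can be recomputed from the nucleotide sequence with the standard genetic code, as in the minimal Python sketch below. The function name `translate` is hypothetical, and the sketch assumes an unambiguous A/C/G/T sequence; an ambiguous base such as N would raise a `KeyError`.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence; '*' marks a stop codon."""
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First and last lines of the ORF listed above; both start in frame
# because each line begins at a position of the form 60*k + 1.
first = "ATGGGCAGTG ATCATTACTG CTTGAGGTGG AACAATCATC AATCCAACCT TCTCGGGGTG"
last = "TTCCTGGCGT CCCCACCGCC GATCGACGGG AAGGACGATT CGCCCCAATA G"
print(translate(first))  # MGSDHYCLRWNNHQSNLLGV
print(translate(last))   # FLASPPPIDGKDDSPQ*
```

Joining all sequence lines before calling `translate` would give the full 1296-residue product followed by `*`.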

    Target ORF information:

    RefSeq Version XM_001688236.2
    Organism Anopheles gambiae str. PEST
    Definition Anopheles gambiae str. PEST AGAP000585-RA (AgaP_AGAP000585), partial mRNA.

    Vector information:

    Epitope DYKDDDDK
    Bacterial selection AMPR
    Mammalian selection NeoR
    Vector pcDNA3.1+/C-(K)DYK

  • PubMed

    The Anopheles gambiae genome: an update.
    Trends in Parasitology 20(2): 49-52 (2004 Feb)
    Mongin E,Louis C,Holt RA,Birney E,Collins FH


    The genome sequence of the malaria mosquito Anopheles gambiae.
    Science (New York, N.Y.) 298(5591): 129-49 (2002 Oct)
    Holt RA,Subramanian GM,Halpern A,Sutton GG,Charlab R,Nusskern DR,Wincker P,Clark AG,Ribeiro JM,Wides R,Salzberg SL,Loftus B,Yandell M,Majoros WH,Rusch DB,Lai Z,Kraft CL,Abril JF,Anthouard V,Arensburger P,Atkinson PW,Baden H,de Berardinis V,Baldwin D,Benes V,Biedler J,Blass C,Bolanos R,Boscus D,Barnstead M,Cai S,Center A,Chaturverdi K,Christophides GK,Chrystal MA,Clamp M,Cravchik A,Curwen V,Dana A,Delcher A,Dew I,Evans CA,Flanigan M,Grundschober-Freimoser A,Friedli L,Gu Z,Guan P,Guigo R,Hillenmeyer ME,Hladun SL,Hogan JR,Hong YS,Hoover J,Jaillon O,Ke Z,Kodira C,Kokoza E,Koutsos A,Letunic I,Levitsky A,Liang Y,Lin JJ,Lobo NF,Lopez JR,Malek JA,McIntosh TC,Meister S,Miller J,Mobarry C,Mongin E,Murphy SD,O'Brochta DA,Pfannkoch C,Qi R,Regier MA,Remington K,Shao H,Sharakhova MV,Sitter CD,Shetty J,Smith TJ,Strong R,Sun J,Thomasova D,Ton LQ,Topalis P,Tu Z,Unger MF,Walenz B,Wang A,Wang J,Wang M,Wang X,Woodford KJ,Wortman JR,Wu M,Yao A,Zdobnov EM,Zhang H,Zhao Q,Zhao S,Zhu SC,Zhimulev I,Coluzzi M,della Torre A,Roth CW,Louis C,Kalush F,Mural RJ,Myers EW,Adams MD,Smith HO,Broder S,Gardner MJ,Fraser CM,Birney E,Bork P,Brey PT,Venter JC,Weissenbach J,Kafatos FC,Collins FH,Hoffman SL