AgaP_AGAP007203 cDNA ORF clone, Anopheles gambiae str. PEST

The following AgaP_AGAP007203 gene cDNA ORF clone sequences were retrieved from the NCBI Reference Sequence Database (RefSeq). These sequences represent the protein coding region of the AgaP_AGAP007203 cDNA ORF which is encoded by the open reading frame (ORF) sequence. ORF sequences can be delivered in our standard vector, pcDNA3.1+/C-(K)DYK or the vector of your choice as an expression/transfection-ready ORF clone. Not the clone you want? Click here to find your clone.

***CloneID Accession No. Definition **Vector *Turnaround time Price (USD) Select
OAh00906 XM_001687956.1
Latest version!
Anopheles gambiae str. PEST AGAP007203-RB (AgaP_AGAP007203), mRNA. pcDNA3.1-C-(k)DYK or customized vector 25 $797.30
$1139.00
OAh00907 XM_308578.4
Latest version!
Anopheles gambiae str. PEST AGAP007203-RA (AgaP_AGAP007203), partial mRNA. pcDNA3.1-C-(k)DYK or customized vector 25 $881.30
$1259.00

ORF Online Only Promotion

Next-day Shipping ORF Clones ( in default vector with tag)
1 Clone 30% OFF
2-4 Clone 40% OFF
5 or more Clone 50% OFF
All Other ORF Clones
30% OFF

*Business Day

** You may select a custom vector to replace pcDNA3.1+/C-(K)DYK after clone is added to cart.

** GenScript guarantees 100% sequence accuracy of all synthetic DNA constructs we deliver, but we do not guarantee protein expression in your experimental system. Protein expression is influenced by many factors that may vary between experiments or laboratories. In addition, please pay attention to the signal peptide, propeptide and transit peptide in target ORF, which may affect the choice of vector (N/C terminal tag vector).

***One clone ID might be correlated to multiple accession numbers, which share the same CDS sequence.

  • Reference Sequences (Refseq)
    CloneID OAh00906
    Clone ID Related Accession (Same CDS sequence) XM_001687956.1
    Accession Version XM_001687956.1 Latest version! Documents for ORF clone product in default vector
    Sequence Information ORF Nucleotide Sequence (Length: 4236bp)
    Protein sequence
    SNP
    Vector pcDNA3.1-C-(k)DYK or customized vector User Manual
    Clone information Clone Map MSDS
    Tag on pcDNA3.1+/C-(K)DYK C terminal DYKDDDDK tags
    ORF Insert Method CloneEZ™ Seamless cloning technology
    Insert Structure linear
    Update Date 2018-04-25
    Organism Anopheles gambiae str. PEST
    Product AGAP007203-PB
    Comment Comment: PROVISIONAL REFSEQ: This record has not yet been subject to final NCBI review. This record is derived from an annotated genomic sequence (NT_078265).

    1
    61
    121
    181
    241
    301
    361
    421
    481
    541
    601
    661
    721
    781
    841
    901
    961
    1021
    1081
    1141
    1201
    1261
    1321
    1381
    1441
    1501
    1561
    1621
    1681
    1741
    1801
    1861
    1921
    1981
    2041
    2101
    2161
    2221
    2281
    2341
    2401
    2461
    2521
    2581
    2641
    2701
    2761
    2821
    2881
    2941
    3001
    3061
    3121
    3181
    3241
    3301
    3361
    3421
    3481
    3541
    3601
    3661
    3721
    3781
    3841
    3901
    3961
    4021
    4081
    4141
    4201
    ATGCTGAAGG AAGAGCAGCC CGTTGGTACG CCGGTACTGC GCGTGACGGC GTCCGATCCA 
    GACCCGCGGC AAACGATCGA GTACAGCTTC GTGACAACGC CCGGCGAACG GGCACGCTTT
    CGGATCGATA AGAGCACGGG CGACATAACG ACCGCTCACA TCTTCGATCG AGATGAGCCG
    ATCCGAGAGA AAGAGATCTA TATCACGGTC CGTGCGACCG ATAATGGACG TCCTCGGTTG
    GACGATGTGT GCACTTTCAA GGTGACGATT CTCGACATTA ATGACAATCC GCCCGTGTTC
    GATAAGGTAC GCTACGAAGA GTCGGTGACG AAAGACATGA AGGCGAACCT GCGAGTGGCT
    ACCATCTCGG CGACGGATAT GGACGACGGT GATAACAGTA TCATCAAGTA CGAAATCGTG
    CAGCAGAACC CAGACAGTAG CTACTTCAAG ATTAACGAGA ACAACGGTCT CCTTACGCTT
    ACGAAACCGG TCGATCGTAG CCCCGGGCAG TACTATTCTA TCCGCGTGCG AGCGTACAAT
    GTCGATCCAC AGGGTGAGGC CGTGCAGGAT GCGGAGGTCG ATGTTAAGGT ACGCGTCGTG
    GAGTCCAACA AGCTGCCCCC ATACTTCACC AAGGTGCAAG CGGCAGCTCT GGTGTTGAAC
    GAGACATTCA AAAACTACAC CGAATCGTTG GCAGAGTTCG AAGCAGAGTC CAACATTCCC
    GAGAAACCGG AAGTAATCTT CGAACTGATC CAAGGTCGTG CTGAGCAGAC GAACTCGAAA
    AATACATTCC TGCTGGAGCA GATCAACAAC ACTGCTTCGA TCAAGCTTGG AAAGACACTC
    GATTATGAAA CTGTGACGGA GTACACGCTG ACAGTGAGCG TCAAAAACTC GCACGATCTT
    GTGGCGGAAA CTGTGCTGAA GATTAAGGTA CTGGACGAGA ACGACATCAT TCCAGTGTTT
    ACGGAAGTGA CATCGGGTAC GATACCGGAA GATGAGCCAC CCGGCACTCC CGTGATGCAG
    GTACGAGCAT ACGATCTGGA CGGCACACCG GCCAACAACA TCGTCTCTTA CAGGTTTGAC
    GATGAGAACC AACAACTGTT CCACATCGAC AGTCGAACGG GAAACATCAC GTCGCGGGTG
    GAGTTCGATC GAGAAGCAAC CGATTCTTAT CATTTGAAAA TCATCGCAGA AGACAACTCT
    CCGTCCGCGC TGTACCGCAA CGGCAAACCG AACAGTATCT CTCAGCTGTT TATCATCAAG
    ATCTCGGACA AGAACGACCA CCAGCCGAAG TTCGTGAAGG ATCACTTCGT GGCGGAGAAT
    GTGCCGGAGG ATGCGAATAT CAACACCGTG GTGATTGAAG TGACGGCTCT GGATCAAGAC
    ACGGCCTCGC TCATCACGTA CAGCATCATC GAGGGCAATG TGGGTGATGC GTTCAAGATC
    GATGAAAACA CCGGTCGTAT CTCGGTAAAC AGCCGCCTCG ACTACGAGAC TATTCGTGAG
    TACATGCTGA TCGTGCAAGC GGACGATGGG ATCTTCCAGG ACAACGCAAC AGTCTCGATC
    AAGATCGAAA ACGTCAACGA TAATCCGCCG CGCTTCATCG ATCTGCGCAA TGTGACGATC
    CAGGAAGAGA CCATTCCGCC CGGGTGTATT ATGACCATCC AGGCGTATGA TCCGGACATC
    GAGAACCGAG ACGAGCCGCA GCACATTCGC TTTTCGTTTG TGAAGGAACA GGAAGATCTG
    CTCGAGATCG ATGATACCGG TTGCCTTCGG TTAAGGAAGG CACTGGATCG TGATCCACCG
    CAAGGCTTCA AGAGCTGGCA GTTCATCATC ACCGCGACGG ATGAGGATGG TGCGGGCAAG
    AAAACGCCAG CGACTGTAAA CATATTCTTG GAAGACATTA ACGACAACGC ACCACGGCTT
    TCGAATGCAA TGCCAGTCGT CTGGGGTGAG AATCGTTCTC CTGGTTTGAT CGTACGACTG
    ACGGCAGAGG ATGTGGACGA GGCGCAGAAC GGTCCACCGT TCCATTTCAG TATTGATCCG
    AATGCTCCGT ATGAGATTAA GGAACGGTTC CAAGTCCAAA ACGACGAGCT GTACGCGTTG
    GTTGAGTTCG ATCGAGAGGA GCAAAAGGAG TACCGAGTTC CAATCCGCAT TAGCGACTCT
    GGCGAAGAAC CCATGAGTGA CGTCAGTATC CTGCAGCTGG TGATTGGAGA TGACAACGAC
    AACGAGATGA GACCAGGCGA AAGTCGCATC TTCGTGTACA ACTACAAAGG AGAATCACCC
    AACACGGAAG TTGGACGCGT GTACGTGGAC GATCCGGACG ACTGGGATCT TCCAGATAAG
    ACGTTCATGT GGGATGATGC TACCCGTCAT CAGTCGGTGG ACTTTTTCGA CCTAAACAGG
    GACACTGGTA TGATCACCAT GCTTCAAGGT ACACGTGGGG GAGATTATGA GCTGAACTTC
    CACGTGATTG AGCAGTCGAG TCATTTCCCG CGCCATAATG TCACAGCCAA AGTGACGGTA
    ACGGTGAAAG AGATCCCGGA GGAAGCCGTT GATAAGAGTG GATCGATCCG TTTCCATAAT
    GTAACTGCGG AAGAGTTCGT ATCGCGCACA CCTGGTCAGC TCACAACGCC CAAAGATCGA
    CTGCAGGCGA GTATTGCAAA CACGCTCAAC GTTAGCCGAG AGAACGTGGA TGTGTTCACC
    GTTTTGAAGC GCGACAATGT CAACGGAACC TTTCTGGACG TTCGGTTCTC GGCCCATGGT
    AGTCCTTATT ACGCACCTGA ACGTCTCAAC GGAATGATGG GCTATCGGCT CCGTCAGTTG
    GAAGAGGACG TAGGCTTTTC GGTGCTTATG GTGGGCATTG ACGAGTGTAT CGAAGAAGGT
    CGCAACTGTG AGCTATCGTG CAAGAATACG CTCTACAAGT CGAACGTCCC TATTGCGGTA
    TACACCAACA CCAGCTCGTT TGTGGGTGTG AATGCCTTCG TACAGGCAGA GTGTGTGTGT
    GAGGCTCCTT CACCGTCGCT CACATGTCTG AATGGAGGTT TCTTGGTGAA CGATCGCTGC
    TCCTGTCCCG AGGGCTTTGA AGGTCCTCAT TGTGAGATGC TTGGAATCGG GTTCTATGGC
    AGTGGATATG CGCTTTATCC TCCGATCAGT CCTTGCAACA TGACGCGAAT AAGTGTAGAA
    CTATCACCGC AGCAGGAAGA TGGCTTGGTG ATGTACATTG GTCCACTCAA CTATAATCCT
    CGTCTGCCTG TGCAGGACTT CTTGGCGTTG GAGCTCGTGA AGGGACTCCC TGTGTTGCTG
    CTTGACTACG GCAGTGGTAC CATTCGTATC GAACATCGGC ATCGATTCCC CCAGGGCAAA
    CCTTTCACCG TAGAGATCGT GTTGCAACCA CAAACAATCG AGATGATCGT CGACAACTGC
    AAGCTGTCCA CGTGCATGAG TCTGGATGCG CCAAAGGGAC CAAATCGGTT CCTAAACGTA
    AACGCTCCAC TGCAACTCGG AGGAGCTGCG GTTAATCTAG ACTACCTCGG ATCGCTGTTC
    AACTGGACGT ACGTGCCACA GGACAAGGGT TTCAGTGGCT GTCTGCGCAA TCTCACCATC
    AACGAGCGCA CGTACGATCT CGGTCTGCCC AGCCTGGCAA AGAATGCCGA TCCGGGTTGC
    CAACGCTCGG TTGCGGTCGC CGTGTCGTTT GGCATCGATT CCTACTTCCT CATTGCAATC
    ATCGCGTGCA TAGTGGTGTT GCTCATTTTG CTGCTCGCCG TCGTCGTGCA CAAGAAGCAC
    CAGGACGGCT GGCATGAGAA GGATATGGAC GACATCCGGG AGACGATTAT CAACTACGAG
    GAGGAGGGTG GCGGCGAACG GGACGCCGAG TACGATCTCA CCGTGCTGCA AGGTCCGCCG
    ATCTATCTAG ACAAACCGTA CGGCAGCGAG CTGCGACAGA AGGAGGCAAA CACCGAGGTG
    CCTGATATTG GCGCGTTCCT CACCGACAAG AAGGACGCGT GCGACAAGGA TGCGGACGCG
    TATCCGATCG ACGATGTGCG GCATTACGCG TACGAGGGTG ACGGCAACAG TACGGGCTCG
    CTGTCGAGCC TCGCTTCCTG CACGGACGAG GGTGATCTGA AGTTCAACTA TCTATCGAAC
    TTCGGTCCAC GGTTCCGCAA GCTCGCCGAC ATGTATGGAG AAGAACCGTC CGACACCGAC
    TCGAACGTGG ACGATGAGGA AGGCTGGAGG ATATGA

    The stop codons will be deleted if pcDNA3.1+/C-(K)DYK vector is selected.

    RefSeq XP_001688008.1
    CDS307..4542
    Translation

    Target ORF information:

    RefSeq Version XM_001687956.1
    Organism Anopheles gambiae str. PEST
    Definition Anopheles gambiae str. PEST AGAP007203-RB (AgaP_AGAP007203), mRNA.

    Target ORF information:

    Epitope DYKDDDDK
    Bacterial selection AMPR
    Mammalian selection NeoR
    Vector pcDNA3.1+/C-(K)DYK
    XM_001687956.1

    ORF Insert Sequence:

    1
    61
    121
    181
    241
    301
    361
    421
    481
    541
    601
    661
    721
    781
    841
    901
    961
    1021
    1081
    1141
    1201
    1261
    1321
    1381
    1441
    1501
    1561
    1621
    1681
    1741
    1801
    1861
    1921
    1981
    2041
    2101
    2161
    2221
    2281
    2341
    2401
    2461
    2521
    2581
    2641
    2701
    2761
    2821
    2881
    2941
    3001
    3061
    3121
    3181
    3241
    3301
    3361
    3421
    3481
    3541
    3601
    3661
    3721
    3781
    3841
    3901
    3961
    4021
    4081
    4141
    4201
    ATGCTGAAGG AAGAGCAGCC CGTTGGTACG CCGGTACTGC GCGTGACGGC GTCCGATCCA 
    GACCCGCGGC AAACGATCGA GTACAGCTTC GTGACAACGC CCGGCGAACG GGCACGCTTT
    CGGATCGATA AGAGCACGGG CGACATAACG ACCGCTCACA TCTTCGATCG AGATGAGCCG
    ATCCGAGAGA AAGAGATCTA TATCACGGTC CGTGCGACCG ATAATGGACG TCCTCGGTTG
    GACGATGTGT GCACTTTCAA GGTGACGATT CTCGACATTA ATGACAATCC GCCCGTGTTC
    GATAAGGTAC GCTACGAAGA GTCGGTGACG AAAGACATGA AGGCGAACCT GCGAGTGGCT
    ACCATCTCGG CGACGGATAT GGACGACGGT GATAACAGTA TCATCAAGTA CGAAATCGTG
    CAGCAGAACC CAGACAGTAG CTACTTCAAG ATTAACGAGA ACAACGGTCT CCTTACGCTT
    ACGAAACCGG TCGATCGTAG CCCCGGGCAG TACTATTCTA TCCGCGTGCG AGCGTACAAT
    GTCGATCCAC AGGGTGAGGC CGTGCAGGAT GCGGAGGTCG ATGTTAAGGT ACGCGTCGTG
    GAGTCCAACA AGCTGCCCCC ATACTTCACC AAGGTGCAAG CGGCAGCTCT GGTGTTGAAC
    GAGACATTCA AAAACTACAC CGAATCGTTG GCAGAGTTCG AAGCAGAGTC CAACATTCCC
    GAGAAACCGG AAGTAATCTT CGAACTGATC CAAGGTCGTG CTGAGCAGAC GAACTCGAAA
    AATACATTCC TGCTGGAGCA GATCAACAAC ACTGCTTCGA TCAAGCTTGG AAAGACACTC
    GATTATGAAA CTGTGACGGA GTACACGCTG ACAGTGAGCG TCAAAAACTC GCACGATCTT
    GTGGCGGAAA CTGTGCTGAA GATTAAGGTA CTGGACGAGA ACGACATCAT TCCAGTGTTT
    ACGGAAGTGA CATCGGGTAC GATACCGGAA GATGAGCCAC CCGGCACTCC CGTGATGCAG
    GTACGAGCAT ACGATCTGGA CGGCACACCG GCCAACAACA TCGTCTCTTA CAGGTTTGAC
    GATGAGAACC AACAACTGTT CCACATCGAC AGTCGAACGG GAAACATCAC GTCGCGGGTG
    GAGTTCGATC GAGAAGCAAC CGATTCTTAT CATTTGAAAA TCATCGCAGA AGACAACTCT
    CCGTCCGCGC TGTACCGCAA CGGCAAACCG AACAGTATCT CTCAGCTGTT TATCATCAAG
    ATCTCGGACA AGAACGACCA CCAGCCGAAG TTCGTGAAGG ATCACTTCGT GGCGGAGAAT
    GTGCCGGAGG ATGCGAATAT CAACACCGTG GTGATTGAAG TGACGGCTCT GGATCAAGAC
    ACGGCCTCGC TCATCACGTA CAGCATCATC GAGGGCAATG TGGGTGATGC GTTCAAGATC
    GATGAAAACA CCGGTCGTAT CTCGGTAAAC AGCCGCCTCG ACTACGAGAC TATTCGTGAG
    TACATGCTGA TCGTGCAAGC GGACGATGGG ATCTTCCAGG ACAACGCAAC AGTCTCGATC
    AAGATCGAAA ACGTCAACGA TAATCCGCCG CGCTTCATCG ATCTGCGCAA TGTGACGATC
    CAGGAAGAGA CCATTCCGCC CGGGTGTATT ATGACCATCC AGGCGTATGA TCCGGACATC
    GAGAACCGAG ACGAGCCGCA GCACATTCGC TTTTCGTTTG TGAAGGAACA GGAAGATCTG
    CTCGAGATCG ATGATACCGG TTGCCTTCGG TTAAGGAAGG CACTGGATCG TGATCCACCG
    CAAGGCTTCA AGAGCTGGCA GTTCATCATC ACCGCGACGG ATGAGGATGG TGCGGGCAAG
    AAAACGCCAG CGACTGTAAA CATATTCTTG GAAGACATTA ACGACAACGC ACCACGGCTT
    TCGAATGCAA TGCCAGTCGT CTGGGGTGAG AATCGTTCTC CTGGTTTGAT CGTACGACTG
    ACGGCAGAGG ATGTGGACGA GGCGCAGAAC GGTCCACCGT TCCATTTCAG TATTGATCCG
    AATGCTCCGT ATGAGATTAA GGAACGGTTC CAAGTCCAAA ACGACGAGCT GTACGCGTTG
    GTTGAGTTCG ATCGAGAGGA GCAAAAGGAG TACCGAGTTC CAATCCGCAT TAGCGACTCT
    GGCGAAGAAC CCATGAGTGA CGTCAGTATC CTGCAGCTGG TGATTGGAGA TGACAACGAC
    AACGAGATGA GACCAGGCGA AAGTCGCATC TTCGTGTACA ACTACAAAGG AGAATCACCC
    AACACGGAAG TTGGACGCGT GTACGTGGAC GATCCGGACG ACTGGGATCT TCCAGATAAG
    ACGTTCATGT GGGATGATGC TACCCGTCAT CAGTCGGTGG ACTTTTTCGA CCTAAACAGG
    GACACTGGTA TGATCACCAT GCTTCAAGGT ACACGTGGGG GAGATTATGA GCTGAACTTC
    CACGTGATTG AGCAGTCGAG TCATTTCCCG CGCCATAATG TCACAGCCAA AGTGACGGTA
    ACGGTGAAAG AGATCCCGGA GGAAGCCGTT GATAAGAGTG GATCGATCCG TTTCCATAAT
    GTAACTGCGG AAGAGTTCGT ATCGCGCACA CCTGGTCAGC TCACAACGCC CAAAGATCGA
    CTGCAGGCGA GTATTGCAAA CACGCTCAAC GTTAGCCGAG AGAACGTGGA TGTGTTCACC
    GTTTTGAAGC GCGACAATGT CAACGGAACC TTTCTGGACG TTCGGTTCTC GGCCCATGGT
    AGTCCTTATT ACGCACCTGA ACGTCTCAAC GGAATGATGG GCTATCGGCT CCGTCAGTTG
    GAAGAGGACG TAGGCTTTTC GGTGCTTATG GTGGGCATTG ACGAGTGTAT CGAAGAAGGT
    CGCAACTGTG AGCTATCGTG CAAGAATACG CTCTACAAGT CGAACGTCCC TATTGCGGTA
    TACACCAACA CCAGCTCGTT TGTGGGTGTG AATGCCTTCG TACAGGCAGA GTGTGTGTGT
    GAGGCTCCTT CACCGTCGCT CACATGTCTG AATGGAGGTT TCTTGGTGAA CGATCGCTGC
    TCCTGTCCCG AGGGCTTTGA AGGTCCTCAT TGTGAGATGC TTGGAATCGG GTTCTATGGC
    AGTGGATATG CGCTTTATCC TCCGATCAGT CCTTGCAACA TGACGCGAAT AAGTGTAGAA
    CTATCACCGC AGCAGGAAGA TGGCTTGGTG ATGTACATTG GTCCACTCAA CTATAATCCT
    CGTCTGCCTG TGCAGGACTT CTTGGCGTTG GAGCTCGTGA AGGGACTCCC TGTGTTGCTG
    CTTGACTACG GCAGTGGTAC CATTCGTATC GAACATCGGC ATCGATTCCC CCAGGGCAAA
    CCTTTCACCG TAGAGATCGT GTTGCAACCA CAAACAATCG AGATGATCGT CGACAACTGC
    AAGCTGTCCA CGTGCATGAG TCTGGATGCG CCAAAGGGAC CAAATCGGTT CCTAAACGTA
    AACGCTCCAC TGCAACTCGG AGGAGCTGCG GTTAATCTAG ACTACCTCGG ATCGCTGTTC
    AACTGGACGT ACGTGCCACA GGACAAGGGT TTCAGTGGCT GTCTGCGCAA TCTCACCATC
    AACGAGCGCA CGTACGATCT CGGTCTGCCC AGCCTGGCAA AGAATGCCGA TCCGGGTTGC
    CAACGCTCGG TTGCGGTCGC CGTGTCGTTT GGCATCGATT CCTACTTCCT CATTGCAATC
    ATCGCGTGCA TAGTGGTGTT GCTCATTTTG CTGCTCGCCG TCGTCGTGCA CAAGAAGCAC
    CAGGACGGCT GGCATGAGAA GGATATGGAC GACATCCGGG AGACGATTAT CAACTACGAG
    GAGGAGGGTG GCGGCGAACG GGACGCCGAG TACGATCTCA CCGTGCTGCA AGGTCCGCCG
    ATCTATCTAG ACAAACCGTA CGGCAGCGAG CTGCGACAGA AGGAGGCAAA CACCGAGGTG
    CCTGATATTG GCGCGTTCCT CACCGACAAG AAGGACGCGT GCGACAAGGA TGCGGACGCG
    TATCCGATCG ACGATGTGCG GCATTACGCG TACGAGGGTG ACGGCAACAG TACGGGCTCG
    CTGTCGAGCC TCGCTTCCTG CACGGACGAG GGTGATCTGA AGTTCAACTA TCTATCGAAC
    TTCGGTCCAC GGTTCCGCAA GCTCGCCGAC ATGTATGGAG AAGAACCGTC CGACACCGAC
    TCGAACGTGG ACGATGAGGA AGGCTGGAGG ATATGA

    The stop codons will be deleted if pcDNA3.1+/C-(K)DYK vector is selected.

    CloneID OAh00907
    Clone ID Related Accession (Same CDS sequence) XM_308578.4
    Accession Version XM_308578.4 Latest version! Documents for ORF clone product in default vector
    Sequence Information ORF Nucleotide Sequence (Length: 4575bp)
    Protein sequence
    SNP
    Vector pcDNA3.1-C-(k)DYK or customized vector User Manual
    Clone information Clone Map MSDS
    Tag on pcDNA3.1+/C-(K)DYK C terminal DYKDDDDK tags
    ORF Insert Method CloneEZ™ Seamless cloning technology
    Insert Structure linear
    Update Date 2018-04-25
    Organism Anopheles gambiae str. PEST
    Product AGAP007203-PA
    Comment Comment: PROVISIONAL REFSEQ: This record has not yet been subject to final NCBI review. This record is derived from an annotated genomic sequence (NT_078265). On Oct 15, 2007 this sequence version replaced XM_308578.3. COMPLETENESS: incomplete on the 5' end.

    1
    61
    121
    181
    241
    301
    361
    421
    481
    541
    601
    661
    721
    781
    841
    901
    961
    1021
    1081
    1141
    1201
    1261
    1321
    1381
    1441
    1501
    1561
    1621
    1681
    1741
    1801
    1861
    1921
    1981
    2041
    2101
    2161
    2221
    2281
    2341
    2401
    2461
    2521
    2581
    2641
    2701
    2761
    2821
    2881
    2941
    3001
    3061
    3121
    3181
    3241
    3301
    3361
    3421
    3481
    3541
    3601
    3661
    3721
    3781
    3841
    3901
    3961
    4021
    4081
    4141
    4201
    4261
    4321
    4381
    4441
    4501
    4561
    ATGTCCACGG TAGCAAGTGT GGCAAGCAGT AGCAGGGGCA GCACATCACA GCAAGCGAAC 
    AGTGCCGCTG AGCTGCATCG ATTAGGTGGT GTGAGTCAGC TGTATGACTA CCGGCGGGAA
    CTATTATTGG CCGGTGGAGA GCGAAGGAAA CGTAGTGCCT TCAGCACACT GTACGACACG
    GGCAATACAC TCAGCGTGAA CGAATATGGC GGTGGGCCGG TCAGCAGCGG TAACGTCGGT
    AGTGGCAACG GTGGCTCTTC CAGAGAGTCC TACGTTGGGC CGGAGCTTTT GCGTGACAAT
    CGCAAACCCA GCTTCGTCGA GTGTGACAGC TACAAGCCGA TGCTGAAGGA AGAGCAGCCC
    GTTGGTACGC CGGTACTGCG CGTGACGGCG TCCGATCCAG ACCCGCGGCA AACGATCGAG
    TACAGCTTCG TGACAACGCC CGGCGAACGG GCACGCTTTC GGATCGATAA GAGCACGGGC
    GACATAACGA CCGCTCACAT CTTCGATCGA GATGAGCCGA TCCGAGAGAA AGAGATCTAT
    ATCACGGTCC GTGCGACCGA TAATGGACGT CCTCGGTTGG ACGATGTGTG CACTTTCAAG
    GTGACGATTC TCGACATTAA TGACAATCCG CCCGTGTTCG ATAAGGTACG CTACGAAGAG
    TCGGTGACGA AAGACATGAA GGCGAACCTG CGAGTGGCTA CCATCTCGGC GACGGATATG
    GACGACGGTG ATAACAGTAT CATCAAGTAC GAAATCGTGC AGCAGAACCC AGACAGTAGC
    TACTTCAAGA TTAACGAGAA CAACGGTCTC CTTACGCTTA CGAAACCGGT CGATCGTAGC
    CCCGGGCAGT ACTATTCTAT CCGCGTGCGA GCGTACAATG TCGATCCACA GGGTGAGGCC
    GTGCAGGATG CGGAGGTCGA TGTTAAGGTA CGCGTCGTGG AGTCCAACAA GCTGCCCCCA
    TACTTCACCA AGGTGCAAGC GGCAGCTCTG GTGTTGAACG AGACATTCAA AAACTACACC
    GAATCGTTGG CAGAGTTCGA AGCAGAGTCC AACATTCCCG AGAAACCGGA AGTAATCTTC
    GAACTGATCC AAGGTCGTGC TGAGCAGACG AACTCGAAAA ATACATTCCT GCTGGAGCAG
    ATCAACAACA CTGCTTCGAT CAAGCTTGGA AAGACACTCG ATTATGAAAC TGTGACGGAG
    TACACGCTGA CAGTGAGCGT CAAAAACTCG CACGATCTTG TGGCGGAAAC TGTGCTGAAG
    ATTAAGGTAC TGGACGAGAA CGACATCATT CCAGTGTTTA CGGAAGTGAC ATCGGGTACG
    ATACCGGAAG ATGAGCCACC CGGCACTCCC GTGATGCAGG TACGAGCATA CGATCTGGAC
    GGCACACCGG CCAACAACAT CGTCTCTTAC AGGTTTGACG ATGAGAACCA ACAACTGTTC
    CACATCGACA GTCGAACGGG AAACATCACG TCGCGGGTGG AGTTCGATCG AGAAGCAACC
    GATTCTTATC ATTTGAAAAT CATCGCAGAA GACAACTCTC CGTCCGCGCT GTACCGCAAC
    GGCAAACCGA ACAGTATCTC TCAGCTGTTT ATCATCAAGA TCTCGGACAA GAACGACCAC
    CAGCCGAAGT TCGTGAAGGA TCACTTCGTG GCGGAGAATG TGCCGGAGGA TGCGAATATC
    AACACCGTGG TGATTGAAGT GACGGCTCTG GATCAAGACA CGGCCTCGCT CATCACGTAC
    AGCATCATCG AGGGCAATGT GGGTGATGCG TTCAAGATCG ATGAAAACAC CGGTCGTATC
    TCGGTAAACA GCCGCCTCGA CTACGAGACT ATTCGTGAGT ACATGCTGAT CGTGCAAGCG
    GACGATGGGA TCTTCCAGGA CAACGCAACA GTCTCGATCA AGATCGAAAA CGTCAACGAT
    AATCCGCCGC GCTTCATCGA TCTGCGCAAT GTGACGATCC AGGAAGAGAC CATTCCGCCC
    GGGTGTATTA TGACCATCCA GGCGTATGAT CCGGACATCG AGAACCGAGA CGAGCCGCAG
    CACATTCGCT TTTCGTTTGT GAAGGAACAG GAAGATCTGC TCGAGATCGA TGATACCGGT
    TGCCTTCGGT TAAGGAAGGC ACTGGATCGT GATCCACCGC AAGGCTTCAA GAGCTGGCAG
    TTCATCATCA CCGCGACGGA TGAGGATGGT GCGGGCAAGA AAACGCCAGC GACTGTAAAC
    ATATTCTTGG AAGACATTAA CGACAACGCA CCACGGCTTT CGAATGCAAT GCCAGTCGTC
    TGGGGTGAGA ATCGTTCTCC TGGTTTGATC GTACGACTGA CGGCAGAGGA TGTGGACGAG
    GCGCAGAACG GTCCACCGTT CCATTTCAGT ATTGATCCGA ATGCTCCGTA TGAGATTAAG
    GAACGGTTCC AAGTCCAAAA CGACGAGCTG TACGCGTTGG TTGAGTTCGA TCGAGAGGAG
    CAAAAGGAGT ACCGAGTTCC AATCCGCATT AGCGACTCTG GCGAAGAACC CATGAGTGAC
    GTCAGTATCC TGCAGCTGGT GATTGGAGAT GACAACGACA ACGAGATGAG ACCAGGCGAA
    AGTCGCATCT TCGTGTACAA CTACAAAGGA GAATCACCCA ACACGGAAGT TGGACGCGTG
    TACGTGGACG ATCCGGACGA CTGGGATCTT CCAGATAAGA CGTTCATGTG GGATGATGCT
    ACCCGTCATC AGTCGGTGGA CTTTTTCGAC CTAAACAGGG ACACTGGTAT GATCACCATG
    CTTCAAGGTA CACGTGGGGG AGATTATGAG CTGAACTTCC ACGTGATTGA GCAGTCGAGT
    CATTTCCCGC GCCATAATGT CACAGCCAAA GTGACGGTAA CGGTGAAAGA GATCCCGGAG
    GAAGCCGTTG ATAAGAGTGG ATCGATCCGT TTCCATAATG TAACTGCGGA AGAGTTCGTA
    TCGCGCACAC CTGGTCAGCT CACAACGCCC AAAGATCGAC TGCAGGCGAG TATTGCAAAC
    ACGCTCAACG TTAGCCGAGA GAACGTGGAT GTGTTCACCG TTTTGAAGCG CGACAATGTC
    AACGGAACCT TTCTGGACGT TCGGTTCTCG GCCCATGGTA GTCCTTATTA CGCACCTGAA
    CGTCTCAACG GAATGATGGG CTATCGGCTC CGTCAGTTGG AAGAGGACGT AGGCTTTTCG
    GTGCTTATGG TGGGCATTGA CGAGTGTATC GAAGAAGGTC GCAACTGTGA GCTATCGTGC
    AAGAATACGC TCTACAAGTC GAACGTCCCT ATTGCGGTAT ACACCAACAC CAGCTCGTTT
    GTGGGTGTGA ATGCCTTCGT ACAGGCAGAG TGTGTGTGTG AGGCTCCTTC ACCGTCGCTC
    ACATGTCTGA ATGGAGGTTT CTTGGTGAAC GATCGCTGCT CCTGTCCCGA GGGCTTTGAA
    GGTCCTCATT GTGAGATGCT TGGAATCGGG TTCTATGGCA GTGGATATGC GCTTTATCCT
    CCGATCAGTC CTTGCAACAT GACGCGAATA AGTGTAGAAC TATCACCGCA GCAGGAAGAT
    GGCTTGGTGA TGTACATTGG TCCACTCAAC TATAATCCTC GTCTGCCTGT GCAGGACTTC
    TTGGCGTTGG AGCTCGTGAA GGGACTCCCT GTGTTGCTGC TTGACTACGG CAGTGGTACC
    ATTCGTATCG AACATCGGCA TCGATTCCCC CAGGGCAAAC CTTTCACCGT AGAGATCGTG
    TTGCAACCAC AAACAATCGA GATGATCGTC GACAACTGCA AGCTGTCCAC GTGCATGAGT
    CTGGATGCGC CAAAGGGACC AAATCGGTTC CTAAACGTAA ACGCTCCACT GCAACTCGGA
    GGAGCTGCGG TTAATCTAGA CTACCTCGGA TCGCTGTTCA ACTGGACGTA CGTGCCACAG
    GACAAGGGTT TCAGTGGCTG TCTGCGCAAT CTCACCATCA ACGAGCGCAC GTACGATCTC
    GGTCTGCCCA GCCTGGCAAA GAATGCCGAT CCGGGTTGCC AACGCTCGGT TGCGGTCGCC
    GTGTCGTTTG GCATCGATTC CTACTTCCTC ATTGCAATCA TCGCGTGCAT AGTGGTGTTG
    CTCATTTTGC TGCTCGCCGT CGTCGTGCAC AAGAAGCACC AGGACGGCTG GCATGAGAAG
    GATATGGACG ACATCCGGGA GACGATTATC AACTACGAGG AGGAGGGTGG CGGCGAACGG
    GACGCCGAGT ACGATCTCAC CGTGCTGCAA GGTCCGCCGA TCTATCTAGA CAAACCGTAC
    GGCAGCGAGC TGCGACAGAA GGAGGCAAAC ACCGAGGTGC CTGATATTGG CGCGTTCCTC
    ACCGACAAGA AGGACGCGTG CGACAAGGAT GCGGACGCGT ATCCGATCGA CGATGTGCGG
    CATTACGCGT ACGAGGGTGA CGGCAACAGT ACGGGCTCGC TGTCGAGCCT CGCTTCCTGC
    ACGGACGAGG GTGATCTGAA GTTCAACTAT CTATCGAACT TCGGTCCACG GTTCCGCAAG
    CTCGCCGACA TGTATGGAGA AGAACCGTCC GACACCGACT CGAACGTGGA CGATGAGGAA
    GGCTGGAGGA TATGA

    The stop codons will be deleted if pcDNA3.1+/C-(K)DYK vector is selected.

    RefSeq XP_308578.4
    CDS1..4575
    Translation

    Target ORF information:

    RefSeq Version XM_308578.4
    Organism Anopheles gambiae str. PEST
    Definition Anopheles gambiae str. PEST AGAP007203-RA (AgaP_AGAP007203), partial mRNA.

    Target ORF information:

    Epitope DYKDDDDK
    Bacterial selection AMPR
    Mammalian selection NeoR
    Vector pcDNA3.1+/C-(K)DYK
    XM_308578.4

    ORF Insert Sequence:

    1
    61
    121
    181
    241
    301
    361
    421
    481
    541
    601
    661
    721
    781
    841
    901
    961
    1021
    1081
    1141
    1201
    1261
    1321
    1381
    1441
    1501
    1561
    1621
    1681
    1741
    1801
    1861
    1921
    1981
    2041
    2101
    2161
    2221
    2281
    2341
    2401
    2461
    2521
    2581
    2641
    2701
    2761
    2821
    2881
    2941
    3001
    3061
    3121
    3181
    3241
    3301
    3361
    3421
    3481
    3541
    3601
    3661
    3721
    3781
    3841
    3901
    3961
    4021
    4081
    4141
    4201
    4261
    4321
    4381
    4441
    4501
    4561
    ATGTCCACGG TAGCAAGTGT GGCAAGCAGT AGCAGGGGCA GCACATCACA GCAAGCGAAC 
    AGTGCCGCTG AGCTGCATCG ATTAGGTGGT GTGAGTCAGC TGTATGACTA CCGGCGGGAA
    CTATTATTGG CCGGTGGAGA GCGAAGGAAA CGTAGTGCCT TCAGCACACT GTACGACACG
    GGCAATACAC TCAGCGTGAA CGAATATGGC GGTGGGCCGG TCAGCAGCGG TAACGTCGGT
    AGTGGCAACG GTGGCTCTTC CAGAGAGTCC TACGTTGGGC CGGAGCTTTT GCGTGACAAT
    CGCAAACCCA GCTTCGTCGA GTGTGACAGC TACAAGCCGA TGCTGAAGGA AGAGCAGCCC
    GTTGGTACGC CGGTACTGCG CGTGACGGCG TCCGATCCAG ACCCGCGGCA AACGATCGAG
    TACAGCTTCG TGACAACGCC CGGCGAACGG GCACGCTTTC GGATCGATAA GAGCACGGGC
    GACATAACGA CCGCTCACAT CTTCGATCGA GATGAGCCGA TCCGAGAGAA AGAGATCTAT
    ATCACGGTCC GTGCGACCGA TAATGGACGT CCTCGGTTGG ACGATGTGTG CACTTTCAAG
    GTGACGATTC TCGACATTAA TGACAATCCG CCCGTGTTCG ATAAGGTACG CTACGAAGAG
    TCGGTGACGA AAGACATGAA GGCGAACCTG CGAGTGGCTA CCATCTCGGC GACGGATATG
    GACGACGGTG ATAACAGTAT CATCAAGTAC GAAATCGTGC AGCAGAACCC AGACAGTAGC
    TACTTCAAGA TTAACGAGAA CAACGGTCTC CTTACGCTTA CGAAACCGGT CGATCGTAGC
    CCCGGGCAGT ACTATTCTAT CCGCGTGCGA GCGTACAATG TCGATCCACA GGGTGAGGCC
    GTGCAGGATG CGGAGGTCGA TGTTAAGGTA CGCGTCGTGG AGTCCAACAA GCTGCCCCCA
    TACTTCACCA AGGTGCAAGC GGCAGCTCTG GTGTTGAACG AGACATTCAA AAACTACACC
    GAATCGTTGG CAGAGTTCGA AGCAGAGTCC AACATTCCCG AGAAACCGGA AGTAATCTTC
    GAACTGATCC AAGGTCGTGC TGAGCAGACG AACTCGAAAA ATACATTCCT GCTGGAGCAG
    ATCAACAACA CTGCTTCGAT CAAGCTTGGA AAGACACTCG ATTATGAAAC TGTGACGGAG
    TACACGCTGA CAGTGAGCGT CAAAAACTCG CACGATCTTG TGGCGGAAAC TGTGCTGAAG
    ATTAAGGTAC TGGACGAGAA CGACATCATT CCAGTGTTTA CGGAAGTGAC ATCGGGTACG
    ATACCGGAAG ATGAGCCACC CGGCACTCCC GTGATGCAGG TACGAGCATA CGATCTGGAC
    GGCACACCGG CCAACAACAT CGTCTCTTAC AGGTTTGACG ATGAGAACCA ACAACTGTTC
    CACATCGACA GTCGAACGGG AAACATCACG TCGCGGGTGG AGTTCGATCG AGAAGCAACC
    GATTCTTATC ATTTGAAAAT CATCGCAGAA GACAACTCTC CGTCCGCGCT GTACCGCAAC
    GGCAAACCGA ACAGTATCTC TCAGCTGTTT ATCATCAAGA TCTCGGACAA GAACGACCAC
    CAGCCGAAGT TCGTGAAGGA TCACTTCGTG GCGGAGAATG TGCCGGAGGA TGCGAATATC
    AACACCGTGG TGATTGAAGT GACGGCTCTG GATCAAGACA CGGCCTCGCT CATCACGTAC
    AGCATCATCG AGGGCAATGT GGGTGATGCG TTCAAGATCG ATGAAAACAC CGGTCGTATC
    TCGGTAAACA GCCGCCTCGA CTACGAGACT ATTCGTGAGT ACATGCTGAT CGTGCAAGCG
    GACGATGGGA TCTTCCAGGA CAACGCAACA GTCTCGATCA AGATCGAAAA CGTCAACGAT
    AATCCGCCGC GCTTCATCGA TCTGCGCAAT GTGACGATCC AGGAAGAGAC CATTCCGCCC
    GGGTGTATTA TGACCATCCA GGCGTATGAT CCGGACATCG AGAACCGAGA CGAGCCGCAG
    CACATTCGCT TTTCGTTTGT GAAGGAACAG GAAGATCTGC TCGAGATCGA TGATACCGGT
    TGCCTTCGGT TAAGGAAGGC ACTGGATCGT GATCCACCGC AAGGCTTCAA GAGCTGGCAG
    TTCATCATCA CCGCGACGGA TGAGGATGGT GCGGGCAAGA AAACGCCAGC GACTGTAAAC
    ATATTCTTGG AAGACATTAA CGACAACGCA CCACGGCTTT CGAATGCAAT GCCAGTCGTC
    TGGGGTGAGA ATCGTTCTCC TGGTTTGATC GTACGACTGA CGGCAGAGGA TGTGGACGAG
    GCGCAGAACG GTCCACCGTT CCATTTCAGT ATTGATCCGA ATGCTCCGTA TGAGATTAAG
    GAACGGTTCC AAGTCCAAAA CGACGAGCTG TACGCGTTGG TTGAGTTCGA TCGAGAGGAG
    CAAAAGGAGT ACCGAGTTCC AATCCGCATT AGCGACTCTG GCGAAGAACC CATGAGTGAC
    GTCAGTATCC TGCAGCTGGT GATTGGAGAT GACAACGACA ACGAGATGAG ACCAGGCGAA
    AGTCGCATCT TCGTGTACAA CTACAAAGGA GAATCACCCA ACACGGAAGT TGGACGCGTG
    TACGTGGACG ATCCGGACGA CTGGGATCTT CCAGATAAGA CGTTCATGTG GGATGATGCT
    ACCCGTCATC AGTCGGTGGA CTTTTTCGAC CTAAACAGGG ACACTGGTAT GATCACCATG
    CTTCAAGGTA CACGTGGGGG AGATTATGAG CTGAACTTCC ACGTGATTGA GCAGTCGAGT
    CATTTCCCGC GCCATAATGT CACAGCCAAA GTGACGGTAA CGGTGAAAGA GATCCCGGAG
    GAAGCCGTTG ATAAGAGTGG ATCGATCCGT TTCCATAATG TAACTGCGGA AGAGTTCGTA
    TCGCGCACAC CTGGTCAGCT CACAACGCCC AAAGATCGAC TGCAGGCGAG TATTGCAAAC
    ACGCTCAACG TTAGCCGAGA GAACGTGGAT GTGTTCACCG TTTTGAAGCG CGACAATGTC
    AACGGAACCT TTCTGGACGT TCGGTTCTCG GCCCATGGTA GTCCTTATTA CGCACCTGAA
    CGTCTCAACG GAATGATGGG CTATCGGCTC CGTCAGTTGG AAGAGGACGT AGGCTTTTCG
    GTGCTTATGG TGGGCATTGA CGAGTGTATC GAAGAAGGTC GCAACTGTGA GCTATCGTGC
    AAGAATACGC TCTACAAGTC GAACGTCCCT ATTGCGGTAT ACACCAACAC CAGCTCGTTT
    GTGGGTGTGA ATGCCTTCGT ACAGGCAGAG TGTGTGTGTG AGGCTCCTTC ACCGTCGCTC
    ACATGTCTGA ATGGAGGTTT CTTGGTGAAC GATCGCTGCT CCTGTCCCGA GGGCTTTGAA
    GGTCCTCATT GTGAGATGCT TGGAATCGGG TTCTATGGCA GTGGATATGC GCTTTATCCT
    CCGATCAGTC CTTGCAACAT GACGCGAATA AGTGTAGAAC TATCACCGCA GCAGGAAGAT
    GGCTTGGTGA TGTACATTGG TCCACTCAAC TATAATCCTC GTCTGCCTGT GCAGGACTTC
    TTGGCGTTGG AGCTCGTGAA GGGACTCCCT GTGTTGCTGC TTGACTACGG CAGTGGTACC
    ATTCGTATCG AACATCGGCA TCGATTCCCC CAGGGCAAAC CTTTCACCGT AGAGATCGTG
    TTGCAACCAC AAACAATCGA GATGATCGTC GACAACTGCA AGCTGTCCAC GTGCATGAGT
    CTGGATGCGC CAAAGGGACC AAATCGGTTC CTAAACGTAA ACGCTCCACT GCAACTCGGA
    GGAGCTGCGG TTAATCTAGA CTACCTCGGA TCGCTGTTCA ACTGGACGTA CGTGCCACAG
    GACAAGGGTT TCAGTGGCTG TCTGCGCAAT CTCACCATCA ACGAGCGCAC GTACGATCTC
    GGTCTGCCCA GCCTGGCAAA GAATGCCGAT CCGGGTTGCC AACGCTCGGT TGCGGTCGCC
    GTGTCGTTTG GCATCGATTC CTACTTCCTC ATTGCAATCA TCGCGTGCAT AGTGGTGTTG
    CTCATTTTGC TGCTCGCCGT CGTCGTGCAC AAGAAGCACC AGGACGGCTG GCATGAGAAG
    GATATGGACG ACATCCGGGA GACGATTATC AACTACGAGG AGGAGGGTGG CGGCGAACGG
    GACGCCGAGT ACGATCTCAC CGTGCTGCAA GGTCCGCCGA TCTATCTAGA CAAACCGTAC
    GGCAGCGAGC TGCGACAGAA GGAGGCAAAC ACCGAGGTGC CTGATATTGG CGCGTTCCTC
    ACCGACAAGA AGGACGCGTG CGACAAGGAT GCGGACGCGT ATCCGATCGA CGATGTGCGG
    CATTACGCGT ACGAGGGTGA CGGCAACAGT ACGGGCTCGC TGTCGAGCCT CGCTTCCTGC
    ACGGACGAGG GTGATCTGAA GTTCAACTAT CTATCGAACT TCGGTCCACG GTTCCGCAAG
    CTCGCCGACA TGTATGGAGA AGAACCGTCC GACACCGACT CGAACGTGGA CGATGAGGAA
    GGCTGGAGGA TATGA

    The stop codons will be deleted if pcDNA3.1+/C-(K)DYK vector is selected.

  • PubMed

    The Anopheles gambiae genome: an update.
    Trends in parasitology20(2)49-52(2004 Feb)
    Mongin E,Louis C,Holt RA,Birney E,Collins FH


    The genome sequence of the malaria mosquito Anopheles gambiae.
    Science (New York, N.Y.)298(5591)129-49(2002 Oct)
    Holt RA,Subramanian GM,Halpern A,Sutton GG,Charlab R,Nusskern DR,Wincker P,Clark AG,Ribeiro JM,Wides R,Salzberg SL,Loftus B,Yandell M,Majoros WH,Rusch DB,Lai Z,Kraft CL,Abril JF,Anthouard V,Arensburger P,Atkinson PW,Baden H,de Berardinis V,Baldwin D,Benes V,Biedler J,Blass C,Bolanos R,Boscus D,Barnstead M,Cai S,Center A,Chaturverdi K,Christophides GK,Chrystal MA,Clamp M,Cravchik A,Curwen V,Dana A,Delcher A,Dew I,Evans CA,Flanigan M,Grundschober-Freimoser A,Friedli L,Gu Z,Guan P,Guigo R,Hillenmeyer ME,Hladun SL,Hogan JR,Hong YS,Hoover J,Jaillon O,Ke Z,Kodira C,Kokoza E,Koutsos A,Letunic I,Levitsky A,Liang Y,Lin JJ,Lobo NF,Lopez JR,Malek JA,McIntosh TC,Meister S,Miller J,Mobarry C,Mongin E,Murphy SD,O'Brochta DA,Pfannkoch C,Qi R,Regier MA,Remington K,Shao H,Sharakhova MV,Sitter CD,Shetty J,Smith TJ,Strong R,Sun J,Thomasova D,Ton LQ,Topalis P,Tu Z,Unger MF,Walenz B,Wang A,Wang J,Wang M,Wang X,Woodford KJ,Wortman JR,Wu M,Yao A,Zdobnov EM,Zhang H,Zhao Q,Zhao S,Zhu SC,Zhimulev I,Coluzzi M,della Torre A,Roth CW,Louis C,Kalush F,Mural RJ,Myers EW,Adams MD,Smith HO,Broder S,Gardner MJ,Fraser CM,Birney E,Bork P,Brey PT,Venter JC,Weissenbach J,Kafatos FC,Collins FH,Hoffman SL