VPS15 cDNA ORF clone, Arabidopsis thaliana(thale cress)

  • Gene

  • Clones

  • gRNAs

The following VPS15 gene cDNA ORF clone sequences were retrieved from the NCBI Reference Sequence Database (RefSeq). These sequences represent the protein coding region of the VPS15 cDNA ORF which is encoded by the open reading frame (ORF) sequence. ORF sequences can be delivered in our standard vector, pcDNA3.1+/C-(K)DYK or the vector of your choice as an expression/transfection-ready ORF clone. Not the clone you want? Click here to find your clone.

***CloneID Accession No. Definition **Vector *Turnaround time Price (USD) Select
OAb39856 NM_001341967.1
Latest version!
Arabidopsis thaliana protein kinase family protein / WD-40 repeat family protein (VPS15), mRNA. pcDNA3.1-C-(k)DYK or customized vector 19-21 $713.30
$1019.00
OAb16655 NM_119083.3
Latest version!
Arabidopsis thaliana protein kinase family protein / WD-40 repeat family protein (VPS15), mRNA. pcDNA3.1-C-(k)DYK or customized vector 25 $797.30
$1139.00

ORF Online Only Promotion

Next-day Shipping ORF Clones ( in default vector with tag)
1 Clone 30% OFF
2-4 Clone 40% OFF
5 or more Clone 50% OFF
All Other ORF Clones
30% OFF

*Business Day

** You may select a custom vector to replace pcDNA3.1+/C-(K)DYK after clone is added to cart.

** GenScript guarantees 100% sequence accuracy of all synthetic DNA constructs we deliver, but we do not guarantee protein expression in your experimental system. Protein expression is influenced by many factors that may vary between experiments or laboratories. In addition, please pay attention to the signal peptide, propeptide and transit peptide in target ORF, which may affect the choice of vector (N/C terminal tag vector).

***One clone ID might be correlated to multiple accession numbers, which share the same CDS sequence.

  • Reference Sequences (Refseq)
    CloneID OAb39856
    Clone ID Related Accession (Same CDS sequence) NM_001341967.1
    Accession Version NM_001341967.1 Latest version! Documents for ORF clone product in default vector
    Sequence Information ORF Nucleotide Sequence (Length: 3822bp)
    Protein sequence
    SNP
    Vector pcDNA3.1-C-(k)DYK or customized vector User Manual
    Clone information Clone Map MSDS
    Tag on pcDNA3.1+/C-(K)DYK C terminal DYKDDDDK tags
    ORF Insert Method CloneEZ™ Seamless cloning technology
    Insert Structure linear
    Update Date 2019-02-13
    Organism Arabidopsis thaliana(thale cress)
    Product protein kinase family protein / WD-40 repeat family protein
    Comment Comment: REVIEWED REFSEQ: This record has been curated by TAIR and Araport. This record is derived from an annotated genomic sequence (NC_003075).

    1
    61
    121
    181
    241
    301
    361
    421
    481
    541
    601
    661
    721
    781
    841
    901
    961
    1021
    1081
    1141
    1201
    1261
    1321
    1381
    1441
    1501
    1561
    1621
    1681
    1741
    1801
    1861
    1921
    1981
    2041
    2101
    2161
    2221
    2281
    2341
    2401
    2461
    2521
    2581
    2641
    2701
    2761
    2821
    2881
    2941
    3001
    3061
    3121
    3181
    3241
    3301
    3361
    3421
    3481
    3541
    3601
    3661
    3721
    3781
    ATGGATATAT TTGCTGTGGG GTGTGTGATA GCCGAACTTT TTCTTGAGGG TCAGCCACTA 
    TTTGAACTGG CGCAGCTTCT CGCTTATCGT AGAGGGCAAC ATGATCCTAG CCAACACCTT
    GAAAAGATTC CTGATCCGGG AATTCGCAAG ATGATTCTTC ATATGATTCA GTTAGAACCC
    GAAGCACGCC TATCTGCTGA AGACTACCTG CAAAATTATG TGGGAGTTGT TTTCCCAAAC
    TACTTCTCAC CATTTCTGCA CACTTTATAT TGTTGTTGGA ATCCACTTCC TTCAGACATG
    AGGGTAGCAA CTTGCCAGGG GATATTTCAA GAAATACTTA AAAAGATGAT GGAAAATAAG
    TCAGGTGATG AGATCGGCGT TGATTCTCCT GTAACTTCAA ATCCAATGAA CGCAAGCACA
    GTACAGGAAA CTTTTGCAAA TCACAAATTG AACTCATCTA AGGATTTGAT AAGGAATACT
    GTGAACTCTA AGGATGAGAT CTTTTACTCT ATTTCTGATG CACTCAAGAA AAATCGCCAT
    CCTTTCTTGA AAAAGATAAC AATGGACGAT TTGGGTACAC TGATGTCTCT CTATGATAGC
    CGTTCTGACA CTTATGGCAC GCCTTTTCTA CCGGTAGAGG GTAACATGAG ATGTGAGGGA
    ATGGTTCTGA TTGCATCTAT GCTCTGTTCT TGTATCCGCA ATATCAAGTT GCCTCATTTG
    AGGAGGGAAG CTATACTTCT ATTGAGATCT TGCTCTTTGT ATATTGATGA TGATGATCGC
    TTACAGCGTG TACTTCCATA CGTCGTTGCC TTGCTTTCTG ATCCAACAGC AATCGTGCGG
    TGCGCTGCCA TGGAAACTTT GTGTGACATT CTGCCGCTTG TCCGAGATTT TCCTCCTAGT
    GATGCAAAGA TTTTCCCAGA GTACATATTT CCGATGCTCT CCATGCTTCC TGAAGATACG
    GAAGAGAGTG TGAGGATATG CTATGCCAGC AATATTGCAA AACTCGCTCT TACTGCTTAT
    GGATTCTTGA TACATTCTTT CCAGTTGAGC GATGTAGGGG TTCTTAATGA ATTGAATTCC
    CAGCAGATCT CCACTACACC TGCTAGTGAG ACCCCTAGTC ATTTGCAAAA GGCAAATGGC
    AATGCGCAGC TTCAACAGCT TAGAAAAACT ATAGCTGAAG TTGTTCAAGA GCTTGTTATG
    GGTCCAAAAC AAACTCCAAA TGTTAGAAGA GCACTCCTTC AGGACATAGG GGAGCTCTGC
    TTTTTCTTTG GTCAGAGGCA GAGTAATGAC TTTCTACTAC CGATCCTCCC TGCCTTTCTA
    AACGACAGAG ATGAGCAGCT AAGATCTGTA TTCTTTGAGA AGATTGTTTA TGTATGCTTT
    TTTGTTGGCC AGAGAAGTGT GGAGGAGTAT CTATTGCCTT ATATCGATCA AGCTTTGAGT
    GATCAGACGG AGGCTGTTAT TGTCAATGCA TTGGAGTGCT TATCCACATT ATGCAAGAGT
    AGTTTCTTGC GGAAGAGAGC TCTCCTCCAA ATGATAGAGT GTGTTTATCC TTTGTTGTGC
    TATCCATCTC AATGGGTAAG GAGGGCAGTT GTCACTTTCA TTGCCGCAAG TAGTGAATGC
    TTAGGTGCAG TCGACTCTTA TGCTTTTATT GCCCCAGTAA TACGCTCTTA TCTTAGTAGA
    CTGCCTGCGT CAATTGCTTC TGAGGAAGGT CTACTTTCAT GTTTGAAGCC CCCTGTCACA
    AGGGAGGTAG TTTATCGTAT CTTTGAAAAA ACCAGGAACC CAGAATTCAT GGCGAAACAG
    CGAAAGATGT GGTATAGTTC TTCACCTCAG TCCAAAGATT GGGAATCTGT TGATTTGTTT
    GACAAAGATG CTGGGGAGTT GAATTCAGTA GAATGCAGGG CCGAACAGAA GCAAAGTGTG
    GAAGGAAAAA AACAGATTAA GAGTGCATCA AAGCAACCAG AAGTTCAAGG AAAGTATGCA
    GAAAAGGATG CTAAATTAAG AATCCCGAGA AACCCAAGAC CTAATGCTTC TAACACTGTT
    GAGCTACGTG ATCCCGTGTA TCCAGAGAAG TTACAGTTCT CTGGGTTTAT GGCACCATAT
    GTATCTGGTG CGAATAGCTT TATTGAACCA GAGAACATAC CTCTCTATTC GTTTAGCATG
    GACAAACGAG CAGCTACAAA TCCTCCTGTG GCTTCTGAGT CTTCATTGCA GATGAACTCT
    CTGGGAATGG GTTCATTGTC TGTGCCATGG ATGGATTCCA TGAGTAAATC ATTTAACTTG
    GCTAGTTCGG TCCCAGTGCC TAAGCTGATT TCTGGGTCAT TCCATGTCGG TACCAATCCT
    AAACAATTTT ACAGAGTGGT ACATGAGCCA GAAAGCAGAG AAAATGATCA AATCTCCTCA
    GCCATCAGTA AATTTCAAGA CCTCGGAGTA TCAAGCTCCT CAAAAAGTGC TTCTGTAACT
    TCAGAAGATG CTTCTTCTCC AGCGGATCTT GTAGGAGAGC CATCTCTGTC AAGGACATCG
    GTTCCGGATT CAGGGTGGAA GCCTCGTGGA GTATTAGTTG CTCATCTACA AGAGCATCGC
    TCTGCGGTCA ATGACATTGC CACTTCAAGC GATCATAGCT TTTTTGTTAG TGCATCAGAT
    GATTCCACAG TGAAGGTGTG GGACTCTAGA AAACTGGAAA AGGACATCTC TTTTAGGTCA
    AGGCTAACAT ATCATCTCGA GGGAAGCAGA GGGATGTGCA CAACAATGCT TCGGAATTCA
    ACGCAAGTTG TAGTTGGAGC CTCTGATGGT GTGATACATA TGTTTTCAAT TGACCATATC
    TCCAGAGGCT TGGGGAACGT AGTGGAGAAG TATTCAGGCA TTGTTGATAT TAAAAAGAAA
    GATGTTAAAG AAGGCGCTCT AGTTTCTCTC TTGAATTATA CTGCTGATAG CCTTTCTGGT
    CCGATGGTAA TGTATAGTAC CCAAAACTGC GGAATCCACC TTTGGGATAC AAGGTCAGAT
    TTAGATGCAT GGACACTGAA AGCAAATCCT GAAGAAGGAT ATGTGTCTTC ATTGGTTACA
    AGTCCTTGTG GGAATTGGTT TGTCTCTGGG TCTTCAAGGG GAGTGCTTAC TCTTTGGGAT
    TTGAGATTTC GTGTTCCTGT AAATTCGTGG CAATACCCCA TCATATGTCC CATAGAGAAG
    ATGTGCCTCT GCTTTCTTCC TCCAAGCGTC TCAGTGTCCA CCACTATGAA ACCTTTAATT
    TATGTTGCTG CCGGTTGCAA CGAAGTTTCA CTCTGGAATG CAGAGGGAGG TAGCTGTCAC
    CAGGTATTGA GAGTAGCCAA TTATGAAAAT GAGACGGATG TTTCCGAGTT TCAATGGAAG
    TTACCAAGCA ATAAGGTAAA TCCGAAGCCG AATCATCGTC AGAACATGAG CTCCAAGTAC
    AGAATCGAAG AGTTGAACGA GCCTCCTCCT CGTCTTCCTG GTATCCGCTC TTTGCTCCCT
    TTACCTGGAG GTGACTTGTT AACAGGTGGT ACTGACTTGA AGATTCGGCG TTGGGATTAC
    TCCAGCCCTG AGAGAAGTTA TTGTATATGC GGTCCGAGTT TGAAAGGAGT CGGAAATGAT
    GATTTCTATG AACTAAAAAC CAACACGGGC GTGCAATTTG TTCAGGAGAC AAAGAGACGG
    CCTCTGGCTA CTAAACTGAC GGCAAAGGCG GTACTTGCGG CTGCTGCGAC AGACACAGCG
    GGTTGTCATC GTGACTCAGT TCAGTCTCTG GCATCTGTGA AGCTGAACCA GAGACTGTTG
    ATATCAAGCA GCAGAGATGG AGCCATAAAG GTCTGGAAGT AA

    The stop codons will be deleted if pcDNA3.1+/C-(K)DYK vector is selected.

    RefSeq NP_001329182.1
    CDS113..3934
    Translation

    Target ORF information:

    RefSeq Version NM_001341967.1
    Organism Arabidopsis thaliana(thale cress)
    Definition Arabidopsis thaliana protein kinase family protein / WD-40 repeat family protein (VPS15), mRNA.

    Target ORF information:

    Epitope DYKDDDDK
    Bacterial selection AMPR
    Mammalian selection NeoR
    Vector pcDNA3.1+/C-(K)DYK
    NM_001341967.1

    ORF Insert Sequence:

    1
    61
    121
    181
    241
    301
    361
    421
    481
    541
    601
    661
    721
    781
    841
    901
    961
    1021
    1081
    1141
    1201
    1261
    1321
    1381
    1441
    1501
    1561
    1621
    1681
    1741
    1801
    1861
    1921
    1981
    2041
    2101
    2161
    2221
    2281
    2341
    2401
    2461
    2521
    2581
    2641
    2701
    2761
    2821
    2881
    2941
    3001
    3061
    3121
    3181
    3241
    3301
    3361
    3421
    3481
    3541
    3601
    3661
    3721
    3781
    ATGGATATAT TTGCTGTGGG GTGTGTGATA GCCGAACTTT TTCTTGAGGG TCAGCCACTA 
    TTTGAACTGG CGCAGCTTCT CGCTTATCGT AGAGGGCAAC ATGATCCTAG CCAACACCTT
    GAAAAGATTC CTGATCCGGG AATTCGCAAG ATGATTCTTC ATATGATTCA GTTAGAACCC
    GAAGCACGCC TATCTGCTGA AGACTACCTG CAAAATTATG TGGGAGTTGT TTTCCCAAAC
    TACTTCTCAC CATTTCTGCA CACTTTATAT TGTTGTTGGA ATCCACTTCC TTCAGACATG
    AGGGTAGCAA CTTGCCAGGG GATATTTCAA GAAATACTTA AAAAGATGAT GGAAAATAAG
    TCAGGTGATG AGATCGGCGT TGATTCTCCT GTAACTTCAA ATCCAATGAA CGCAAGCACA
    GTACAGGAAA CTTTTGCAAA TCACAAATTG AACTCATCTA AGGATTTGAT AAGGAATACT
    GTGAACTCTA AGGATGAGAT CTTTTACTCT ATTTCTGATG CACTCAAGAA AAATCGCCAT
    CCTTTCTTGA AAAAGATAAC AATGGACGAT TTGGGTACAC TGATGTCTCT CTATGATAGC
    CGTTCTGACA CTTATGGCAC GCCTTTTCTA CCGGTAGAGG GTAACATGAG ATGTGAGGGA
    ATGGTTCTGA TTGCATCTAT GCTCTGTTCT TGTATCCGCA ATATCAAGTT GCCTCATTTG
    AGGAGGGAAG CTATACTTCT ATTGAGATCT TGCTCTTTGT ATATTGATGA TGATGATCGC
    TTACAGCGTG TACTTCCATA CGTCGTTGCC TTGCTTTCTG ATCCAACAGC AATCGTGCGG
    TGCGCTGCCA TGGAAACTTT GTGTGACATT CTGCCGCTTG TCCGAGATTT TCCTCCTAGT
    GATGCAAAGA TTTTCCCAGA GTACATATTT CCGATGCTCT CCATGCTTCC TGAAGATACG
    GAAGAGAGTG TGAGGATATG CTATGCCAGC AATATTGCAA AACTCGCTCT TACTGCTTAT
    GGATTCTTGA TACATTCTTT CCAGTTGAGC GATGTAGGGG TTCTTAATGA ATTGAATTCC
    CAGCAGATCT CCACTACACC TGCTAGTGAG ACCCCTAGTC ATTTGCAAAA GGCAAATGGC
    AATGCGCAGC TTCAACAGCT TAGAAAAACT ATAGCTGAAG TTGTTCAAGA GCTTGTTATG
    GGTCCAAAAC AAACTCCAAA TGTTAGAAGA GCACTCCTTC AGGACATAGG GGAGCTCTGC
    TTTTTCTTTG GTCAGAGGCA GAGTAATGAC TTTCTACTAC CGATCCTCCC TGCCTTTCTA
    AACGACAGAG ATGAGCAGCT AAGATCTGTA TTCTTTGAGA AGATTGTTTA TGTATGCTTT
    TTTGTTGGCC AGAGAAGTGT GGAGGAGTAT CTATTGCCTT ATATCGATCA AGCTTTGAGT
    GATCAGACGG AGGCTGTTAT TGTCAATGCA TTGGAGTGCT TATCCACATT ATGCAAGAGT
    AGTTTCTTGC GGAAGAGAGC TCTCCTCCAA ATGATAGAGT GTGTTTATCC TTTGTTGTGC
    TATCCATCTC AATGGGTAAG GAGGGCAGTT GTCACTTTCA TTGCCGCAAG TAGTGAATGC
    TTAGGTGCAG TCGACTCTTA TGCTTTTATT GCCCCAGTAA TACGCTCTTA TCTTAGTAGA
    CTGCCTGCGT CAATTGCTTC TGAGGAAGGT CTACTTTCAT GTTTGAAGCC CCCTGTCACA
    AGGGAGGTAG TTTATCGTAT CTTTGAAAAA ACCAGGAACC CAGAATTCAT GGCGAAACAG
    CGAAAGATGT GGTATAGTTC TTCACCTCAG TCCAAAGATT GGGAATCTGT TGATTTGTTT
    GACAAAGATG CTGGGGAGTT GAATTCAGTA GAATGCAGGG CCGAACAGAA GCAAAGTGTG
    GAAGGAAAAA AACAGATTAA GAGTGCATCA AAGCAACCAG AAGTTCAAGG AAAGTATGCA
    GAAAAGGATG CTAAATTAAG AATCCCGAGA AACCCAAGAC CTAATGCTTC TAACACTGTT
    GAGCTACGTG ATCCCGTGTA TCCAGAGAAG TTACAGTTCT CTGGGTTTAT GGCACCATAT
    GTATCTGGTG CGAATAGCTT TATTGAACCA GAGAACATAC CTCTCTATTC GTTTAGCATG
    GACAAACGAG CAGCTACAAA TCCTCCTGTG GCTTCTGAGT CTTCATTGCA GATGAACTCT
    CTGGGAATGG GTTCATTGTC TGTGCCATGG ATGGATTCCA TGAGTAAATC ATTTAACTTG
    GCTAGTTCGG TCCCAGTGCC TAAGCTGATT TCTGGGTCAT TCCATGTCGG TACCAATCCT
    AAACAATTTT ACAGAGTGGT ACATGAGCCA GAAAGCAGAG AAAATGATCA AATCTCCTCA
    GCCATCAGTA AATTTCAAGA CCTCGGAGTA TCAAGCTCCT CAAAAAGTGC TTCTGTAACT
    TCAGAAGATG CTTCTTCTCC AGCGGATCTT GTAGGAGAGC CATCTCTGTC AAGGACATCG
    GTTCCGGATT CAGGGTGGAA GCCTCGTGGA GTATTAGTTG CTCATCTACA AGAGCATCGC
    TCTGCGGTCA ATGACATTGC CACTTCAAGC GATCATAGCT TTTTTGTTAG TGCATCAGAT
    GATTCCACAG TGAAGGTGTG GGACTCTAGA AAACTGGAAA AGGACATCTC TTTTAGGTCA
    AGGCTAACAT ATCATCTCGA GGGAAGCAGA GGGATGTGCA CAACAATGCT TCGGAATTCA
    ACGCAAGTTG TAGTTGGAGC CTCTGATGGT GTGATACATA TGTTTTCAAT TGACCATATC
    TCCAGAGGCT TGGGGAACGT AGTGGAGAAG TATTCAGGCA TTGTTGATAT TAAAAAGAAA
    GATGTTAAAG AAGGCGCTCT AGTTTCTCTC TTGAATTATA CTGCTGATAG CCTTTCTGGT
    CCGATGGTAA TGTATAGTAC CCAAAACTGC GGAATCCACC TTTGGGATAC AAGGTCAGAT
    TTAGATGCAT GGACACTGAA AGCAAATCCT GAAGAAGGAT ATGTGTCTTC ATTGGTTACA
    AGTCCTTGTG GGAATTGGTT TGTCTCTGGG TCTTCAAGGG GAGTGCTTAC TCTTTGGGAT
    TTGAGATTTC GTGTTCCTGT AAATTCGTGG CAATACCCCA TCATATGTCC CATAGAGAAG
    ATGTGCCTCT GCTTTCTTCC TCCAAGCGTC TCAGTGTCCA CCACTATGAA ACCTTTAATT
    TATGTTGCTG CCGGTTGCAA CGAAGTTTCA CTCTGGAATG CAGAGGGAGG TAGCTGTCAC
    CAGGTATTGA GAGTAGCCAA TTATGAAAAT GAGACGGATG TTTCCGAGTT TCAATGGAAG
    TTACCAAGCA ATAAGGTAAA TCCGAAGCCG AATCATCGTC AGAACATGAG CTCCAAGTAC
    AGAATCGAAG AGTTGAACGA GCCTCCTCCT CGTCTTCCTG GTATCCGCTC TTTGCTCCCT
    TTACCTGGAG GTGACTTGTT AACAGGTGGT ACTGACTTGA AGATTCGGCG TTGGGATTAC
    TCCAGCCCTG AGAGAAGTTA TTGTATATGC GGTCCGAGTT TGAAAGGAGT CGGAAATGAT
    GATTTCTATG AACTAAAAAC CAACACGGGC GTGCAATTTG TTCAGGAGAC AAAGAGACGG
    CCTCTGGCTA CTAAACTGAC GGCAAAGGCG GTACTTGCGG CTGCTGCGAC AGACACAGCG
    GGTTGTCATC GTGACTCAGT TCAGTCTCTG GCATCTGTGA AGCTGAACCA GAGACTGTTG
    ATATCAAGCA GCAGAGATGG AGCCATAAAG GTCTGGAAGT AA

    The stop codons will be deleted if pcDNA3.1+/C-(K)DYK vector is selected.

    CloneID OAb16655
    Clone ID Related Accession (Same CDS sequence) NM_119083.3
    Accession Version NM_119083.3 Latest version! Documents for ORF clone product in default vector
    Sequence Information ORF Nucleotide Sequence (Length: 4485bp)
    Protein sequence
    SNP
    Vector pcDNA3.1-C-(k)DYK or customized vector User Manual
    Clone information Clone Map MSDS
    Tag on pcDNA3.1+/C-(K)DYK C terminal DYKDDDDK tags
    ORF Insert Method CloneEZ™ Seamless cloning technology
    Insert Structure linear
    Update Date 2019-02-13
    Organism Arabidopsis thaliana(thale cress)
    Product protein kinase family protein / WD-40 repeat family protein
    Comment Comment: REVIEWED REFSEQ: This record has been curated by TAIR and Araport. This record is derived from an annotated genomic sequence (NC_003075). On Sep 12, 2016 this sequence version replaced NM_119083.2.

    1
    61
    121
    181
    241
    301
    361
    421
    481
    541
    601
    661
    721
    781
    841
    901
    961
    1021
    1081
    1141
    1201
    1261
    1321
    1381
    1441
    1501
    1561
    1621
    1681
    1741
    1801
    1861
    1921
    1981
    2041
    2101
    2161
    2221
    2281
    2341
    2401
    2461
    2521
    2581
    2641
    2701
    2761
    2821
    2881
    2941
    3001
    3061
    3121
    3181
    3241
    3301
    3361
    3421
    3481
    3541
    3601
    3661
    3721
    3781
    3841
    3901
    3961
    4021
    4081
    4141
    4201
    4261
    4321
    4381
    4441
    ATGGGAAACA AAATCGCTCG TACGACACAA GTCTCGGCGA CGGAGTACTA TCTCCACGAC 
    TTGCCGTCTT CATACAATCT GGTCTTAAAA GAGGTTTTAG GTCGAGGAAG ATTCCTAAAG
    TCGATTCAAT GTAAGCACGA TGAAGGATTG GTTGTTGTTA AGGTTTACTT CAAGCGTGGT
    GACTCGATCG ATCTCAGAGA GTATGAGCGT CGTCTCGTTA AGATCAAAGA TGTGTTTTTG
    TCTCTAGAAC ATCCTCACGT TTGGCCTTTT CAGTTTTGGC AAGAGACTGA TAAAGCAGCG
    TATCTAGTGA GGCAATACTT TTACAGTAAT CTACATGATC GCTTGAGTAC GAGGCCTTTC
    CTCAGTCTTG TAGAGAAGAA GTGGTTGGCG TTTCAGTTGC TTCTTGCTGT GAAGCAATGT
    CATGAGAAGG ATATATGTCA TGGTGATATC AAGTGCGAGA ACGTATTGTT GACTTCCTGG
    AACTGGCTTT ACCTTGCTGA TTTTGCATCC TTCAAACCTA CATACATTCC TTATGATGAT
    CCTTCAGACT TCTCGTTCTT CTTTGACACA AGGGGACAAA GACTTTGTTA TCTGGCTCCA
    GAGAGATTCT ATGAGCATGG AGGTGAGACA CAAGTAGCAC AAGATGCTCC ATTAAAGCCC
    TCCATGGATA TATTTGCTGT GGGGTGTGTG ATAGCCGAAC TTTTTCTTGA GGGTCAGCCA
    CTATTTGAAC TGGCGCAGCT TCTCGCTTAT CGTAGAGGGC AACATGATCC TAGCCAACAC
    CTTGAAAAGA TTCCTGATCC GGGAATTCGC AAGATGATTC TTCATATGAT TCAGTTAGAA
    CCCGAAGCAC GCCTATCTGC TGAAGACTAC CTGCAAAATT ATGTGGGAGT TGTTTTCCCA
    AACTACTTCT CACCATTTCT GCACACTTTA TATTGTTGTT GGAATCCACT TCCTTCAGAC
    ATGAGGGTAG CAACTTGCCA GGGGATATTT CAAGAAATAC TTAAAAAGAT GATGGAAAAT
    AAGTCAGGTG ATGAGATCGG CGTTGATTCT CCTGTAACTT CAAATCCAAT GAACGCAAGC
    ACAGTACAGG AAACTTTTGC AAATCACAAA TTGAACTCAT CTAAGGATTT GATAAGGAAT
    ACTGTGAACT CTAAGGATGA GATCTTTTAC TCTATTTCTG ATGCACTCAA GAAAAATCGC
    CATCCTTTCT TGAAAAAGAT AACAATGGAC GATTTGGGTA CACTGATGTC TCTCTATGAT
    AGCCGTTCTG ACACTTATGG CACGCCTTTT CTACCGGTAG AGGGTAACAT GAGATGTGAG
    GGAATGGTTC TGATTGCATC TATGCTCTGT TCTTGTATCC GCAATATCAA GTTGCCTCAT
    TTGAGGAGGG AAGCTATACT TCTATTGAGA TCTTGCTCTT TGTATATTGA TGATGATGAT
    CGCTTACAGC GTGTACTTCC ATACGTCGTT GCCTTGCTTT CTGATCCAAC AGCAATCGTG
    CGGTGCGCTG CCATGGAAAC TTTGTGTGAC ATTCTGCCGC TTGTCCGAGA TTTTCCTCCT
    AGTGATGCAA AGATTTTCCC AGAGTACATA TTTCCGATGC TCTCCATGCT TCCTGAAGAT
    ACGGAAGAGA GTGTGAGGAT ATGCTATGCC AGCAATATTG CAAAACTCGC TCTTACTGCT
    TATGGATTCT TGATACATTC TTTCCAGTTG AGCGATGTAG GGGTTCTTAA TGAATTGAAT
    TCCCAGCAGA TCTCCACTAC ACCTGCTAGT GAGACCCCTA GTCATTTGCA AAAGGCAAAT
    GGCAATGCGC AGCTTCAACA GCTTAGAAAA ACTATAGCTG AAGTTGTTCA AGAGCTTGTT
    ATGGGTCCAA AACAAACTCC AAATGTTAGA AGAGCACTCC TTCAGGACAT AGGGGAGCTC
    TGCTTTTTCT TTGGTCAGAG GCAGAGTAAT GACTTTCTAC TACCGATCCT CCCTGCCTTT
    CTAAACGACA GAGATGAGCA GCTAAGATCT GTATTCTTTG AGAAGATTGT TTATGTATGC
    TTTTTTGTTG GCCAGAGAAG TGTGGAGGAG TATCTATTGC CTTATATCGA TCAAGCTTTG
    AGTGATCAGA CGGAGGCTGT TATTGTCAAT GCATTGGAGT GCTTATCCAC ATTATGCAAG
    AGTAGTTTCT TGCGGAAGAG AGCTCTCCTC CAAATGATAG AGTGTGTTTA TCCTTTGTTG
    TGCTATCCAT CTCAATGGGT AAGGAGGGCA GTTGTCACTT TCATTGCCGC AAGTAGTGAA
    TGCTTAGGTG CAGTCGACTC TTATGCTTTT ATTGCCCCAG TAATACGCTC TTATCTTAGT
    AGACTGCCTG CGTCAATTGC TTCTGAGGAA GGTCTACTTT CATGTTTGAA GCCCCCTGTC
    ACAAGGGAGG TAGTTTATCG TATCTTTGAA AAAACCAGGA ACCCAGAATT CATGGCGAAA
    CAGCGAAAGA TGTGGTATAG TTCTTCACCT CAGTCCAAAG ATTGGGAATC TGTTGATTTG
    TTTGACAAAG ATGCTGGGGA GTTGAATTCA GTAGAATGCA GGGCCGAACA GAAGCAAAGT
    GTGGAAGGAA AAAAACAGAT TAAGAGTGCA TCAAAGCAAC CAGAAGTTCA AGGAAAGTAT
    GCAGAAAAGG ATGCTAAATT AAGAATCCCG AGAAACCCAA GACCTAATGC TTCTAACACT
    GTTGAGCTAC GTGATCCCGT GTATCCAGAG AAGTTACAGT TCTCTGGGTT TATGGCACCA
    TATGTATCTG GTGCGAATAG CTTTATTGAA CCAGAGAACA TACCTCTCTA TTCGTTTAGC
    ATGGACAAAC GAGCAGCTAC AAATCCTCCT GTGGCTTCTG AGTCTTCATT GCAGATGAAC
    TCTCTGGGAA TGGGTTCATT GTCTGTGCCA TGGATGGATT CCATGAGTAA ATCATTTAAC
    TTGGCTAGTT CGGTCCCAGT GCCTAAGCTG ATTTCTGGGT CATTCCATGT CGGTACCAAT
    CCTAAACAAT TTTACAGAGT GGTACATGAG CCAGAAAGCA GAGAAAATGA TCAAATCTCC
    TCAGCCATCA GTAAATTTCA AGACCTCGGA GTATCAAGCT CCTCAAAAAG TGCTTCTGTA
    ACTTCAGAAG ATGCTTCTTC TCCAGCGGAT CTTGTAGGAG AGCCATCTCT GTCAAGGACA
    TCGGTTCCGG ATTCAGGGTG GAAGCCTCGT GGAGTATTAG TTGCTCATCT ACAAGAGCAT
    CGCTCTGCGG TCAATGACAT TGCCACTTCA AGCGATCATA GCTTTTTTGT TAGTGCATCA
    GATGATTCCA CAGTGAAGGT GTGGGACTCT AGAAAACTGG AAAAGGACAT CTCTTTTAGG
    TCAAGGCTAA CATATCATCT CGAGGGAAGC AGAGGGATGT GCACAACAAT GCTTCGGAAT
    TCAACGCAAG TTGTAGTTGG AGCCTCTGAT GGTGTGATAC ATATGTTTTC AATTGACCAT
    ATCTCCAGAG GCTTGGGGAA CGTAGTGGAG AAGTATTCAG GCATTGTTGA TATTAAAAAG
    AAAGATGTTA AAGAAGGCGC TCTAGTTTCT CTCTTGAATT ATACTGCTGA TAGCCTTTCT
    GGTCCGATGG TAATGTATAG TACCCAAAAC TGCGGAATCC ACCTTTGGGA TACAAGGTCA
    GATTTAGATG CATGGACACT GAAAGCAAAT CCTGAAGAAG GATATGTGTC TTCATTGGTT
    ACAAGTCCTT GTGGGAATTG GTTTGTCTCT GGGTCTTCAA GGGGAGTGCT TACTCTTTGG
    GATTTGAGAT TTCGTGTTCC TGTAAATTCG TGGCAATACC CCATCATATG TCCCATAGAG
    AAGATGTGCC TCTGCTTTCT TCCTCCAAGC GTCTCAGTGT CCACCACTAT GAAACCTTTA
    ATTTATGTTG CTGCCGGTTG CAACGAAGTT TCACTCTGGA ATGCAGAGGG AGGTAGCTGT
    CACCAGGTAT TGAGAGTAGC CAATTATGAA AATGAGACGG ATGTTTCCGA GTTTCAATGG
    AAGTTACCAA GCAATAAGGT AAATCCGAAG CCGAATCATC GTCAGAACAT GAGCTCCAAG
    TACAGAATCG AAGAGTTGAA CGAGCCTCCT CCTCGTCTTC CTGGTATCCG CTCTTTGCTC
    CCTTTACCTG GAGGTGACTT GTTAACAGGT GGTACTGACT TGAAGATTCG GCGTTGGGAT
    TACTCCAGCC CTGAGAGAAG TTATTGTATA TGCGGTCCGA GTTTGAAAGG AGTCGGAAAT
    GATGATTTCT ATGAACTAAA AACCAACACG GGCGTGCAAT TTGTTCAGGA GACAAAGAGA
    CGGCCTCTGG CTACTAAACT GACGGCAAAG GCGGTACTTG CGGCTGCTGC GACAGACACA
    GCGGGTTGTC ATCGTGACTC AGTTCAGTCT CTGGCATCTG TGAAGCTGAA CCAGAGACTG
    TTGATATCAA GCAGCAGAGA TGGAGCCATA AAGGTCTGGA AGTAA

    The stop codons will be deleted if pcDNA3.1+/C-(K)DYK vector is selected.

    RefSeq NP_194667.1
    CDS175..4659
    Translation

    Target ORF information:

    RefSeq Version NM_119083.3
    Organism Arabidopsis thaliana(thale cress)
    Definition Arabidopsis thaliana protein kinase family protein / WD-40 repeat family protein (VPS15), mRNA.

    Target ORF information:

    Epitope DYKDDDDK
    Bacterial selection AMPR
    Mammalian selection NeoR
    Vector pcDNA3.1+/C-(K)DYK
    NM_119083.3

    ORF Insert Sequence:

    1
    61
    121
    181
    241
    301
    361
    421
    481
    541
    601
    661
    721
    781
    841
    901
    961
    1021
    1081
    1141
    1201
    1261
    1321
    1381
    1441
    1501
    1561
    1621
    1681
    1741
    1801
    1861
    1921
    1981
    2041
    2101
    2161
    2221
    2281
    2341
    2401
    2461
    2521
    2581
    2641
    2701
    2761
    2821
    2881
    2941
    3001
    3061
    3121
    3181
    3241
    3301
    3361
    3421
    3481
    3541
    3601
    3661
    3721
    3781
    3841
    3901
    3961
    4021
    4081
    4141
    4201
    4261
    4321
    4381
    4441
    ATGGGAAACA AAATCGCTCG TACGACACAA GTCTCGGCGA CGGAGTACTA TCTCCACGAC 
    TTGCCGTCTT CATACAATCT GGTCTTAAAA GAGGTTTTAG GTCGAGGAAG ATTCCTAAAG
    TCGATTCAAT GTAAGCACGA TGAAGGATTG GTTGTTGTTA AGGTTTACTT CAAGCGTGGT
    GACTCGATCG ATCTCAGAGA GTATGAGCGT CGTCTCGTTA AGATCAAAGA TGTGTTTTTG
    TCTCTAGAAC ATCCTCACGT TTGGCCTTTT CAGTTTTGGC AAGAGACTGA TAAAGCAGCG
    TATCTAGTGA GGCAATACTT TTACAGTAAT CTACATGATC GCTTGAGTAC GAGGCCTTTC
    CTCAGTCTTG TAGAGAAGAA GTGGTTGGCG TTTCAGTTGC TTCTTGCTGT GAAGCAATGT
    CATGAGAAGG ATATATGTCA TGGTGATATC AAGTGCGAGA ACGTATTGTT GACTTCCTGG
    AACTGGCTTT ACCTTGCTGA TTTTGCATCC TTCAAACCTA CATACATTCC TTATGATGAT
    CCTTCAGACT TCTCGTTCTT CTTTGACACA AGGGGACAAA GACTTTGTTA TCTGGCTCCA
    GAGAGATTCT ATGAGCATGG AGGTGAGACA CAAGTAGCAC AAGATGCTCC ATTAAAGCCC
    TCCATGGATA TATTTGCTGT GGGGTGTGTG ATAGCCGAAC TTTTTCTTGA GGGTCAGCCA
    CTATTTGAAC TGGCGCAGCT TCTCGCTTAT CGTAGAGGGC AACATGATCC TAGCCAACAC
    CTTGAAAAGA TTCCTGATCC GGGAATTCGC AAGATGATTC TTCATATGAT TCAGTTAGAA
    CCCGAAGCAC GCCTATCTGC TGAAGACTAC CTGCAAAATT ATGTGGGAGT TGTTTTCCCA
    AACTACTTCT CACCATTTCT GCACACTTTA TATTGTTGTT GGAATCCACT TCCTTCAGAC
    ATGAGGGTAG CAACTTGCCA GGGGATATTT CAAGAAATAC TTAAAAAGAT GATGGAAAAT
    AAGTCAGGTG ATGAGATCGG CGTTGATTCT CCTGTAACTT CAAATCCAAT GAACGCAAGC
    ACAGTACAGG AAACTTTTGC AAATCACAAA TTGAACTCAT CTAAGGATTT GATAAGGAAT
    ACTGTGAACT CTAAGGATGA GATCTTTTAC TCTATTTCTG ATGCACTCAA GAAAAATCGC
    CATCCTTTCT TGAAAAAGAT AACAATGGAC GATTTGGGTA CACTGATGTC TCTCTATGAT
    AGCCGTTCTG ACACTTATGG CACGCCTTTT CTACCGGTAG AGGGTAACAT GAGATGTGAG
    GGAATGGTTC TGATTGCATC TATGCTCTGT TCTTGTATCC GCAATATCAA GTTGCCTCAT
    TTGAGGAGGG AAGCTATACT TCTATTGAGA TCTTGCTCTT TGTATATTGA TGATGATGAT
    CGCTTACAGC GTGTACTTCC ATACGTCGTT GCCTTGCTTT CTGATCCAAC AGCAATCGTG
    CGGTGCGCTG CCATGGAAAC TTTGTGTGAC ATTCTGCCGC TTGTCCGAGA TTTTCCTCCT
    AGTGATGCAA AGATTTTCCC AGAGTACATA TTTCCGATGC TCTCCATGCT TCCTGAAGAT
    ACGGAAGAGA GTGTGAGGAT ATGCTATGCC AGCAATATTG CAAAACTCGC TCTTACTGCT
    TATGGATTCT TGATACATTC TTTCCAGTTG AGCGATGTAG GGGTTCTTAA TGAATTGAAT
    TCCCAGCAGA TCTCCACTAC ACCTGCTAGT GAGACCCCTA GTCATTTGCA AAAGGCAAAT
    GGCAATGCGC AGCTTCAACA GCTTAGAAAA ACTATAGCTG AAGTTGTTCA AGAGCTTGTT
    ATGGGTCCAA AACAAACTCC AAATGTTAGA AGAGCACTCC TTCAGGACAT AGGGGAGCTC
    TGCTTTTTCT TTGGTCAGAG GCAGAGTAAT GACTTTCTAC TACCGATCCT CCCTGCCTTT
    CTAAACGACA GAGATGAGCA GCTAAGATCT GTATTCTTTG AGAAGATTGT TTATGTATGC
    TTTTTTGTTG GCCAGAGAAG TGTGGAGGAG TATCTATTGC CTTATATCGA TCAAGCTTTG
    AGTGATCAGA CGGAGGCTGT TATTGTCAAT GCATTGGAGT GCTTATCCAC ATTATGCAAG
    AGTAGTTTCT TGCGGAAGAG AGCTCTCCTC CAAATGATAG AGTGTGTTTA TCCTTTGTTG
    TGCTATCCAT CTCAATGGGT AAGGAGGGCA GTTGTCACTT TCATTGCCGC AAGTAGTGAA
    TGCTTAGGTG CAGTCGACTC TTATGCTTTT ATTGCCCCAG TAATACGCTC TTATCTTAGT
    AGACTGCCTG CGTCAATTGC TTCTGAGGAA GGTCTACTTT CATGTTTGAA GCCCCCTGTC
    ACAAGGGAGG TAGTTTATCG TATCTTTGAA AAAACCAGGA ACCCAGAATT CATGGCGAAA
    CAGCGAAAGA TGTGGTATAG TTCTTCACCT CAGTCCAAAG ATTGGGAATC TGTTGATTTG
    TTTGACAAAG ATGCTGGGGA GTTGAATTCA GTAGAATGCA GGGCCGAACA GAAGCAAAGT
    GTGGAAGGAA AAAAACAGAT TAAGAGTGCA TCAAAGCAAC CAGAAGTTCA AGGAAAGTAT
    GCAGAAAAGG ATGCTAAATT AAGAATCCCG AGAAACCCAA GACCTAATGC TTCTAACACT
    GTTGAGCTAC GTGATCCCGT GTATCCAGAG AAGTTACAGT TCTCTGGGTT TATGGCACCA
    TATGTATCTG GTGCGAATAG CTTTATTGAA CCAGAGAACA TACCTCTCTA TTCGTTTAGC
    ATGGACAAAC GAGCAGCTAC AAATCCTCCT GTGGCTTCTG AGTCTTCATT GCAGATGAAC
    TCTCTGGGAA TGGGTTCATT GTCTGTGCCA TGGATGGATT CCATGAGTAA ATCATTTAAC
    TTGGCTAGTT CGGTCCCAGT GCCTAAGCTG ATTTCTGGGT CATTCCATGT CGGTACCAAT
    CCTAAACAAT TTTACAGAGT GGTACATGAG CCAGAAAGCA GAGAAAATGA TCAAATCTCC
    TCAGCCATCA GTAAATTTCA AGACCTCGGA GTATCAAGCT CCTCAAAAAG TGCTTCTGTA
    ACTTCAGAAG ATGCTTCTTC TCCAGCGGAT CTTGTAGGAG AGCCATCTCT GTCAAGGACA
    TCGGTTCCGG ATTCAGGGTG GAAGCCTCGT GGAGTATTAG TTGCTCATCT ACAAGAGCAT
    CGCTCTGCGG TCAATGACAT TGCCACTTCA AGCGATCATA GCTTTTTTGT TAGTGCATCA
    GATGATTCCA CAGTGAAGGT GTGGGACTCT AGAAAACTGG AAAAGGACAT CTCTTTTAGG
    TCAAGGCTAA CATATCATCT CGAGGGAAGC AGAGGGATGT GCACAACAAT GCTTCGGAAT
    TCAACGCAAG TTGTAGTTGG AGCCTCTGAT GGTGTGATAC ATATGTTTTC AATTGACCAT
    ATCTCCAGAG GCTTGGGGAA CGTAGTGGAG AAGTATTCAG GCATTGTTGA TATTAAAAAG
    AAAGATGTTA AAGAAGGCGC TCTAGTTTCT CTCTTGAATT ATACTGCTGA TAGCCTTTCT
    GGTCCGATGG TAATGTATAG TACCCAAAAC TGCGGAATCC ACCTTTGGGA TACAAGGTCA
    GATTTAGATG CATGGACACT GAAAGCAAAT CCTGAAGAAG GATATGTGTC TTCATTGGTT
    ACAAGTCCTT GTGGGAATTG GTTTGTCTCT GGGTCTTCAA GGGGAGTGCT TACTCTTTGG
    GATTTGAGAT TTCGTGTTCC TGTAAATTCG TGGCAATACC CCATCATATG TCCCATAGAG
    AAGATGTGCC TCTGCTTTCT TCCTCCAAGC GTCTCAGTGT CCACCACTAT GAAACCTTTA
    ATTTATGTTG CTGCCGGTTG CAACGAAGTT TCACTCTGGA ATGCAGAGGG AGGTAGCTGT
    CACCAGGTAT TGAGAGTAGC CAATTATGAA AATGAGACGG ATGTTTCCGA GTTTCAATGG
    AAGTTACCAA GCAATAAGGT AAATCCGAAG CCGAATCATC GTCAGAACAT GAGCTCCAAG
    TACAGAATCG AAGAGTTGAA CGAGCCTCCT CCTCGTCTTC CTGGTATCCG CTCTTTGCTC
    CCTTTACCTG GAGGTGACTT GTTAACAGGT GGTACTGACT TGAAGATTCG GCGTTGGGAT
    TACTCCAGCC CTGAGAGAAG TTATTGTATA TGCGGTCCGA GTTTGAAAGG AGTCGGAAAT
    GATGATTTCT ATGAACTAAA AACCAACACG GGCGTGCAAT TTGTTCAGGA GACAAAGAGA
    CGGCCTCTGG CTACTAAACT GACGGCAAAG GCGGTACTTG CGGCTGCTGC GACAGACACA
    GCGGGTTGTC ATCGTGACTC AGTTCAGTCT CTGGCATCTG TGAAGCTGAA CCAGAGACTG
    TTGATATCAA GCAGCAGAGA TGGAGCCATA AAGGTCTGGA AGTAA

    The stop codons will be deleted if pcDNA3.1+/C-(K)DYK vector is selected.

  • PubMed

    Sequence and analysis of chromosome 4 of the plant Arabidopsis thaliana.
    Nature402(6763)769-77(1999 Dec)
    Mayer K,Sch?ller C,Wambutt R,Murphy G,Volckaert G,Pohl T,D?sterh?ft A,Stiekema W,Entian KD,Terryn N,Harris B,Ansorge W,Brandt P,Grivell L,Rieger M,Weichselgartner M,de Simone V,Obermaier B,Mache R,M?ller M,Kreis M,Delseny M,Puigdomenech P,Watson M,Schmidtheini T,Reichert B,Portatelle D,Perez-Alonso M,Boutry M,Bancroft I,Vos P,Hoheisel J,Zimmermann W,Wedler H,Ridley P,Langham SA,McCullagh B,Bilham L,Robben J,Van der Schueren J,Grymonprez B,Chuang YJ,Vandenbussche F,Braeken M,Weltjens I,Voet M,Bastiaens I,Aert R,Defoor E,Weitzenegger T,Bothe G,Ramsperger U,Hilbert H,Braun M,Holzer E,Brandt A,Peters S,van Staveren M,Dirske W,Mooijman P,Klein Lankhorst R,Rose M,Hauf J,K?tter P,Berneiser S,Hempel S,Feldpausch M,Lamberth S,Van den Daele H,De Keyser A,Buysshaert C,Gielen J,Villarroel R,De Clercq R,Van Montagu M,Rogers J,Cronin A,Quail M,Bray-Allen S,Clark L,Doggett J,Hall S,Kay M,Lennard N,McLay K,Mayes R,Pettett A,Rajandream MA,Lyne M,Benes V,Rechmann S,Borkova D,Bl?cker H,Scharfe M,Grimm M,L?hnert TH,Dose S,de Haan M,Maarse A,Sch?fer M,M?ller-Auer S,Gabel C,Fuchs M,Fartmann B,Granderath K,Dauner D,Herzl A,Neumann S,Argiriou A,Vitale D,Liguori R,Piravandi E,Massenet O,Quigley F,Clabauld G,M?ndlein A,Felber R,Schnabl S,Hiller R,Schmidt W,Lecharny A,Aubourg S,Chefdor F,Cooke R,Berger C,Montfort A,Casacuberta E,Gibbons T,Weber N,Vandenbol M,Bargues M,Terol J,Torres A,Perez-Perez A,Purnelle B,Bent E,Johnson S,Tacon D,Jesse T,Heijnen L,Schwarz S,Scholler P,Heber S,Francs P,Bielke C,Frishman D,Haase D,Lemcke K,Mewes HW,Stocker S,Zaccaria P,Bevan M,Wilson RK,de la Bastide M,Habermann K,Parnell L,Dedhia N,Gnoj L,Schutz K,Huang E,Spiegel L,Sehkon M,Murray J,Sheet P,Cordes M,Abu-Threideh J,Stoneking T,Kalicki J,Graves T,Harmon G,Edwards J,Latreille P,Courtney L,Cloud J,Abbott A,Scott K,Johnson D,Minx P,Bentley D,Fulton B,Miller N,Greco T,Kemp K,Kramer J,Fulton L,Mardis E,Dante M,Pepin K,Hillier L,Nelson J,Spieth J,Ryan E,Andrews S,Geisel C,Layman D,Du H,Ali J,Berghoff A,Jones K,Drone K,Cotton M,Joshu C,Antonoiu B,Zidanic M,Strong C,Sun H,Lamar B,Yordan C,Ma P,Zhong J,Preston R,Vil D,Shekher M,Matero A,Shah R,Swaby IK,O'Shaughnessy A,Rodriguez M,Hoffmann J,Till S,Granat S,Shohdy N,Hasegawa A,Hameed A,Lodhi M,Johnson A,Chen E,Marra M,Martienssen R,McCombie WR