EMB3103 cDNA ORF clone, Arabidopsis thaliana(thale cress)

The following EMB3103 gene cDNA ORF clone sequences were retrieved from the NCBI Reference Sequence Database (RefSeq). These sequences represent the protein coding region of the EMB3103 cDNA ORF which is encoded by the open reading frame (ORF) sequence. ORF sequences can be delivered in our standard vector, pcDNA3.1+/C-(K)DYK or the vector of your choice as an expression/transfection-ready ORF clone. Not the clone you want? Click here to find your clone.

***CloneID Accession No. Definition **Vector *Turnaround time Price (USD) Select
OAb23190 NM_100966.3
Latest version!
Arabidopsis thaliana Pentatricopeptide repeat (PPR) superfamily protein (EMB3103), mRNA. pcDNA3.1-C-(k)DYK or customized vector 7-9 $342.30
$489.00
OAb33391 NM_001331941.1
Latest version!
Arabidopsis thaliana Pentatricopeptide repeat (PPR) superfamily protein (EMB3103), mRNA. pcDNA3.1-C-(k)DYK or customized vector 14-16 $342.30
$489.00

ORF Online Only Promotion

Next-day Shipping ORF Clones ( in default vector with tag)
1 Clone 30% OFF
2-4 Clone 40% OFF
5 or more Clone 50% OFF
All Other ORF Clones
30% OFF

*Business Day

** You may select a custom vector to replace pcDNA3.1+/C-(K)DYK after clone is added to cart.

** GenScript guarantees 100% sequence accuracy of all synthetic DNA constructs we deliver, but we do not guarantee protein expression in your experimental system. Protein expression is influenced by many factors that may vary between experiments or laboratories. In addition, please pay attention to the signal peptide, propeptide and transit peptide in target ORF, which may affect the choice of vector (N/C terminal tag vector).

***One clone ID might be correlated to multiple accession numbers, which share the same CDS sequence.

  • Reference Sequences (Refseq)
    CloneID OAb23190
    Clone ID Related Accession (Same CDS sequence) NM_100966.3
    Accession Version NM_100966.3 Latest version! Documents for ORF clone product in default vector
    Sequence Information ORF Nucleotide Sequence (Length: 1995bp)
    Protein sequence
    SNP
    Vector pcDNA3.1-C-(k)DYK or customized vector User Manual
    Clone information Clone Map MSDS
    Tag on pcDNA3.1+/C-(K)DYK C terminal DYKDDDDK tags
    ORF Insert Method CloneEZ™ Seamless cloning technology
    Insert Structure linear
    Update Date 2019-02-13
    Organism Arabidopsis thaliana(thale cress)
    Product Pentatricopeptide repeat (PPR) superfamily protein
    Comment Comment: REVIEWED REFSEQ: This record has been curated by TAIR and Araport. This record is derived from an annotated genomic sequence (NC_003070). On Sep 12, 2016 this sequence version replaced NM_100966.2.

    1
    61
    121
    181
    241
    301
    361
    421
    481
    541
    601
    661
    721
    781
    841
    901
    961
    1021
    1081
    1141
    1201
    1261
    1321
    1381
    1441
    1501
    1561
    1621
    1681
    1741
    1801
    1861
    1921
    1981
    ATGGAGACGC CACTTCTTGT GGGACTCGAA CTACGTTGTC CTCCTCATCT CTTCAACACT 
    CACTCTCGTC CCTCTTCATC TCTCTCCATT CCCGCCTTAT CCTTACGAAT CCTCACGCCG
    ACGGCGGCAA CGACATCTTC CGCTGTTATT GAATTACCGG CGAACGTAGC AGAAGCTCCT
    CGTTCCAAAC GCCATTCCAA TTCATACCTG GCGAGAAAAT CCGCCATTTC TGAAGTTCAA
    CGCTCCTCGG ATTTTCTCTC TTCTCTGCAG AGATTAGCAA CAGTTTTGAA GGTACAAGAT
    TTGAATGTAA TTTTGCGTGA TTTTGGAATC TCTGGAAGAT GGCAGGATCT TATACAGCTC
    TTTGAATGGA TGCAACAACA TGGAAAGATT AGTGTTTCAA CTTACAGTAG CTGCATAAAG
    TTTGTTGGGG CCAAAAATGT CTCCAAGGCT CTAGAAATAT ACCAAAGCAT TCCAGACGAG
    TCTACCAAAA TCAATGTCTA TATATGTAAC TCCATTCTTA GTTGTCTGGT CAAGAATGGA
    AAGCTTGACA GCTGCATCAA ATTGTTTGAT CAGATGAAGC GCGATGGTCT GAAACCAGAT
    GTGGTTACAT ATAACACGTT GCTTGCAGGT TGCATTAAGG TAAAAAATGG ATACCCTAAG
    GCTATTGAAC TCATTGGAGA GCTGCCTCAT AATGGAATAC AAATGGACAG TGTGATGTAT
    GGGACTGTCT TGGCCATTTG TGCTTCAAAT GGTCGAAGTG AAGAAGCTGA AAACTTTATC
    CAGCAGATGA AAGTTGAAGG TCATTCGCCC AACATATATC ATTACAGCTC TTTACTCAAT
    TCATATTCTT GGAAAGGAGA TTACAAGAAA GCTGATGAGC TGATGACCGA GATGAAATCA
    ATAGGATTAG TGCCAAACAA GGTGATGATG ACAACTTTAC TTAAGGTTTA TATCAAAGGA
    GGGTTGTTTG ATAGATCAAG AGAATTACTT TCTGAACTTG AATCTGCTGG GTACGCCGAG
    AACGAGATGC CGTATTGTAT GTTGATGGAT GGTCTTTCAA AGGCGGGAAA GTTAGAAGAA
    GCGAGGTCAA TCTTTGATGA TATGAAAGGG AAAGGTGTTA GATCTGATGG CTATGCCAAC
    AGCATCATGA TATCTGCTTT ATGTCGAAGT AAGCGTTTTA AGGAGGCAAA AGAACTGTCA
    AGGGACTCTG AAACCACTTA TGAAAAATGC GACTTGGTAA TGTTAAACAC AATGCTCTGT
    GCCTATTGCA GAGCAGGAGA GATGGAAAGT GTTATGCGAA TGATGAAGAA AATGGATGAG
    CAAGCCGTTA GTCCCGACTA TAATACTTTC CATATCTTGA TCAAATACTT CATCAAGGAG
    AAATTGCACC TGCTTGCGTA CCAAACCACA CTGGACATGC ACAGCAAAGG CCACAGGCTT
    GAGGAGGAAC TTTGCTCGTC CTTGATATAT CATCTCGGCA AGATTAGAGC TCAAGCGGAA
    GCATTCTCAG TCTACAATAT GTTGCGATAC AGCAAAAGAA CTATCTGCAA AGAGCTGCAT
    GAGAAAATTC TTCACATTCT AATCCAAGGG AATCTCCTAA AAGATGCATA CATTGTAGTG
    AAGGACAATG CGAAGATGAT CTCACAGCCT ACTTTAAAGA AATTTGGCAG AGCTTTTATG
    ATCTCGGGTA ATATTAATTT AGTGAATGAT GTTCTGAAAG TATTGCATGG TTCTGGCCAC
    AAAATCGATC AGGTTCAGTT TGAGATTGCG ATTTCTCGAT ACATTTCGCA GCCCGATAAA
    AAGGAATTGC TTCTTCAACT TCTACAATGG ATGCCTGGTC AAGGATATGT TGTTGACTCC
    TCCACAAGAA ACCTCATTCT CAAGAACTCT CATATGTTTG GTCGGCTACT CATTGCAGAG
    ATCTTGTCAA AGCATCATGT CGCTTCAAGA CCGATGATAA AATCACGACC AGAGCAAAAA
    TTTAGATGTA AATAA

    The stop codons will be deleted if pcDNA3.1+/C-(K)DYK vector is selected.

    RefSeq NP_172560.2
    CDS74..2068
    Translation

    Target ORF information:

    RefSeq Version NM_100966.3
    Organism Arabidopsis thaliana(thale cress)
    Definition Arabidopsis thaliana Pentatricopeptide repeat (PPR) superfamily protein (EMB3103), mRNA.

    Target ORF information:

    Epitope DYKDDDDK
    Bacterial selection AMPR
    Mammalian selection NeoR
    Vector pcDNA3.1+/C-(K)DYK
    NM_100966.3

    ORF Insert Sequence:

    1
    61
    121
    181
    241
    301
    361
    421
    481
    541
    601
    661
    721
    781
    841
    901
    961
    1021
    1081
    1141
    1201
    1261
    1321
    1381
    1441
    1501
    1561
    1621
    1681
    1741
    1801
    1861
    1921
    1981
    ATGGAGACGC CACTTCTTGT GGGACTCGAA CTACGTTGTC CTCCTCATCT CTTCAACACT 
    CACTCTCGTC CCTCTTCATC TCTCTCCATT CCCGCCTTAT CCTTACGAAT CCTCACGCCG
    ACGGCGGCAA CGACATCTTC CGCTGTTATT GAATTACCGG CGAACGTAGC AGAAGCTCCT
    CGTTCCAAAC GCCATTCCAA TTCATACCTG GCGAGAAAAT CCGCCATTTC TGAAGTTCAA
    CGCTCCTCGG ATTTTCTCTC TTCTCTGCAG AGATTAGCAA CAGTTTTGAA GGTACAAGAT
    TTGAATGTAA TTTTGCGTGA TTTTGGAATC TCTGGAAGAT GGCAGGATCT TATACAGCTC
    TTTGAATGGA TGCAACAACA TGGAAAGATT AGTGTTTCAA CTTACAGTAG CTGCATAAAG
    TTTGTTGGGG CCAAAAATGT CTCCAAGGCT CTAGAAATAT ACCAAAGCAT TCCAGACGAG
    TCTACCAAAA TCAATGTCTA TATATGTAAC TCCATTCTTA GTTGTCTGGT CAAGAATGGA
    AAGCTTGACA GCTGCATCAA ATTGTTTGAT CAGATGAAGC GCGATGGTCT GAAACCAGAT
    GTGGTTACAT ATAACACGTT GCTTGCAGGT TGCATTAAGG TAAAAAATGG ATACCCTAAG
    GCTATTGAAC TCATTGGAGA GCTGCCTCAT AATGGAATAC AAATGGACAG TGTGATGTAT
    GGGACTGTCT TGGCCATTTG TGCTTCAAAT GGTCGAAGTG AAGAAGCTGA AAACTTTATC
    CAGCAGATGA AAGTTGAAGG TCATTCGCCC AACATATATC ATTACAGCTC TTTACTCAAT
    TCATATTCTT GGAAAGGAGA TTACAAGAAA GCTGATGAGC TGATGACCGA GATGAAATCA
    ATAGGATTAG TGCCAAACAA GGTGATGATG ACAACTTTAC TTAAGGTTTA TATCAAAGGA
    GGGTTGTTTG ATAGATCAAG AGAATTACTT TCTGAACTTG AATCTGCTGG GTACGCCGAG
    AACGAGATGC CGTATTGTAT GTTGATGGAT GGTCTTTCAA AGGCGGGAAA GTTAGAAGAA
    GCGAGGTCAA TCTTTGATGA TATGAAAGGG AAAGGTGTTA GATCTGATGG CTATGCCAAC
    AGCATCATGA TATCTGCTTT ATGTCGAAGT AAGCGTTTTA AGGAGGCAAA AGAACTGTCA
    AGGGACTCTG AAACCACTTA TGAAAAATGC GACTTGGTAA TGTTAAACAC AATGCTCTGT
    GCCTATTGCA GAGCAGGAGA GATGGAAAGT GTTATGCGAA TGATGAAGAA AATGGATGAG
    CAAGCCGTTA GTCCCGACTA TAATACTTTC CATATCTTGA TCAAATACTT CATCAAGGAG
    AAATTGCACC TGCTTGCGTA CCAAACCACA CTGGACATGC ACAGCAAAGG CCACAGGCTT
    GAGGAGGAAC TTTGCTCGTC CTTGATATAT CATCTCGGCA AGATTAGAGC TCAAGCGGAA
    GCATTCTCAG TCTACAATAT GTTGCGATAC AGCAAAAGAA CTATCTGCAA AGAGCTGCAT
    GAGAAAATTC TTCACATTCT AATCCAAGGG AATCTCCTAA AAGATGCATA CATTGTAGTG
    AAGGACAATG CGAAGATGAT CTCACAGCCT ACTTTAAAGA AATTTGGCAG AGCTTTTATG
    ATCTCGGGTA ATATTAATTT AGTGAATGAT GTTCTGAAAG TATTGCATGG TTCTGGCCAC
    AAAATCGATC AGGTTCAGTT TGAGATTGCG ATTTCTCGAT ACATTTCGCA GCCCGATAAA
    AAGGAATTGC TTCTTCAACT TCTACAATGG ATGCCTGGTC AAGGATATGT TGTTGACTCC
    TCCACAAGAA ACCTCATTCT CAAGAACTCT CATATGTTTG GTCGGCTACT CATTGCAGAG
    ATCTTGTCAA AGCATCATGT CGCTTCAAGA CCGATGATAA AATCACGACC AGAGCAAAAA
    TTTAGATGTA AATAA

    The stop codons will be deleted if pcDNA3.1+/C-(K)DYK vector is selected.

    CloneID OAb33391
    Clone ID Related Accession (Same CDS sequence) NM_001331941.1
    Accession Version NM_001331941.1 Latest version! Documents for ORF clone product in default vector
    Sequence Information ORF Nucleotide Sequence (Length: 1779bp)
    Protein sequence
    SNP
    Vector pcDNA3.1-C-(k)DYK or customized vector User Manual
    Clone information Clone Map MSDS
    Tag on pcDNA3.1+/C-(K)DYK C terminal DYKDDDDK tags
    ORF Insert Method CloneEZ™ Seamless cloning technology
    Insert Structure linear
    Update Date 2019-02-13
    Organism Arabidopsis thaliana(thale cress)
    Product Pentatricopeptide repeat (PPR) superfamily protein
    Comment Comment: REVIEWED REFSEQ: This record has been curated by TAIR and Araport. This record is derived from an annotated genomic sequence (NC_003070).

    1
    61
    121
    181
    241
    301
    361
    421
    481
    541
    601
    661
    721
    781
    841
    901
    961
    1021
    1081
    1141
    1201
    1261
    1321
    1381
    1441
    1501
    1561
    1621
    1681
    1741
    ATGGAGACGC CACTTCTTGT GGGACTCGAA CTACGTTGTC CTCCTCATCT CTTCAACACT 
    CACTCTCGTC CCTCTTCATC TCTCTCCATT CCCGCCTTAT CCTTACGAAT CCTCACGCCG
    ACGGCGGCAA CGACATCTTC CGCTGTTATT GAATTACCGG CGAACGTAGC AGAAGCTCCT
    CGTTCCAAAC GCCATTCCAA TTCATACCTG GCGAGAAAAT CCGCCATTTC TGAAGTTCAA
    CGCTCCTCGG ATTTTCTCTC TTCTCTGCAG AGATTAGCAA CAGTTTTGAA GGTACAAGAT
    TTGAATGTAA TTTTGCGTGA TTTTGGAATC TCTGGAAGAT GGCAGGATCT TATACAGCTC
    TTTGAATGGA TGCAACAACA TGGAAAGATT AGTGTTTCAA CTTACAGTAG CTGCATAAAG
    TTTGTTGGGG CCAAAAATGT CTCCAAGGCT CTAGAAATAT ACCAAAGCAT TCCAGACGAG
    TCTACCAAAA TCAATGTCTA TATATGTAAC TCCATTCTTA GTTGTCTGGT CAAGAATGGA
    AAGCTTGACA GCTGCATCAA ATTGTTTGAT CAGATGAAGC GCGATGGTCT GAAACCAGAT
    GTGGTTACAT ATAACACGTT GCTTGCAGGT TGCATTAAGG TAAAAAATGG ATACCCTAAG
    GCTATTGAAC TCATTGGAGA GCTGCCTCAT AATGGAATAC AAATGGACAG TGTGATGTAT
    GGGACTGTCT TGGCCATTTG TGCTTCAAAT GGTCGAAGTG AAGAAGCTGA AAACTTTATC
    CAGCAGATGA AAGTTGAAGG TCATTCGCCC AACATATATC ATTACAGCTC TTTACTCAAT
    TCATATTCTT GGAAAGGAGA TTACAAGAAA GCTGATGAGC TGATGACCGA GATGAAATCA
    ATAGGATTAG TGCCAAACAA GGTGATGATG ACAACTTTAC TTAAGGTTTA TATCAAAGGA
    GGGTTGTTTG ATAGATCAAG AGAATTACTT TCTGAACTTG AATCTGCTGG GTACGCCGAG
    AACGAGATGC CGTATTGTAT GTTGATGGAT GGTCTTTCAA AGGCGGGAAA GTTAGAAGAA
    GCGAGGTCAA TCTTTGATGA TATGAAAGGG AAAGGTGTTA GATCTGATGG CTATGCCAAC
    AGCATCATGA TATCTGCTTT ATGTCGAAGT AAGCGTTTTA AGGAGGCAAA AGAACTGTCA
    AGGGACTCTG AAACCACTTA TGAAAAATGC GACTTGGTAA TGTTAAACAC AATGCTCTGT
    GCCTATTGCA GAGCAGGAGA GATGGAAAGT GTTATGCGAA TGATGAAGAA AATGGATGAG
    CAAGCCGTTA GTCCCGACTA TAATACTTTC CATATCTTGA TCAAATACTT CATCAAGGAG
    AAATTGCACC TGCTTGCGTA CCAAACCACA CTGGACATGC ACAGCAAAGG CCACAGGCTT
    GAGGAGGAAC TTTGCTCGTC CTTGATATAT CATCTCGGCA AGATTAGAGC TCAAGCGGAA
    GCATTCTCAG TCTACAATAT GTTGCGATAC AGCAAAAGAA CTATCTGCAA AGAGCTGCAT
    GAGAAAATTC TTCACATTCT AATCCAAGGG AATCTCCTAA AAGATGCATA CATTGTAGTG
    AAGGACAATG CGAAGATGAT CTCACAGCCT ACTTTAAAGA AATTTGGCAG AGCTTTTATG
    ATCTCGGGTA ATATTAATTT AGTGAATGAT GTTCTGAAAG TATTGCATGG TTCTGGCCAC
    AAAATCGATC AGGTAAATAG CCAACCGGTA ACAATTTGA

    The stop codons will be deleted if pcDNA3.1+/C-(K)DYK vector is selected.

    RefSeq NP_001320560.1
    CDS74..1852
    Translation

    Target ORF information:

    RefSeq Version NM_001331941.1
    Organism Arabidopsis thaliana(thale cress)
    Definition Arabidopsis thaliana Pentatricopeptide repeat (PPR) superfamily protein (EMB3103), mRNA.

    Target ORF information:

    Epitope DYKDDDDK
    Bacterial selection AMPR
    Mammalian selection NeoR
    Vector pcDNA3.1+/C-(K)DYK
    NM_001331941.1

    ORF Insert Sequence:

    1
    61
    121
    181
    241
    301
    361
    421
    481
    541
    601
    661
    721
    781
    841
    901
    961
    1021
    1081
    1141
    1201
    1261
    1321
    1381
    1441
    1501
    1561
    1621
    1681
    1741
    ATGGAGACGC CACTTCTTGT GGGACTCGAA CTACGTTGTC CTCCTCATCT CTTCAACACT 
    CACTCTCGTC CCTCTTCATC TCTCTCCATT CCCGCCTTAT CCTTACGAAT CCTCACGCCG
    ACGGCGGCAA CGACATCTTC CGCTGTTATT GAATTACCGG CGAACGTAGC AGAAGCTCCT
    CGTTCCAAAC GCCATTCCAA TTCATACCTG GCGAGAAAAT CCGCCATTTC TGAAGTTCAA
    CGCTCCTCGG ATTTTCTCTC TTCTCTGCAG AGATTAGCAA CAGTTTTGAA GGTACAAGAT
    TTGAATGTAA TTTTGCGTGA TTTTGGAATC TCTGGAAGAT GGCAGGATCT TATACAGCTC
    TTTGAATGGA TGCAACAACA TGGAAAGATT AGTGTTTCAA CTTACAGTAG CTGCATAAAG
    TTTGTTGGGG CCAAAAATGT CTCCAAGGCT CTAGAAATAT ACCAAAGCAT TCCAGACGAG
    TCTACCAAAA TCAATGTCTA TATATGTAAC TCCATTCTTA GTTGTCTGGT CAAGAATGGA
    AAGCTTGACA GCTGCATCAA ATTGTTTGAT CAGATGAAGC GCGATGGTCT GAAACCAGAT
    GTGGTTACAT ATAACACGTT GCTTGCAGGT TGCATTAAGG TAAAAAATGG ATACCCTAAG
    GCTATTGAAC TCATTGGAGA GCTGCCTCAT AATGGAATAC AAATGGACAG TGTGATGTAT
    GGGACTGTCT TGGCCATTTG TGCTTCAAAT GGTCGAAGTG AAGAAGCTGA AAACTTTATC
    CAGCAGATGA AAGTTGAAGG TCATTCGCCC AACATATATC ATTACAGCTC TTTACTCAAT
    TCATATTCTT GGAAAGGAGA TTACAAGAAA GCTGATGAGC TGATGACCGA GATGAAATCA
    ATAGGATTAG TGCCAAACAA GGTGATGATG ACAACTTTAC TTAAGGTTTA TATCAAAGGA
    GGGTTGTTTG ATAGATCAAG AGAATTACTT TCTGAACTTG AATCTGCTGG GTACGCCGAG
    AACGAGATGC CGTATTGTAT GTTGATGGAT GGTCTTTCAA AGGCGGGAAA GTTAGAAGAA
    GCGAGGTCAA TCTTTGATGA TATGAAAGGG AAAGGTGTTA GATCTGATGG CTATGCCAAC
    AGCATCATGA TATCTGCTTT ATGTCGAAGT AAGCGTTTTA AGGAGGCAAA AGAACTGTCA
    AGGGACTCTG AAACCACTTA TGAAAAATGC GACTTGGTAA TGTTAAACAC AATGCTCTGT
    GCCTATTGCA GAGCAGGAGA GATGGAAAGT GTTATGCGAA TGATGAAGAA AATGGATGAG
    CAAGCCGTTA GTCCCGACTA TAATACTTTC CATATCTTGA TCAAATACTT CATCAAGGAG
    AAATTGCACC TGCTTGCGTA CCAAACCACA CTGGACATGC ACAGCAAAGG CCACAGGCTT
    GAGGAGGAAC TTTGCTCGTC CTTGATATAT CATCTCGGCA AGATTAGAGC TCAAGCGGAA
    GCATTCTCAG TCTACAATAT GTTGCGATAC AGCAAAAGAA CTATCTGCAA AGAGCTGCAT
    GAGAAAATTC TTCACATTCT AATCCAAGGG AATCTCCTAA AAGATGCATA CATTGTAGTG
    AAGGACAATG CGAAGATGAT CTCACAGCCT ACTTTAAAGA AATTTGGCAG AGCTTTTATG
    ATCTCGGGTA ATATTAATTT AGTGAATGAT GTTCTGAAAG TATTGCATGG TTCTGGCCAC
    AAAATCGATC AGGTAAATAG CCAACCGGTA ACAATTTGA

    The stop codons will be deleted if pcDNA3.1+/C-(K)DYK vector is selected.

  • PubMed

    Sequence and analysis of chromosome 1 of the plant Arabidopsis thaliana.
    Nature408(6814)816-20(2000 Dec)
    Theologis A,Ecker JR,Palm CJ,Federspiel NA,Kaul S,White O,Alonso J,Altafi H,Araujo R,Bowman CL,Brooks SY,Buehler E,Chan A,Chao Q,Chen H,Cheuk RF,Chin CW,Chung MK,Conn L,Conway AB,Conway AR,Creasy TH,Dewar K,Dunn P,Etgu P,Feldblyum TV,Feng J,Fong B,Fujii CY,Gill JE,Goldsmith AD,Haas B,Hansen NF,Hughes B,Huizar L,Hunter JL,Jenkins J,Johnson-Hopson C,Khan S,Khaykin E,Kim CJ,Koo HL,Kremenetskaia I,Kurtz DB,Kwan A,Lam B,Langin-Hooper S,Lee A,Lee JM,Lenz CA,Li JH,Li Y,Lin X,Liu SX,Liu ZA,Luros JS,Maiti R,Marziali A,Militscher J,Miranda M,Nguyen M,Nierman WC,Osborne BI,Pai G,Peterson J,Pham PK,Rizzo M,Rooney T,Rowley D,Sakano H,Salzberg SL,Schwartz JR,Shinn P,Southwick AM,Sun H,Tallon LJ,Tambunga G,Toriumi MJ,Town CD,Utterback T,Van Aken S,Vaysberg M,Vysotskaia VS,Walker M,Wu D,Yu G,Fraser CM,Venter JC,Davis RW