HEG1 cDNA ORF clone, Sus scrofa(Pig)

The following HEG1 gene cDNA ORF clone sequences were retrieved from the NCBI Reference Sequence Database (RefSeq). These sequences represent the protein coding region of the HEG1 cDNA ORF which is encoded by the open reading frame (ORF) sequence. ORF sequences can be delivered in our standard vector, pcDNA3.1+/C-(K)DYK or the vector of your choice as an expression/transfection-ready ORF clone. Not the clone you want? Click here to find your clone.

***CloneID Accession No. Definition **Vector *Turnaround time Price (USD) Select
OSe236090 XM_021070219.1
Latest version!
Sus scrofa heart development protein with EGF like domains 1 (HEG1), mRNA. pcDNA3.1-C-(k)DYK or customized vector TBD $797.30
$1139.00
OSe42307 XM_013982369.1
Latest version!
Sus scrofa HEG homolog 1 (zebrafish) (HEG1), mRNA. pcDNA3.1-C-(k)DYK or customized vector TBD $797.30
$1139.00

ORF Online Only Promotion

Next-day Shipping ORF Clones ( in default vector with tag)
1 Clone 30% OFF
2-4 Clone 40% OFF
5 or more Clone 50% OFF
All Other ORF Clones
30% OFF

*Business Day

** You may select a custom vector to replace pcDNA3.1+/C-(K)DYK after clone is added to cart.

** GenScript guarantees 100% sequence accuracy of all synthetic DNA constructs we deliver, but we do not guarantee protein expression in your experimental system. Protein expression is influenced by many factors that may vary between experiments or laboratories. In addition, please pay attention to the signal peptide, propeptide and transit peptide in target ORF, which may affect the choice of vector (N/C terminal tag vector).

***One clone ID might be correlated to multiple accession numbers, which share the same CDS sequence.

  • Reference Sequences (Refseq)
    CloneID OSe236090
    Clone ID Related Accession (Same CDS sequence) XM_021070219.1
    Accession Version XM_021070219.1 Latest version! Documents for ORF clone product in default vector
    Sequence Information ORF Nucleotide Sequence (Length: 4065bp)
    Protein sequence
    SNP
    Vector pcDNA3.1-C-(k)DYK or customized vector User Manual
    Clone information Clone Map MSDS
    Tag on pcDNA3.1+/C-(K)DYK C terminal DYKDDDDK tags
    ORF Insert Method CloneEZ™ Seamless cloning technology
    Insert Structure linear
    Update Date 2017-05-12
    Organism Sus scrofa(pig)
    Product protein HEG homolog 1
    Comment Comment: MODEL REFSEQ: This record is predicted by automated computational analysis. This record is derived from a genomic sequence (NC_010455.5) annotated using gene prediction method: Gnomon, supported by mRNA and EST evidence. Also see: Documentation of NCBI's Annotation Process ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Status :: Full annotation Annotation Version :: Sus scrofa Annotation Release 106 Annotation Pipeline :: NCBI eukaryotic genome annotation pipeline Annotation Software Version :: 7.4 Annotation Method :: Best-placed RefSeq; Gnomon Features Annotated :: Gene; mRNA; CDS; ncRNA ##Genome-Annotation-Data-END##

    1
    61
    121
    181
    241
    301
    361
    421
    481
    541
    601
    661
    721
    781
    841
    901
    961
    1021
    1081
    1141
    1201
    1261
    1321
    1381
    1441
    1501
    1561
    1621
    1681
    1741
    1801
    1861
    1921
    1981
    2041
    2101
    2161
    2221
    2281
    2341
    2401
    2461
    2521
    2581
    2641
    2701
    2761
    2821
    2881
    2941
    3001
    3061
    3121
    3181
    3241
    3301
    3361
    3421
    3481
    3541
    3601
    3661
    3721
    3781
    3841
    3901
    3961
    4021
    ATGGCCTCGC CGCCGCCGCT CCTGCTGCTG CTGCTACCGC TGCTGCTACT GCCGCCGCCG 
    GCCGCCCCCA AGCCCCGCAA CGAGAGCGAG CCGCCGCCGC CGCCGCCGCC GCCCGCGCTG
    TCCCCGGAGC GCCACGGGCC CGCGCCTACG GGCCACCGCT GGACAGCCGC TGTGCGTGGC
    ACCGCGACGC GGCCCGAACC TTCGGGCCGA GTCCCCCGAG GCGGGAGCAC CGCTGCTATG
    AGGAACCACT GGCCAGAAAG CAACACTGAA CCCCACAAAG AAAACATGAC CTTCGGCCCG
    AGTCAAGTGG ACTTTTCCAC CGTGGTCTCC AAAGAGGGCG TGATGATGGT TCAGACCCCA
    GGAAAGAACC ACACTTCTTC AGAGGCTCCA CAGAACTCCA CCCTGCAAAC TGAAACAGCA
    GCTGCTGAAG GAAGAAATGA CTCTCTAAGA AGATCACATT TCACTGTTTC CCCCGTCGGA
    CCTGACAGAG CAGCAGCTCT GACCTCCCAG AGCATCACCT CAGCCTGGAG AAGGCATCAC
    CTGCCATCCA GCAGTTCAAG GTCAGAAGGA AGAACTTACC CTGCTCACAC GGACAGCGGG
    ACATCACAGG GTTCTCCAGC AGGGGGGCAG AGGCTCCCAG GAGACGTGAC TGTGCACACC
    CAAGTGGCGG CCACTTCGGT TTTGAGCCAG TCGTCTCCTC CTGGTTTGGA GATGGGAGAA
    GAGACCACAC CCGCCAGGCA GCAAAATTCC TCAGGCCCGT GGCTCTCCTG GATGCCTTTC
    TCTGGGACAC CAGCTTCCTC CCCGTCCTCA GACCATTCTG CACCTTCTGG AATTCCGGAG
    GATTTAAACA ACTCCACTGC TGTCCAACAC CCCTCGGCCC ATGGGACAAA GAGTGCCCGT
    GTTACCACGG TCCCCACCGG TGTTTCTCGG ATGCTCCAAT CTTTAGCTGT CCATCCAGGA
    CCTCTAATGG AGACAGAGCC TTTCTCTGAG GACACTGCAA CTGCCACGAC TTCAGCATCA
    GCCCATTCCT CACCCCCTGA GGCAGAGTCC AGAAGAAACA GTGAAGTGGT CGGGAGCCCA
    GGGGATGGAG CGTTCATGGA ACCATCCACG GAGAATGCAT TTGGCCTTAC ATCTTCCAAG
    GTCTCAGTGG AGTCTGGGCA AAATGATTCC CCAACCTCGG GAGGACTCAG ACTTGCCAGC
    AGCTCTGGTG CTGGGGATGG AAGCCCCGGG TCTCAGACTG AGACCGTGTC CCGGTCAGCC
    CCGTTGGTCA GAGGTGGACA GAGCACTGCT CTCTGGTTCG TGAGCAACAG CGAGACGTTG
    GCAGATGCAC CGGGAAGCTC CACTGCCCAT CCCGAAGTTG AGAATGCTTC CGTGTTGACC
    CGGTTCTCAG CCTTGGCCCC ACAGTCTGGA AGGAGTGACA CCACGCTGGG TGGTGGGAGC
    TCTGAGCCAG ACACCGAGTC CTCCTCCTCG TCTTCCTCTT CCTCAACAAG CCTGGACTCC
    TGGGCACCAC GCGGGGAGCA CTTGATCACT GGAACTAGCT CCGATCCAGT GCACAGCACA
    GACCCTGAGC ACAGGACCTC AGGCGACTAC ACGGACCACA CCTACGTTTC AGCCCCTTTC
    ACCAAAGGGG AACGGACACT GCTGTCCATT GCAGACAACA GTTCAGCTGC AGACCTAAGG
    GAGAGCTCCA CCTCTTCTGT TAAAATCTCA AACGCTTCAC ATCCAGACTC TTCTTCTCCT
    TCTTTGGCTC AGACTGAGAG AGGTAATGCC TCGTCCCACA AAGGGGAGCC CGCCCTGCCT
    TCCACTGAGG TGCAGGTTCT GCACACAGCC CGCCCTCCGT CCCACACACC CACTGTCATC
    TTGCCAAGCA CCCTGGACGC CCATGCTGAC TCTGTGGGTG ACCCGTCATC TTCTTCGTCA
    GGGCCCCCTC TGCCCCCACC CTCAGTGTCA CAGTCCTACC ACTTGTTCTT GTCAACGGTG
    CCATCAACCG GGGCCTCCAC GCACCGACTG CAGTCCACCC CTGATGTGCC CACACCTTTG
    TCTTCCTTGC CACCACCTTC GCCAGCGTCC TTGACGACAT CCACCCCTGC CGAGCCGGCC
    ATCTCACAAA CAACCCTCCC ACCTTTGTCA TCAACCCTGG TCCCGCCCCG GCCAAGGGAC
    GCTCCGGTGA CTTCGGTCTG GACATTGACG ATGGCGTCAT CCGTGGCGGT GCTCCCCAAC
    AGTCAGACAG CAGATCCTAA GATCCAGAGC AACCCAGACC TCGGGAACGT CATTACAGAA
    TCAAAGCTCC CAAGCCTGGA GACTCGGACT CCGGAGGCCA CGGAAGCTGT GACAGTGAGG
    TCTACTCTGA GGATCCCTTC ACCCCCAGCC TTCACAGAGG CCTCCACTGA GCAAACCCCT
    CCAGCCACCA GCACCAGCTT AGCCCAGACA TCTTCAGCTT CAGCAGCCAC CACCCTCAAG
    ATCTCTCATC CCCCCACACC CAGCCCCAGC CCCCCTTCCC GCACAGCCGC CCCCGGGGGT
    GGCCCCACAG CAGCACAGAC AGTGGCTGGA AAGCAGCCCC CACCAACCAG TCCTGAAATG
    CTAGTGGTAC AAGTCTCAAC AGGAGGTGCT GTCATCCCAG AAAAGAGCCA AGCACATGGA
    GATGCTGCCG CTGGGTCAGC CCGCCTGACC AGCACCCCCA CGTCAGCAGA AGAACTGACC
    ACGGAGCGTG GCCGTGCAGG AAACAATAGC CCGGCTCCGC ATTTCCTCAG AACATCTCCT
    GCTCCCCAGA CCACACATAT TTCCACAGCT GAAGTGGTGA CGGCTGCATC AACCACCCCT
    GGTGCTCAGA GCAGCACCCA GTCACCCACC ACACCATCAT CCCCAGTCTC AGTGAACAGC
    TGCACCCCTC GCCCTTGTCT CCACGACGGG AAGTGCGTCG TGGACCCCAC CACCAGCCGT
    GGGCACCGCT GTGTGTGCTC CCCTTCCTGG CAAGGGCAAG ACTGCAGTGT GGATGTGAAT
    GAATGCCTCT CAAACCCCTG CCCACCCCTG GCCACGTGCA ACAATACTCA GGGATCCTTC
    ACCTGCAGAT GCCCAGTGGG GTACCAGCTG GAAAAAGGGA TATGCAATTT GGTCAGAACC
    TTCATGACGG AGTTTAAGTT GAAGAAAACG TTTCTGAATA CCACCATGGA ACAACATGCA
    GACCTCCGTG AGGTTGAAAA TGAGATCACT AAGACGTTAA ATGTGTGTTT TTCAACATTG
    CCTGGTTATA CCCGATCCAC AGCTCATGCT TCTAGGGAGC CCAGTGCAGT GGTGATGTCA
    CTGCAAAGCA CCTTTTCCCT GGCCTCCAAC GTGACGCTGT TCGACCTGGC CGACGGGATG
    CAGAAATGTG TCAATGCCTG CAGGTCCTCT GCTGAGGTCT GCCAGCTCTT GGGGTCTCAG
    AAGCGGATCT TTAGAGCGGG CAGCTTGTGC AAGCGGAAGA CTCCAGAATG TGACAAAGAG
    ACCTCCATCT GTACCGATCT GGATGGGGTC GCACTGTGCC AATGCAAGTC CGGCTACTTC
    CAGTTCAACA AGATGGACCA CTCCTGCCGA GCATGTGAAG ATGGATATAG GCTTGAAAAT
    GAAACCTGTA CGAGTTGCCC ATTCGGCCTT GGTGGTCTCA ACTGTGGAAA CCCTTATCAG
    CTCATCACCG TGGTGATCGC CGCAGCGGGA GGTGGGCTTC TGCTCATTCT GGGCATCGCG
    CTGATTGTTA CCTGCTGCCG AAAGAATAAA AATGACATAA GCAAACTCAT CTTCAAAAGT
    GGGGATTTCC AGATGTCGCC ATATGCTGAG TACCCCAAGA ACCCTCGGTC ACAAGAATGG
    GGCCGAGAAG CTATTGAAAT GCATGAGAAT GGAAGTACCA AAAACCTCCT CCAGATGACT
    GACGTGTATT ACTCGCCCAC AAGTGTCAGG AATCCCGAAC TTGAACGAAA TGGACTCTAC
    CCGGCCTACA CTGGATTGCC AGGATCACGG CATTCTTGCA TTTTTCCCGG ACAGTATAAC
    CCATCTTTCA TCAGCGATGA GAGCAGGAGA AGGGACTACT TCTAA

    The stop codons will be deleted if pcDNA3.1+/C-(K)DYK vector is selected.

    RefSeq XP_020925878.1
    CDS215..4279
    Translation

    Target ORF information:

    RefSeq Version XM_021070219.1
    Organism Sus scrofa(pig)
    Definition Sus scrofa heart development protein with EGF like domains 1 (HEG1), mRNA.

    Target ORF information:

    Epitope DYKDDDDK
    Bacterial selection AMPR
    Mammalian selection NeoR
    Vector pcDNA3.1+/C-(K)DYK
    XM_021070219.1

    ORF Insert Sequence:

    1
    61
    121
    181
    241
    301
    361
    421
    481
    541
    601
    661
    721
    781
    841
    901
    961
    1021
    1081
    1141
    1201
    1261
    1321
    1381
    1441
    1501
    1561
    1621
    1681
    1741
    1801
    1861
    1921
    1981
    2041
    2101
    2161
    2221
    2281
    2341
    2401
    2461
    2521
    2581
    2641
    2701
    2761
    2821
    2881
    2941
    3001
    3061
    3121
    3181
    3241
    3301
    3361
    3421
    3481
    3541
    3601
    3661
    3721
    3781
    3841
    3901
    3961
    4021
    ATGGCCTCGC CGCCGCCGCT CCTGCTGCTG CTGCTACCGC TGCTGCTACT GCCGCCGCCG 
    GCCGCCCCCA AGCCCCGCAA CGAGAGCGAG CCGCCGCCGC CGCCGCCGCC GCCCGCGCTG
    TCCCCGGAGC GCCACGGGCC CGCGCCTACG GGCCACCGCT GGACAGCCGC TGTGCGTGGC
    ACCGCGACGC GGCCCGAACC TTCGGGCCGA GTCCCCCGAG GCGGGAGCAC CGCTGCTATG
    AGGAACCACT GGCCAGAAAG CAACACTGAA CCCCACAAAG AAAACATGAC CTTCGGCCCG
    AGTCAAGTGG ACTTTTCCAC CGTGGTCTCC AAAGAGGGCG TGATGATGGT TCAGACCCCA
    GGAAAGAACC ACACTTCTTC AGAGGCTCCA CAGAACTCCA CCCTGCAAAC TGAAACAGCA
    GCTGCTGAAG GAAGAAATGA CTCTCTAAGA AGATCACATT TCACTGTTTC CCCCGTCGGA
    CCTGACAGAG CAGCAGCTCT GACCTCCCAG AGCATCACCT CAGCCTGGAG AAGGCATCAC
    CTGCCATCCA GCAGTTCAAG GTCAGAAGGA AGAACTTACC CTGCTCACAC GGACAGCGGG
    ACATCACAGG GTTCTCCAGC AGGGGGGCAG AGGCTCCCAG GAGACGTGAC TGTGCACACC
    CAAGTGGCGG CCACTTCGGT TTTGAGCCAG TCGTCTCCTC CTGGTTTGGA GATGGGAGAA
    GAGACCACAC CCGCCAGGCA GCAAAATTCC TCAGGCCCGT GGCTCTCCTG GATGCCTTTC
    TCTGGGACAC CAGCTTCCTC CCCGTCCTCA GACCATTCTG CACCTTCTGG AATTCCGGAG
    GATTTAAACA ACTCCACTGC TGTCCAACAC CCCTCGGCCC ATGGGACAAA GAGTGCCCGT
    GTTACCACGG TCCCCACCGG TGTTTCTCGG ATGCTCCAAT CTTTAGCTGT CCATCCAGGA
    CCTCTAATGG AGACAGAGCC TTTCTCTGAG GACACTGCAA CTGCCACGAC TTCAGCATCA
    GCCCATTCCT CACCCCCTGA GGCAGAGTCC AGAAGAAACA GTGAAGTGGT CGGGAGCCCA
    GGGGATGGAG CGTTCATGGA ACCATCCACG GAGAATGCAT TTGGCCTTAC ATCTTCCAAG
    GTCTCAGTGG AGTCTGGGCA AAATGATTCC CCAACCTCGG GAGGACTCAG ACTTGCCAGC
    AGCTCTGGTG CTGGGGATGG AAGCCCCGGG TCTCAGACTG AGACCGTGTC CCGGTCAGCC
    CCGTTGGTCA GAGGTGGACA GAGCACTGCT CTCTGGTTCG TGAGCAACAG CGAGACGTTG
    GCAGATGCAC CGGGAAGCTC CACTGCCCAT CCCGAAGTTG AGAATGCTTC CGTGTTGACC
    CGGTTCTCAG CCTTGGCCCC ACAGTCTGGA AGGAGTGACA CCACGCTGGG TGGTGGGAGC
    TCTGAGCCAG ACACCGAGTC CTCCTCCTCG TCTTCCTCTT CCTCAACAAG CCTGGACTCC
    TGGGCACCAC GCGGGGAGCA CTTGATCACT GGAACTAGCT CCGATCCAGT GCACAGCACA
    GACCCTGAGC ACAGGACCTC AGGCGACTAC ACGGACCACA CCTACGTTTC AGCCCCTTTC
    ACCAAAGGGG AACGGACACT GCTGTCCATT GCAGACAACA GTTCAGCTGC AGACCTAAGG
    GAGAGCTCCA CCTCTTCTGT TAAAATCTCA AACGCTTCAC ATCCAGACTC TTCTTCTCCT
    TCTTTGGCTC AGACTGAGAG AGGTAATGCC TCGTCCCACA AAGGGGAGCC CGCCCTGCCT
    TCCACTGAGG TGCAGGTTCT GCACACAGCC CGCCCTCCGT CCCACACACC CACTGTCATC
    TTGCCAAGCA CCCTGGACGC CCATGCTGAC TCTGTGGGTG ACCCGTCATC TTCTTCGTCA
    GGGCCCCCTC TGCCCCCACC CTCAGTGTCA CAGTCCTACC ACTTGTTCTT GTCAACGGTG
    CCATCAACCG GGGCCTCCAC GCACCGACTG CAGTCCACCC CTGATGTGCC CACACCTTTG
    TCTTCCTTGC CACCACCTTC GCCAGCGTCC TTGACGACAT CCACCCCTGC CGAGCCGGCC
    ATCTCACAAA CAACCCTCCC ACCTTTGTCA TCAACCCTGG TCCCGCCCCG GCCAAGGGAC
    GCTCCGGTGA CTTCGGTCTG GACATTGACG ATGGCGTCAT CCGTGGCGGT GCTCCCCAAC
    AGTCAGACAG CAGATCCTAA GATCCAGAGC AACCCAGACC TCGGGAACGT CATTACAGAA
    TCAAAGCTCC CAAGCCTGGA GACTCGGACT CCGGAGGCCA CGGAAGCTGT GACAGTGAGG
    TCTACTCTGA GGATCCCTTC ACCCCCAGCC TTCACAGAGG CCTCCACTGA GCAAACCCCT
    CCAGCCACCA GCACCAGCTT AGCCCAGACA TCTTCAGCTT CAGCAGCCAC CACCCTCAAG
    ATCTCTCATC CCCCCACACC CAGCCCCAGC CCCCCTTCCC GCACAGCCGC CCCCGGGGGT
    GGCCCCACAG CAGCACAGAC AGTGGCTGGA AAGCAGCCCC CACCAACCAG TCCTGAAATG
    CTAGTGGTAC AAGTCTCAAC AGGAGGTGCT GTCATCCCAG AAAAGAGCCA AGCACATGGA
    GATGCTGCCG CTGGGTCAGC CCGCCTGACC AGCACCCCCA CGTCAGCAGA AGAACTGACC
    ACGGAGCGTG GCCGTGCAGG AAACAATAGC CCGGCTCCGC ATTTCCTCAG AACATCTCCT
    GCTCCCCAGA CCACACATAT TTCCACAGCT GAAGTGGTGA CGGCTGCATC AACCACCCCT
    GGTGCTCAGA GCAGCACCCA GTCACCCACC ACACCATCAT CCCCAGTCTC AGTGAACAGC
    TGCACCCCTC GCCCTTGTCT CCACGACGGG AAGTGCGTCG TGGACCCCAC CACCAGCCGT
    GGGCACCGCT GTGTGTGCTC CCCTTCCTGG CAAGGGCAAG ACTGCAGTGT GGATGTGAAT
    GAATGCCTCT CAAACCCCTG CCCACCCCTG GCCACGTGCA ACAATACTCA GGGATCCTTC
    ACCTGCAGAT GCCCAGTGGG GTACCAGCTG GAAAAAGGGA TATGCAATTT GGTCAGAACC
    TTCATGACGG AGTTTAAGTT GAAGAAAACG TTTCTGAATA CCACCATGGA ACAACATGCA
    GACCTCCGTG AGGTTGAAAA TGAGATCACT AAGACGTTAA ATGTGTGTTT TTCAACATTG
    CCTGGTTATA CCCGATCCAC AGCTCATGCT TCTAGGGAGC CCAGTGCAGT GGTGATGTCA
    CTGCAAAGCA CCTTTTCCCT GGCCTCCAAC GTGACGCTGT TCGACCTGGC CGACGGGATG
    CAGAAATGTG TCAATGCCTG CAGGTCCTCT GCTGAGGTCT GCCAGCTCTT GGGGTCTCAG
    AAGCGGATCT TTAGAGCGGG CAGCTTGTGC AAGCGGAAGA CTCCAGAATG TGACAAAGAG
    ACCTCCATCT GTACCGATCT GGATGGGGTC GCACTGTGCC AATGCAAGTC CGGCTACTTC
    CAGTTCAACA AGATGGACCA CTCCTGCCGA GCATGTGAAG ATGGATATAG GCTTGAAAAT
    GAAACCTGTA CGAGTTGCCC ATTCGGCCTT GGTGGTCTCA ACTGTGGAAA CCCTTATCAG
    CTCATCACCG TGGTGATCGC CGCAGCGGGA GGTGGGCTTC TGCTCATTCT GGGCATCGCG
    CTGATTGTTA CCTGCTGCCG AAAGAATAAA AATGACATAA GCAAACTCAT CTTCAAAAGT
    GGGGATTTCC AGATGTCGCC ATATGCTGAG TACCCCAAGA ACCCTCGGTC ACAAGAATGG
    GGCCGAGAAG CTATTGAAAT GCATGAGAAT GGAAGTACCA AAAACCTCCT CCAGATGACT
    GACGTGTATT ACTCGCCCAC AAGTGTCAGG AATCCCGAAC TTGAACGAAA TGGACTCTAC
    CCGGCCTACA CTGGATTGCC AGGATCACGG CATTCTTGCA TTTTTCCCGG ACAGTATAAC
    CCATCTTTCA TCAGCGATGA GAGCAGGAGA AGGGACTACT TCTAA

    The stop codons will be deleted if pcDNA3.1+/C-(K)DYK vector is selected.

    CloneID OSe42307
    Clone ID Related Accession (Same CDS sequence) XM_013982369.1
    Accession Version XM_013982369.1 Latest version! Documents for ORF clone product in default vector
    Sequence Information ORF Nucleotide Sequence (Length: 4284bp)
    Protein sequence
    SNP
    Vector pcDNA3.1-C-(k)DYK or customized vector User Manual
    Clone information Clone Map MSDS
    Tag on pcDNA3.1+/C-(K)DYK C terminal DYKDDDDK tags
    ORF Insert Method CloneEZ™ Seamless cloning technology
    Insert Structure linear
    Update Date 2015-09-10
    Organism Sus scrofa(Pig)
    Product LOW QUALITY PROTEIN: protein HEG homolog 1
    Comment MODEL REFSEQ: This record is predicted by automated computational analysis. This record is derived from a genomic sequence (NW_003611795.1) annotated using gene prediction method: Gnomon, supported by mRNA and EST evidence. Also see: Documentation of NCBI's Annotation Process ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Status :: Full annotation Annotation Version :: Sus scrofa Annotation Release 105 Annotation Pipeline :: NCBI eukaryotic genome annotation pipeline Annotation Software Version :: 6.4 Annotation Method :: Best-placed RefSeq; Gnomon Features Annotated :: Gene; mRNA; CDS; ncRNA ##Genome-Annotation-Data-END## ##RefSeq-Attributes-START## frameshifts :: corrected 1 indel ##RefSeq-Attributes-END##

    1
    61
    121
    181
    241
    301
    361
    421
    481
    541
    601
    661
    721
    781
    841
    901
    961
    1021
    1081
    1141
    1201
    1261
    1321
    1381
    1441
    1501
    1561
    1621
    1681
    1741
    1801
    1861
    1921
    1981
    2041
    2101
    2161
    2221
    2281
    2341
    2401
    2461
    2521
    2581
    2641
    2701
    2761
    2821
    2881
    2941
    3001
    3061
    3121
    3181
    3241
    3301
    3361
    3421
    3481
    3541
    3601
    3661
    3721
    3781
    3841
    3901
    3961
    4021
    4081
    4141
    4201
    4261
    ATGGCCTCGC CGCCGCCGCT CCTGCTGCTG CTGCTACCGC TGCTGCTACT GCCGCCGCCG 
    GCCGCCCCCA AGCCCCGCAA CGAGAGCGAG CCGCCGCCGC CGCCGCCGCC GCCCGCGCTG
    TCCCCGGAGC GCCACGGGCC CGCGCCTACG GGCCACCGCT GGACAGCCGC TGTGCGTGGC
    ACCGCGACGC GGCCCGAACC TTCGGGCCGA GTCCCCCGAG GCGGGAGCAC CGCTGCTATG
    AGGAACCACT GGCCAGAAAG CAACACTGAA CCCCACAAAG AAAACATGAC CTTCGGCCCG
    AGTCAAGTGG ACTTTTCCAC CGTAGTCTCC AAAGAGGGCG TGATGATGGT TCAGACCCCA
    GGAAAGAACC ACACTTCTTC AGAGGCTCCA CAGAACTCCA CCCTGCAAAC TGAAACAGCA
    GCTGCTGAAG GAAGAAATGA CTCTCTAAGA AGATCACATT TCACTGTTTC CCCCGTCAGA
    CCTGACAGAG CAGCAGCTCT GACCTCCCAG AGCATCACCT CAGCCTGGAG AAGGCATCAC
    CTGCCATCCA GCAGTTCAAG GTCAGAAGGA AGAACTTACC CTGCTCACAC GGACGACGGG
    ACATCGCAGG GTTCTCCGGC CGGGGGGCAG AGGCTCCCAG GAGACGTGAC TGTGCACACC
    CAGGTGGCGG CCACTTCGGT TTTGAGCCAG TCGTCTCCTC CTGGTTTGGA GATGGGAGAA
    GAGACCACAC CCGCCAGGCA GCAAAATTCC TCAGGCCCAT GGCTCTCCTG GATGCCTTTC
    TCTGGGACAC CAGCTTCCTC CCCGCCCTCA GACCATTCTG CACCTTCTGG AATTCCGGAG
    GATTTAAACA ACTCCACTGC TGTCCAACAC CCCTCGGCCC ATGGGACAAA GAGTGCCCGT
    GTTACCACGG TCCCCACCGG TGTTTCTCGG ATGCTCCAAT CTTTAGCTGT CCATCCAGGA
    CCTCTAATGG AGACAGAGCC TTTCTCTGAG GACACTGCAA CTGCCACGAC TTCAGCATCA
    GCCCATTCCT CACCCCCTGA GGCAGAGTCC AGAAGAAACA GTGAAGTGGT CGGGAGCCCA
    GGGGATGGAG CGTTCATGGA ACCATCCACG GAGAATGCAT TTGGCCTTAC ATCTTCCAAG
    GTCTCAGTGG AGTCTGGGCA AAATGATTCC CCAACCTCGG GAGGACTCAG ACTTGCCAGC
    AGCTCTGGTG CCGGGGATGG AAGCCCCGGG TCTCAGACTG AGACCGTGTC CCGGTCAGCC
    CCGTTGGTCA GAGGTGGACA GAGCACTGCT CTCTGGTTCG TGAGCAACAG CGAGACGTTG
    GCAGATGCAC CGGGAAGCTC CACTGCCCAT CCCGAAGTTG AGAATGCTTC CGTGTTGACC
    CGGTTCTCAG CCTCGGCCCC ACAGTCTGGA AGGAGTGACA CCACGCTGGG TGGTGGGAGC
    TCTGAGCCAG ACACCGAGTC CTCCTCCTCG TCTTCCTCTT CCTCAACAAG CCTGGACTCC
    TGGGCACCAC ACGGGGAGCA CTTGACCTCG GGAGGACTCA GACTTGCCAG CAGCTCTGGT
    GCCGGGGATG GAAGCCCCGG GTCTCAGACT GAGACCGTGT CCCGGTCAGC CCCGTTGGTC
    AGAGGTGGAC AGAGCACTGC TCTCTGGTTC GTGAGCAACA GCGAGACGTT GGCAGATGCA
    CCGGGAAGCT CCACTGCCCA TCCCGAAGTT GAGAATGCTT CCGTGTTGAC CCGGTTCTCA
    GCCTCGGCCC CACAGTCTGG AAGGAGTGAC ACCACGCTGG GTGGTGGGAG CTCTGAGCCA
    GACACCGAGT CCTCCTCCTC GTCTTCCTCT TCCTCAACAA GCCTGGACTC CTGGGCACCA
    CACGGGGAGC ACTTGACCAC AGAAGACAGC CCCGAGCTGG GCGTCGGTTC TGAATCGGAG
    GAAGGAGCTT CCGAGGGGGC ATCCAGAACC CGCGCTCACC GCCCGCACAC TCTGGCCACC
    CTCACCGGGA TCGGAGAGCG CACGCTGCGG TCTCTCACCA ACGGGAGCAC CACTCCTGGG
    GATGCGGGAC GATCGGAGGC CGAGGACACG GAGAGCGCCA CTCTGCAGGG GAACGTGACC
    GCCGCTGGGG ACTCCCACCT GGTCTCCAGC TCCCTGGCAG CCTCGCGCAC CCTCGGAGTC
    ACTGGAACTA GCTCCGATCC AGTGCACAGC ACAGACCCTG AGCACAGGAC CTCAGGCGAC
    TACACGGACC ACACCTACGT TTCAGCCCCT TTCACCAAAG GGGAACGGAC ACTGCTGTCC
    ATTGCAGACA ACAGTTCAGC TGCAGACCTA AGGGAGAGCT CCACCTCTTC TGTTAAAATC
    TCAAACGCTT CACATCCAGA CTCTTCTTCT CCTTCTTTGG CTCAGACTGA GAGAGGTAAT
    GCCTCGTCCC ACAAAGGGGA GCCCGCCCTG CCTTCCACTG AGGTGCAGGT TCTGCACACA
    GCCCGCCCTC CGTCCCACAC ACCCACTGTC ATCTTGCCAA GCACCCTGGA CGCCCATGCT
    GACTCTGTGG GTGACCCATC ATCTTCTTCA TCAGGGCCCC CTCTGCCCCC ACCCTCAGTG
    TCACAGTCCT ACCACTTGTT CTTGTCAACG GTGCCATCAA CCGGGGCCTC CACGCACCGA
    CTGCAGTCCA CCCCTGATGT GCCCACACCT TTGTCTTCCT TGCCACCACC TTCGCCAGCG
    TCCTTGACGA CATCCACCCC TGCCGAGCCG GCCATCTCAC AAACAACCCT CCCACCTTTG
    TCATCAACCC TGGTCCCGCC CCGGCCAAGG GACGCTCCGG TGACTTCGGT CTGGACATTG
    ACGATGGCGT CATCCGTGGC GGTGCTCCCC AACAGTCAGA CAGCAGATCC TAAGATCCAG
    AGCAACCCAG ACCTCGGGAA CGTCATTACA GAATCAAAGC TCCCAAGCCT GGAGACTCGG
    ACCCCGGAGG CCACGGAAGC TGTGACAGTG AGGTCTACTC TGAGGATCCC TTCACCCCCA
    GCCTTCACAG AGGCCTCCAC TGAGCAAACC CCTCCAGCCA CCAGCACCAG CTTAGCCCAG
    ACATCTTCAG CTTCAGCAGC CACCACCCTC AAGATCTCTC ATCCCCCCAC ACCCAGCCCC
    AGCCCCCCTC CCCGCACAGC CGCCCCCGGG GGTGGCCCCA CAGCAGCACA GACAGTGGCT
    GGAAAGCAGC CCCCACCAAC CAGTCCTGAA ATGCTAGTGG TACAAGTCTC AACAGGAGGT
    GCTGTCATCC CAGAAAAGAG CCAAGCACAT GGAGATGCTG CCGCTGGGTC AGCCCGCCTG
    ACCAGCGCCC CCACGTCAGC AGAAGAACTG ACCACGGAGC GTGGCCGTGC AGGAGAGAAC
    AGCCCGGCTC CGCATTTCCT CAGAACATCT CCTGCTCCCC AGACCACACA TATTTCCACA
    GCTGAAGTGG TGACGGCTGC ATCAACCACC CCTGGTGCTC AGAACAGCAC CCAGTCACCC
    ACCACACCAT CATCCCCAGC CTCAGTGAAC AGCTGCACCC CTCGCCCTTG TCTCCACGAC
    GGGAAGTGCG TCGTGGACCC CACCACCAGC CGTGGGCACC GCTGTGTGTG CTCCCCTTCC
    TGGCAAGGGC AAGACTGCAG TGTGGATGTG AATGAATGCC TCTCAAACCC CTGCCCACCC
    CTGGCCACGT GCAACAATAC TCAGGGATCC TTCACCTGCA GATGCCCAGT GGGGTACCAG
    CTGGAAAAAG GGATATGCAA TTTGGTCAGA ACCTTCATGA CGGAGTTTAA GTTGAAGAAA
    ACGTTTCTGA ATACCACCAT GGAACAACAT GCAGACCTCC GTGAGGTTGA AAATGAGATC
    ACTAAGACGT TAAATGTGTG TTTTTCAACA TTGCCTGGTT ATACCCGATC CACAGCTCAT
    GCTTCTAGGG AGCCCAGTGC AGTGGTGATG TCACTGCAAA GCACCTTTTC CCTGGCCTCC
    AACGTGACGC TGTTCGACCT GGCCGACGGG ATGCAGAAAT GTGTCAATGC CTGCAGGTCC
    TCTGCTGAGG TCTGCCAGCT CTTGGGGTCT CAGAAGCGGA TCTTTAGAGC GGGCAGCTTG
    TGCAAGCGGA AGACTCCAGA ATGTGACAAA GAGACCTCCA TCTGTACCGA TCTGGATGGG
    GTCGCACTGT GCCAATGCAA GTCCGGCTAC TTCCAGTTCA ACAAGATGGA CCACTCCTGC
    CGAGGTACCG GCAGCTCTGG GGTCCTGTGG CCTGAGAGAC GGGGAAGGTG CTTTTCTCTG
    CCTGGGTTGC TCTCTCTGCT CTAG

    The stop codons will be deleted if pcDNA3.1+/C-(K)DYK vector is selected.

    RefSeq XP_013837823.1
    CDS216..4499
    Translation

    Target ORF information:

    RefSeq Version XM_013982369.1
    Organism Sus scrofa(Pig)
    Definition Sus scrofa HEG homolog 1 (zebrafish) (HEG1), mRNA.

    Target ORF information:

    Epitope DYKDDDDK
    Bacterial selection AMPR
    Mammalian selection NeoR
    Vector pcDNA3.1+/C-(K)DYK
    XM_013982369.1

    ORF Insert Sequence:

    1
    61
    121
    181
    241
    301
    361
    421
    481
    541
    601
    661
    721
    781
    841
    901
    961
    1021
    1081
    1141
    1201
    1261
    1321
    1381
    1441
    1501
    1561
    1621
    1681
    1741
    1801
    1861
    1921
    1981
    2041
    2101
    2161
    2221
    2281
    2341
    2401
    2461
    2521
    2581
    2641
    2701
    2761
    2821
    2881
    2941
    3001
    3061
    3121
    3181
    3241
    3301
    3361
    3421
    3481
    3541
    3601
    3661
    3721
    3781
    3841
    3901
    3961
    4021
    4081
    4141
    4201
    4261
    ATGGCCTCGC CGCCGCCGCT CCTGCTGCTG CTGCTACCGC TGCTGCTACT GCCGCCGCCG 
    GCCGCCCCCA AGCCCCGCAA CGAGAGCGAG CCGCCGCCGC CGCCGCCGCC GCCCGCGCTG
    TCCCCGGAGC GCCACGGGCC CGCGCCTACG GGCCACCGCT GGACAGCCGC TGTGCGTGGC
    ACCGCGACGC GGCCCGAACC TTCGGGCCGA GTCCCCCGAG GCGGGAGCAC CGCTGCTATG
    AGGAACCACT GGCCAGAAAG CAACACTGAA CCCCACAAAG AAAACATGAC CTTCGGCCCG
    AGTCAAGTGG ACTTTTCCAC CGTAGTCTCC AAAGAGGGCG TGATGATGGT TCAGACCCCA
    GGAAAGAACC ACACTTCTTC AGAGGCTCCA CAGAACTCCA CCCTGCAAAC TGAAACAGCA
    GCTGCTGAAG GAAGAAATGA CTCTCTAAGA AGATCACATT TCACTGTTTC CCCCGTCAGA
    CCTGACAGAG CAGCAGCTCT GACCTCCCAG AGCATCACCT CAGCCTGGAG AAGGCATCAC
    CTGCCATCCA GCAGTTCAAG GTCAGAAGGA AGAACTTACC CTGCTCACAC GGACGACGGG
    ACATCGCAGG GTTCTCCGGC CGGGGGGCAG AGGCTCCCAG GAGACGTGAC TGTGCACACC
    CAGGTGGCGG CCACTTCGGT TTTGAGCCAG TCGTCTCCTC CTGGTTTGGA GATGGGAGAA
    GAGACCACAC CCGCCAGGCA GCAAAATTCC TCAGGCCCAT GGCTCTCCTG GATGCCTTTC
    TCTGGGACAC CAGCTTCCTC CCCGCCCTCA GACCATTCTG CACCTTCTGG AATTCCGGAG
    GATTTAAACA ACTCCACTGC TGTCCAACAC CCCTCGGCCC ATGGGACAAA GAGTGCCCGT
    GTTACCACGG TCCCCACCGG TGTTTCTCGG ATGCTCCAAT CTTTAGCTGT CCATCCAGGA
    CCTCTAATGG AGACAGAGCC TTTCTCTGAG GACACTGCAA CTGCCACGAC TTCAGCATCA
    GCCCATTCCT CACCCCCTGA GGCAGAGTCC AGAAGAAACA GTGAAGTGGT CGGGAGCCCA
    GGGGATGGAG CGTTCATGGA ACCATCCACG GAGAATGCAT TTGGCCTTAC ATCTTCCAAG
    GTCTCAGTGG AGTCTGGGCA AAATGATTCC CCAACCTCGG GAGGACTCAG ACTTGCCAGC
    AGCTCTGGTG CCGGGGATGG AAGCCCCGGG TCTCAGACTG AGACCGTGTC CCGGTCAGCC
    CCGTTGGTCA GAGGTGGACA GAGCACTGCT CTCTGGTTCG TGAGCAACAG CGAGACGTTG
    GCAGATGCAC CGGGAAGCTC CACTGCCCAT CCCGAAGTTG AGAATGCTTC CGTGTTGACC
    CGGTTCTCAG CCTCGGCCCC ACAGTCTGGA AGGAGTGACA CCACGCTGGG TGGTGGGAGC
    TCTGAGCCAG ACACCGAGTC CTCCTCCTCG TCTTCCTCTT CCTCAACAAG CCTGGACTCC
    TGGGCACCAC ACGGGGAGCA CTTGACCTCG GGAGGACTCA GACTTGCCAG CAGCTCTGGT
    GCCGGGGATG GAAGCCCCGG GTCTCAGACT GAGACCGTGT CCCGGTCAGC CCCGTTGGTC
    AGAGGTGGAC AGAGCACTGC TCTCTGGTTC GTGAGCAACA GCGAGACGTT GGCAGATGCA
    CCGGGAAGCT CCACTGCCCA TCCCGAAGTT GAGAATGCTT CCGTGTTGAC CCGGTTCTCA
    GCCTCGGCCC CACAGTCTGG AAGGAGTGAC ACCACGCTGG GTGGTGGGAG CTCTGAGCCA
    GACACCGAGT CCTCCTCCTC GTCTTCCTCT TCCTCAACAA GCCTGGACTC CTGGGCACCA
    CACGGGGAGC ACTTGACCAC AGAAGACAGC CCCGAGCTGG GCGTCGGTTC TGAATCGGAG
    GAAGGAGCTT CCGAGGGGGC ATCCAGAACC CGCGCTCACC GCCCGCACAC TCTGGCCACC
    CTCACCGGGA TCGGAGAGCG CACGCTGCGG TCTCTCACCA ACGGGAGCAC CACTCCTGGG
    GATGCGGGAC GATCGGAGGC CGAGGACACG GAGAGCGCCA CTCTGCAGGG GAACGTGACC
    GCCGCTGGGG ACTCCCACCT GGTCTCCAGC TCCCTGGCAG CCTCGCGCAC CCTCGGAGTC
    ACTGGAACTA GCTCCGATCC AGTGCACAGC ACAGACCCTG AGCACAGGAC CTCAGGCGAC
    TACACGGACC ACACCTACGT TTCAGCCCCT TTCACCAAAG GGGAACGGAC ACTGCTGTCC
    ATTGCAGACA ACAGTTCAGC TGCAGACCTA AGGGAGAGCT CCACCTCTTC TGTTAAAATC
    TCAAACGCTT CACATCCAGA CTCTTCTTCT CCTTCTTTGG CTCAGACTGA GAGAGGTAAT
    GCCTCGTCCC ACAAAGGGGA GCCCGCCCTG CCTTCCACTG AGGTGCAGGT TCTGCACACA
    GCCCGCCCTC CGTCCCACAC ACCCACTGTC ATCTTGCCAA GCACCCTGGA CGCCCATGCT
    GACTCTGTGG GTGACCCATC ATCTTCTTCA TCAGGGCCCC CTCTGCCCCC ACCCTCAGTG
    TCACAGTCCT ACCACTTGTT CTTGTCAACG GTGCCATCAA CCGGGGCCTC CACGCACCGA
    CTGCAGTCCA CCCCTGATGT GCCCACACCT TTGTCTTCCT TGCCACCACC TTCGCCAGCG
    TCCTTGACGA CATCCACCCC TGCCGAGCCG GCCATCTCAC AAACAACCCT CCCACCTTTG
    TCATCAACCC TGGTCCCGCC CCGGCCAAGG GACGCTCCGG TGACTTCGGT CTGGACATTG
    ACGATGGCGT CATCCGTGGC GGTGCTCCCC AACAGTCAGA CAGCAGATCC TAAGATCCAG
    AGCAACCCAG ACCTCGGGAA CGTCATTACA GAATCAAAGC TCCCAAGCCT GGAGACTCGG
    ACCCCGGAGG CCACGGAAGC TGTGACAGTG AGGTCTACTC TGAGGATCCC TTCACCCCCA
    GCCTTCACAG AGGCCTCCAC TGAGCAAACC CCTCCAGCCA CCAGCACCAG CTTAGCCCAG
    ACATCTTCAG CTTCAGCAGC CACCACCCTC AAGATCTCTC ATCCCCCCAC ACCCAGCCCC
    AGCCCCCCTC CCCGCACAGC CGCCCCCGGG GGTGGCCCCA CAGCAGCACA GACAGTGGCT
    GGAAAGCAGC CCCCACCAAC CAGTCCTGAA ATGCTAGTGG TACAAGTCTC AACAGGAGGT
    GCTGTCATCC CAGAAAAGAG CCAAGCACAT GGAGATGCTG CCGCTGGGTC AGCCCGCCTG
    ACCAGCGCCC CCACGTCAGC AGAAGAACTG ACCACGGAGC GTGGCCGTGC AGGAGAGAAC
    AGCCCGGCTC CGCATTTCCT CAGAACATCT CCTGCTCCCC AGACCACACA TATTTCCACA
    GCTGAAGTGG TGACGGCTGC ATCAACCACC CCTGGTGCTC AGAACAGCAC CCAGTCACCC
    ACCACACCAT CATCCCCAGC CTCAGTGAAC AGCTGCACCC CTCGCCCTTG TCTCCACGAC
    GGGAAGTGCG TCGTGGACCC CACCACCAGC CGTGGGCACC GCTGTGTGTG CTCCCCTTCC
    TGGCAAGGGC AAGACTGCAG TGTGGATGTG AATGAATGCC TCTCAAACCC CTGCCCACCC
    CTGGCCACGT GCAACAATAC TCAGGGATCC TTCACCTGCA GATGCCCAGT GGGGTACCAG
    CTGGAAAAAG GGATATGCAA TTTGGTCAGA ACCTTCATGA CGGAGTTTAA GTTGAAGAAA
    ACGTTTCTGA ATACCACCAT GGAACAACAT GCAGACCTCC GTGAGGTTGA AAATGAGATC
    ACTAAGACGT TAAATGTGTG TTTTTCAACA TTGCCTGGTT ATACCCGATC CACAGCTCAT
    GCTTCTAGGG AGCCCAGTGC AGTGGTGATG TCACTGCAAA GCACCTTTTC CCTGGCCTCC
    AACGTGACGC TGTTCGACCT GGCCGACGGG ATGCAGAAAT GTGTCAATGC CTGCAGGTCC
    TCTGCTGAGG TCTGCCAGCT CTTGGGGTCT CAGAAGCGGA TCTTTAGAGC GGGCAGCTTG
    TGCAAGCGGA AGACTCCAGA ATGTGACAAA GAGACCTCCA TCTGTACCGA TCTGGATGGG
    GTCGCACTGT GCCAATGCAA GTCCGGCTAC TTCCAGTTCA ACAAGATGGA CCACTCCTGC
    CGAGGTACCG GCAGCTCTGG GGTCCTGTGG CCTGAGAGAC GGGGAAGGTG CTTTTCTCTG
    CCTGGGTTGC TCTCTCTGCT CTAG

    The stop codons will be deleted if pcDNA3.1+/C-(K)DYK vector is selected.