F12438

RIKEN Arabidopsis full-length cDNA overexpressed Arabidopsis lines

Original: F12438

Phenotypes

Morphological phenotypes

Generation: T2
Anatomical entity / structure development stage: seed (PO:0009010)
Phenotypes:
  • increased size (PATO:0000586)
    (Seed::Shape::large)
  • increased length (PATO:0000573)
    (Seed::Shape::long)

Invisible phenotypes

Specific phenotypes were NOT observed.

Determination of introduced cDNA(s)

  • RAFL09-31-J04 - RIKEN Arabidopsis Full-length cDNAs - Original: RAFL09-31-J04 - Resource: pda05864

    Nucleotide sequence: RAFL09-31-J04 - full reading
    CCGTTCGTCC TTCTCACAAG TCTGATTGCG GAAAAAGCAG AGAGAGAGAG 
    AAAGTTCGAG CGGAAGAGAA GCGGAAAGCT CGAGGAGTCA TCAATGGTGA
    CGGACGATAG CAACTCCTCT GGACGAATCA AGTCTCATGT AGATGATGAT
    GATGATGGTG AAGAAGAAGA AGATAGACTC GAGGGTTTGG AAAACAGATT
    AAGTGAGCTT AAAAGGAAAA TTCAAGGAGA AAGAGTTAGG TCTATTAAAG
    AGAAATTTGA GGCTAATAGA AAGAAAGTGG ATGCTCATGT TTCTCCCTTT
    TCATCTGCTG CATCGAGCCG AGCTACCGCA GAGGATAATG GAAATAGCAA
    TATGCTTTCT TCGAGAATGA GAATGCCACT CTGCAAGTTA AATGGTTTTT
    CTCATGGTGT GGGAGATAGA GACTATGTTC CTACTAAGGA TGTTATATCA
    GCAAGTGTCA AGCTTCCTAT TGCTGAGAGA ATACCGCCAT ACACTACCTG
    GATATTTTTG GACAGAAATC AAAGAATGGC TGAAGATCAG TCTGTGGTTG
    GTCGAAGACA AATCTACTAT GAACAACATG GTGGTGAGAC GCTAATATGC
    AGCGATAGTG AGGAAGAACC AGAACCTGAG GAGGAAAAAC GTGAATTTTC
    CGAGGGTGAA GATTCCATTA TATGGTTAAT TGGGCAGGAG TATGGCATGG
    GTGAGGAAGT GCAGGATGCC CTTTGCCAGT TGCTAAGCGT AGATGCTTCT
    GATATCCTAG AAAGATACAA TGAGCTCAAG TTGAAGGATA AGCAGAATAC
    CGAGGAATTT TCTAATTCCG GATTCAAGCT GGGAATATCT CTGGAAAAGG
    GCCTTGGTGC AGCTCTAGAT TCTTTTGATA ATCTTTTCTG CCGCCGTTGC
    TTGGTATTTG ACTGTCGTCT GCATGGATGT TCTCAGCCTT TGATTAGTGC
    TAGTGAAAAA CAGCCTTATT GGTCTGATTA TGAAGGTGAT AGGAAACCCT
    GCAGCAAACA TTGTTACCTC CAGCTCAAGG CGGTCAGAGA AGTACCAGAA
    ACATGCAGTA ATTTTGCATC TAAAGCAGAA GAGAAAGCTT CAGAAGAGGA
    ATGCAGCAAG GCTGTCTCCT CTGATGTTCC CCATGCTGCT GCTAGTGGTG
    TCAGTCTGCA AGTTGAGAAG ACTGATATTG GTATCAAGAA TGTAGATTCA
    TCCTCTGGTG TAGAACAAGA GCATGGAATT AGAGGAAAGC GTGAGGTCCC
    AATTCTAAAA GACTCCAATG ATCTGCCTAA TTTATCGAAC AAGAAACAGA
    AGACCGCAGC CTCAGATACA AAAATGTCAT TTGTTAATTC TGTCCCTAGC
    TTAGATCAGG CATTGGATAG CACAAAGGGT GATCAAGGTG GAACAACTGA
    CAATAAAGTA AACAGAGACT CAGAAGCTGA TGCAAAAGAA GTAGGTGAGC
    CTATTCCAGA CAATTCGGTC CATGATGGTG GTTCCTCAAT TTGTCAGCCA
    CACCATGGTA GTGGAAACGG AGCAATAATC ATTGCAGAAA TGTCTGAGAC
    AAGTCGACCA TCTACAGAGT GGAATCCTAT CGAGAAGGAT CTTTACTTGA
    AGGGAGTCGA AATCTTTGGA AGAAACAGCT GTCTTATTGC AAGAAACCTG
    CTTTCTGGCT TGAAGACATG CCTAGATGTG TCCAATTACA TGCGTGAAAA
    CGAAGTTTCA GTTTTTCGAA GATCTAGTAC CCCAAATTTG CTGTTGGATG
    ATGGCAGGAC TGACCCAGGG AATGATAATG ATGAGGTGCC TCCAAGGACA
    AGATTGTTCC GTAGAAAAGG CAAAACCCGG AAGCTAAAAT ACTCTACAAA
    GTCTGCTGGT CATCCGTCTG TCTGGAAAAG AATAGCTGGT GGCAAAAACC
    AGTCCTGTAA ACAATACACG CCGTGTGGAT GCCTGTCAAT GTGCGGAAAG
    GATTGCCCTT GTCTAACTAA TGAAACTTGC TGCGAGAAAT ATTGCGGGTG
    CTCAAAAAGC TGTAAAAATC GTTTCCGAGG ATGTCATTGT GCAAAGAGTC
    AATGCAGAAG TAGGCAGTGT CCCTGCTTTG CTGCTGGCAG AGAATGTGAT
    CCAGATGTTT GCAGAAATTG CTGGGTTAGT TGTGGAGATG GTTCTCTCGG
    TGAAGCACCA AGACGCGGAG AAGGGCAATG CGGAAACATG AGACTTCTCC
    TGAGGCAACA ACAGAGGATC CTATTGGGAA AGTCTGATGT TGCTGGATGG
    GGTGCTTTTC TAAAGAACTC GGTCAGCAAA AATGAATACC TTGGAGAATA
    CACCGGTGAA TTGATCTCAC ACCATGAGGC GGATAAGCGT GGGAAAATAT
    ATGACCGGGC AAATTCGTCC TTCCTCTTTG ACTTGAATGA TCAGTACGTC
    CTCGATGCTC AACGCAAAGG TGACAAGCTG AAATTTGCCA ATCACTCAGC
    TAAACCCAAT TGCTACGCTA AGGTGATGTT TGTAGCAGGA GATCACAGGG
    TCGGGATTTT TGCAAACGAA CGAATAGAAG CTAGCGAAGA GCTTTTCTAT
    GACTATAGAT ATGGACCAGA CCAAGCACCA GTGTGGGCTC GCAAACCTGA
    AGGCTCCAAG AAAGATGATT CAGCCATTAC TCATCGTAGA GCCAGAAAGC
    ACCAATCTCA TTGATGATTA CTGGCTAAGA GAAGTAACTT TTATAAAAAT
    AACTTATAGA GTTGTGAGAG ATGATATTTG AAGTTTGATA ACTTAAGCTT
    GTCTTTATTA ATTAATTATT ATAGAGTTGA GATTTTATTC T
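
The listing above is printed in whitespace-separated blocks of ten bases, so it has to be joined before it can be used in downstream tools. The following is a minimal sketch of that step, not part of the original record; the helper names `join_sequence_blocks` and `gc_fraction` are hypothetical, and only the first line of the listing is used as demonstration input.

```python
def join_sequence_blocks(text: str) -> str:
    """Collapse whitespace-separated blocks into one uppercase sequence.

    Assumption: the input contains only nucleotide letters (A, C, G, T, N)
    plus spaces and line breaks, as in the block-formatted listing.
    """
    seq = "".join(text.split()).upper()
    invalid = set(seq) - set("ACGTN")
    if invalid:
        raise ValueError(f"unexpected characters: {sorted(invalid)}")
    return seq


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases; 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the RAFL09-31-J04 listing, used as a small demonstration input.
first_line = "CCGTTCGTCC TTCTCACAAG TCTGATTGCG GAAAAAGCAG AGAGAGAGAG"
seq = join_sequence_blocks(first_line)
print(len(seq), gc_fraction(seq))
```

Passing the complete listing to `join_sequence_blocks` in the same way would give a single contiguous string suitable for FASTA export or a similarity search.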

    Gene models with high sequence identity

    1. AT4G02020.1 - SET domain-containing protein
      E-value: 0; Score: 99.96

     

    InterPro Scan Digest

    Program      Description             E-value
    superfamily  SET domain              5e-61
    Coil         coiled-coil             0
    Coil         coiled-coil             0
    ProfileScan  SET domain              30.465
    HMMPanther   ENHANCER OF ZESTE, EZH  2.7e-208
    HMMPanther   SET DOMAIN PROTEINS     2.7e-208
    HMMPfam      SET domain              3.8e-17
    HMMSmart     SANT, DNA-binding       0.0066
    HMMSmart     SET domain              1.4e-33

    AT4G02020.1

    Model type
    Protein coding
    Short Description
    SET domain-containing protein
    Curator Summary
    Encodes a polycomb group protein. Forms part of a large protein complex that can include VRN2 (VERNALIZATION 2), VIN3 (VERNALIZATION INSENSITIVE 3) and the polycomb group proteins FERTILIZATION INDEPENDENT ENDOSPERM (FIE) and CURLY LEAF (CLF). The complex has a role in establishing FLC (FLOWERING LOCUS C) repression during vernalization. Performs a partially redundant role to MEA in controlling seed initiation by helping to suppress central cell nucleus endosperm proliferation within the female gametophyte (FG).
    Computational Description
    SWINGER (SWN); CONTAINS InterPro DOMAIN/s: SANT, DNA-binding (InterPro:IPR001005), SET domain (InterPro:IPR001214); BEST Arabidopsis thaliana protein match is: SET domain-containing protein (TAIR:AT2G23380.1); Has 5041 Blast hits to 4734 proteins in 465 species: Archaea - 0; Bacteria - 399; Metazoa - 2132; Fungi - 472; Plants - 1030; Viruses - 0; Other Eukaryotes - 1008 (source: NCBI BLink).
    Link
    InterPro Scan - TAIR