Sequence Description Alias PCC hrr evm.model.contig_3581.1 (at4g21710 : 1212.0) Encodes the unique second-largest subunit of DNA-dependent RNA polymerase II; the ortholog of budding yeast RPB2 and a homolog of the E. coli RNA polymerase beta subunit.; NRPB2; CONTAINS InterPro DOMAIN/s: DNA-directed RNA polymerase, subunit 2, domain 6 (InterPro:IPR007120), RNA polymerase Rpb2, domain 7 (InterPro:IPR007641), RNA polymerase, beta subunit, protrusion (InterPro:IPR007644), RNA polymerase Rpb2, domain 3 (InterPro:IPR007645), DNA-directed RNA polymerase, subunit 2 (InterPro:IPR015712), RNA polymerase Rpb2, domain 2 (InterPro:IPR007642), RNA polymerase Rpb2, domain 4 (InterPro:IPR007646), RNA polymerase, beta subunit, conserved site (InterPro:IPR007121), RNA polymerase Rpb2, domain 5 (InterPro:IPR007647); BEST Arabidopsis thaliana protein match is: nuclear RNA polymerase C2 (TAIR:AT5G45140.1); Has 37546 Blast hits to 27868 proteins in 9192 species: Archae - 496; Bacteria - 17572; Metazoa - 623; Fungi - 7193; Plants - 3397; Viruses - 232; Other Eukaryotes - 8033 (source: NCBI BLink). & (q9mus5|rpob_mesvi : 137.0) DNA-directed RNA polymerase beta chain (EC 2.7.7.6) (PEP) (Plastid-encoded RNA polymerase subunit beta) (RNA polymerase subunit beta) - Mesostigma viride & (reliability: 2424.0) & (original description: no original description) 0.9254547833804622 11 evm.model.contig_2173.7 (at3g23940 : 478.0) dehydratase family; CONTAINS InterPro DOMAIN/s: Dihydroxy-acid dehydratase (InterPro:IPR004404), Dihydroxy-acid/6-phosphogluconate dehydratase, conserved site (InterPro:IPR020558), Dihydroxy-acid/6-phosphogluconate dehydratase (InterPro:IPR000581). & (reliability: 956.0) & (original description: no original description) 0.9225029044241263 3 evm.model.contig_2655.2 no hits & (original description: no original description) 0.9217846729804805 6 evm.model.contig_858.1 (at4g11130 : 176.0) Encodes RNA-dependent RNA polymerase that is required for endogenous siRNA (but not miRNA) formation. Nomenclature according to Xie, et al. (2004).; RNA-dependent RNA polymerase 2 (RDR2); CONTAINS InterPro DOMAIN/s: RNA-dependent RNA polymerase, eukaryotic-type (InterPro:IPR007855); BEST Arabidopsis thaliana protein match is: RNA-dependent RNA polymerase 1 (TAIR:AT1G14790.1); Has 1807 Blast hits to 1807 proteins in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink). & (reliability: 352.0) & (original description: no original description) 0.9207318457357538 5 evm.model.contig_4413.5 (at1g02560 : 190.0) One of several nuclear-encoded ClpPs (caseinolytic protease). Contains a highly conserved catalytic triad of Ser-type proteases (Ser-His-Asp). The name reflects nomenclature described in Adam et. al (2001).; nuclear encoded CLP protease 5 (CLPP5); FUNCTIONS IN: serine-type endopeptidase activity; INVOLVED IN: peptidyl-cysteine S-nitrosylation; LOCATED IN: in 7 components; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 14 growth stages; CONTAINS InterPro DOMAIN/s: Peptidase S14, ClpP, active site (InterPro:IPR018215), Peptidase S14, ClpP (InterPro:IPR001907); BEST Arabidopsis thaliana protein match is: CLP protease proteolytic subunit 3 (TAIR:AT1G66670.1); Has 13512 Blast hits to 13510 proteins in 3028 species: Archae - 2; Bacteria - 8525; Metazoa - 147; Fungi - 82; Plants - 1082; Viruses - 85; Other Eukaryotes - 3589 (source: NCBI BLink). & (p56317|clpp_chlvu : 170.0) ATP-dependent Clp protease proteolytic subunit (EC 3.4.21.92) (Endopeptidase Clp) - Chlorella vulgaris (Green alga) & (reliability: 380.0) & (original description: no original description) 0.91924849033946 11 evm.model.contig_3404.14 no hits & (original description: no original description) 0.9177436094989104 45 evm.model.contig_554.1 no hits & (original description: no original description) 0.9155140265050291 19 evm.model.contig_2019.10 (at4g18360 : 176.0) Aldolase-type TIM barrel family protein; FUNCTIONS IN: glycolate oxidase activity, oxidoreductase activity, FMN binding, catalytic activity; INVOLVED IN: oxidation reduction, metabolic process; LOCATED IN: peroxisome; EXPRESSED IN: 12 plant structures; EXPRESSED DURING: 4 anthesis, F mature embryo stage, petal differentiation and expansion stage, E expanded cotyledon stage, D bilateral stage; CONTAINS InterPro DOMAIN/s: Aldolase-type TIM barrel (InterPro:IPR013785), FMN-dependent alpha-hydroxy acid dehydrogenase, active site (InterPro:IPR008259), FMN-dependent dehydrogenase (InterPro:IPR000262), Alpha-hydroxy acid dehydrogenase, FMN-dependent (InterPro:IPR012133); BEST Arabidopsis thaliana protein match is: Aldolase-type TIM barrel family protein (TAIR:AT3G14420.2); Has 9948 Blast hits to 9918 proteins in 1541 species: Archae - 28; Bacteria - 4496; Metazoa - 367; Fungi - 686; Plants - 255; Viruses - 0; Other Eukaryotes - 4116 (source: NCBI BLink). & (p05414|gox_spiol : 169.0) Peroxisomal (S)-2-hydroxy-acid oxidase (EC 1.1.3.15) (Glycolate oxidase) (GOX) (Short chain alpha-hydroxy acid oxidase) - Spinacia oleracea (Spinach) & (reliability: 352.0) & (original description: no original description) 0.9114852063993341 35 evm.model.contig_3445.7 no hits & (original description: no original description) 0.9111920520225211 72 evm.model.contig_2104.19 (at5g63920 : 413.0) Encodes topoisomerase 3alpha. Suppresses somatic crossovers. Essential for resolution of meiotic recombination intermediates.; topoisomerase 3alpha (TOP3A); FUNCTIONS IN: DNA topoisomerase activity, DNA topoisomerase type I activity, DNA binding, zinc ion binding, nucleic acid binding; INVOLVED IN: in 7 processes; LOCATED IN: chromosome; EXPRESSED IN: 17 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: DNA topoisomerase, type IA, zn finger (InterPro:IPR013498), DNA topoisomerase, type IA, core (InterPro:IPR000380), DNA topoisomerase, type IA, domain 2 (InterPro:IPR003601), DNA topoisomerase, type IA, DNA-binding (InterPro:IPR003602), DNA topoisomerase, type IA, central (InterPro:IPR013497), Zinc finger, GRF-type (InterPro:IPR010666), DNA topoisomerase, type IA, central region, subdomain 3 (InterPro:IPR013826), Toprim domain, subgroup (InterPro:IPR006154), DNA topoisomerase, type IA, central region, subdomain 1 (InterPro:IPR013824), Toprim domain (InterPro:IPR006171), Zinc finger, CCHC-type (InterPro:IPR001878); BEST Arabidopsis thaliana protein match is: DNA topoisomerase, type IA, core (TAIR:AT2G32000.1); Has 1807 Blast hits to 1807 proteins in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink). & (reliability: 826.0) & (original description: no original description) 0.9065895528456288 40 evm.model.contig_2369.2 no hits & (original description: no original description) 0.8999731487302766 58 evm.model.contig_2113.8 (at1g14610 : 915.0) Required for proper proliferation of basal cells.; TWIN 2 (TWN2); FUNCTIONS IN: valine-tRNA ligase activity, aminoacyl-tRNA ligase activity, nucleotide binding, ATP binding; INVOLVED IN: tRNA aminoacylation for protein translation, embryo development ending in seed dormancy; LOCATED IN: mitochondrion, chloroplast; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 15 growth stages; CONTAINS InterPro DOMAIN/s: Valyl-tRNA synthetase, class Ia (InterPro:IPR002303), Aminoacyl-tRNA synthetase, class I, conserved site (InterPro:IPR001412), Aminoacyl-tRNA synthetase, class 1a, anticodon-binding (InterPro:IPR009080), Rossmann-like alpha/beta/alpha sandwich fold (InterPro:IPR014729), Valyl/Leucyl/Isoleucyl-tRNA synthetase, class Ia, editing (InterPro:IPR009008), Valyl/Leucyl/Isoleucyl-tRNA synthetase, class I, anticodon-binding (InterPro:IPR013155), Aminoacyl-tRNA synthetase, class Ia (InterPro:IPR002300), Valyl-tRNA synthetase, class Ia, N-terminal (InterPro:IPR019754); BEST Arabidopsis thaliana protein match is: ATP binding;valine-tRNA ligases;aminoacyl-tRNA ligases;nucleotide binding;ATP binding;aminoacyl-tRNA ligases (TAIR:AT5G16715.1); Has 39194 Blast hits to 36732 proteins in 3122 species: Archae - 839; Bacteria - 19755; Metazoa - 1534; Fungi - 892; Plants - 369; Viruses - 3; Other Eukaryotes - 15802 (source: NCBI BLink). & (reliability: 1830.0) & (original description: no original description) 0.8996451110468372 47 evm.model.contig_3455.1 (at4g24190 : 514.0) encodes an ortholog of GRP94, an ER-resident HSP90-like protein and is involved in regulation of meristem size and organization. Single and double mutant analyses suggest that SHD may be required for the correct folding and/or complex formation of CLV proteins. Lines carrying recessive mutations in this locus exhibits expanded shoot meristems, disorganized root meristems, and defective pollen tube elongation. Transcript is detected in all tissues examined and is not induced by heat. Endoplasmin supports the protein secretory pathway and has a role in proliferating tissues.; SHEPHERD (SHD); FUNCTIONS IN: unfolded protein binding, ATP binding; INVOLVED IN: in 8 processes; LOCATED IN: in 6 components; EXPRESSED IN: 25 plant structures; EXPRESSED DURING: 16 growth stages; CONTAINS InterPro DOMAIN/s: Chaperone protein htpG (InterPro:IPR001404), Heat shock protein Hsp90, C-terminal (InterPro:IPR020576), Heat shock protein Hsp90, N-terminal (InterPro:IPR020575), Molecular chaperone, heat shock protein, endoplasmin (InterPro:IPR015566), ATPase-like, ATP-binding domain (InterPro:IPR003594), Heat shock protein Hsp90, conserved site (InterPro:IPR019805), Ribosomal protein S5 domain 2-type fold (InterPro:IPR020568); BEST Arabidopsis thaliana protein match is: heat shock protein 90.1 (TAIR:AT5G52640.1); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). & (p36183|enpl_horvu : 514.0) Endoplasmin homolog precursor (GRP94 homolog) - Hordeum vulgare (Barley) & (reliability: 1028.0) & (original description: no original description) 0.8992490275578107 33 evm.model.contig_2069.4 no hits & (original description: no original description) 0.8985419421650951 61 evm.model.contig_4449.6 (at5g51660 : 178.0) cleavage and polyadenylation specificity factor 160 (CPSF160); FUNCTIONS IN: nucleic acid binding; INVOLVED IN: mRNA cleavage, mRNA polyadenylation; LOCATED IN: nucleus; EXPRESSED IN: 21 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Cleavage/polyadenylation specificity factor, A subunit, C-terminal (InterPro:IPR004871); BEST Arabidopsis thaliana protein match is: damaged DNA binding protein 1A (TAIR:AT4G05420.2); Has 1568 Blast hits to 1022 proteins in 220 species: Archae - 0; Bacteria - 0; Metazoa - 654; Fungi - 429; Plants - 267; Viruses - 0; Other Eukaryotes - 218 (source: NCBI BLink). & (q7xwp1|cpsf1_orysa : 154.0) Probable cleavage and polyadenylation specificity factor 160 kDa subunit (CPSF 160 kDa subunit) - Oryza sativa (Rice) & (reliability: 356.0) & (original description: no original description) 0.8966176105496128 28 evm.model.contig_2275.6 no hits & (original description: no original description) 0.8951047422175871 46 evm.model.contig_2149.18 (at4g38130 : 505.0) Encodes a histone deacetylase that enhances AtERF7-mediated transcriptional repression. Binds SIM3 and ERF7. Expressed in the nucleus in most tissues examined and throughout the life of the plant. Involved in jasmonic acid and ethylene dependent pathogen resistance. The sequence in GenBank has 17 AG dinucleotide repeats missing, which is also missing in Ler shotgun sequence from Cereon. Although it is annotated to be in Columbia, the GB sequence is probably not of Columbia origin. Plays a role in embryogenesis as mutants grown at higher temperatures display abnormalities in the organization of the root and shoot. Plant lines expressing an RNAi construct targeted against HDA19 shows some resistance to agrobacterium-mediated root transformation.; histone deacetylase 1 (HD1); CONTAINS InterPro DOMAIN/s: Histone deacetylase (InterPro:IPR003084), Histone deacetylase superfamily (InterPro:IPR000286); BEST Arabidopsis thaliana protein match is: histone deacetylase 6 (TAIR:AT5G63110.1); Has 8759 Blast hits to 8549 proteins in 1452 species: Archae - 219; Bacteria - 3192; Metazoa - 1525; Fungi - 536; Plants - 478; Viruses - 0; Other Eukaryotes - 2809 (source: NCBI BLink). & (p56521|hdac_maize : 483.0) Probable histone deacetylase (RPD3 homolog) - Zea mays (Maize) & (reliability: 1010.0) & (original description: no original description) 0.8944151314449448 52 evm.model.contig_579.8 no hits & (original description: no original description) 0.8918145991075329 96 evm.model.contig_3529.1 no hits & (original description: no original description) 0.8911319518245652 27 evm.model.contig_3555.4 no hits & (original description: no original description) 0.890859113020407 60 evm.model.contig_2116.8 (at1g64790 : 446.0) ILITYHIA (ILA) is a HEAT repeat protein involved in plant immunity. The gene is also involved in systemic acquired resistance induced by P. syringae expressing avrRps4. Loss-of-function mutants of ILA caused pleiotropic defects in the mutant plants. The mutant plants are smaller in size and the leaves are serrated and yellow to light green in color.; ILITYHIA (ILA); FUNCTIONS IN: binding; INVOLVED IN: systemic acquired resistance, defense response to bacterium; LOCATED IN: cytosol, nucleus; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: HEAT (InterPro:IPR000357), Armadillo-like helical (InterPro:IPR011989), HEAT, type 2 (InterPro:IPR021133), Armadillo-type fold (InterPro:IPR016024). & (reliability: 892.0) & (original description: no original description) 0.8872380412617095 65 evm.model.contig_3423.34 no hits & (original description: no original description) 0.8822707402502364 94 evm.model.contig_4512.5 (at5g57590 : 516.0) Mutant complemented by E coli Bio A gene encoding 7,8-diaminopelargonic acid aminotransferase.; biotin auxotroph 1 (BIO1); CONTAINS InterPro DOMAIN/s: Pyridoxal phosphate-dependent transferase, major domain (InterPro:IPR015424), Aminotransferase class-III (InterPro:IPR005814), Pyridoxal phosphate-dependent transferase, major region, subdomain 1 (InterPro:IPR015421); BEST Arabidopsis thaliana protein match is: HOPW1-1-interacting 1 (TAIR:AT1G80600.1); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). & (reliability: 1032.0) & (original description: no original description) 0.880041521449258 41 evm.model.contig_4470.2 no hits & (original description: no original description) 0.8758456816482858 99 evm.model.contig_4530.2 no hits & (original description: no original description) 0.8751953709626554 51 evm.model.contig_790.4 (at3g57890 : 118.0) Tubulin binding cofactor C domain-containing protein; FUNCTIONS IN: binding; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: 24 plant structures; EXPRESSED DURING: 14 growth stages; CONTAINS InterPro DOMAIN/s: CARP motif (InterPro:IPR006599), C-CAP/cofactor C-like domain (InterPro:IPR017901), Tubulin binding cofactor C (InterPro:IPR012945); BEST Arabidopsis thaliana protein match is: C-CAP/cofactor C-like domain-containing protein (TAIR:AT2G42230.2). & (reliability: 224.0) & (original description: no original description) 0.8747029628283605 55 evm.model.contig_3957.1 (p29610|cy12_soltu : 322.0) Cytochrome c1, heme protein, mitochondrial precursor (Clone PC18I) (Fragment) - Solanum tuberosum (Potato) & (at3g27240 : 320.0) Cytochrome C1 family; FUNCTIONS IN: electron carrier activity, iron ion binding, heme binding, electron transporter, transferring electrons within CoQH2-cytochrome c reductase complex activity; LOCATED IN: in 6 components; EXPRESSED IN: 26 plant structures; EXPRESSED DURING: 15 growth stages; CONTAINS InterPro DOMAIN/s: Cytochrome c1 (InterPro:IPR002326), Cytochrome c1, transmembrane anchor, C-terminal (InterPro:IPR021157), Cytochrome c domain (InterPro:IPR009056); BEST Arabidopsis thaliana protein match is: Cytochrome C1 family (TAIR:AT5G40810.1); Has 3450 Blast hits to 3450 proteins in 754 species: Archae - 0; Bacteria - 1111; Metazoa - 210; Fungi - 210; Plants - 102; Viruses - 0; Other Eukaryotes - 1817 (source: NCBI BLink). & (reliability: 640.0) & (original description: no original description) 0.8702384896576848 65