Sequence Description Alias PCC hrr evm.model.contig_2343.3 no hits & (original description: no original description) 0.9729930518122388 1 evm.model.contig_2348.5 (at3g09320 : 104.0) DHHC-type zinc finger family protein; FUNCTIONS IN: zinc ion binding; INVOLVED IN: biological_process unknown; LOCATED IN: endomembrane system; EXPRESSED IN: 24 plant structures; EXPRESSED DURING: 15 growth stages; CONTAINS InterPro DOMAIN/s: Zinc finger, DHHC-type (InterPro:IPR001594); BEST Arabidopsis thaliana protein match is: DHHC-type zinc finger family protein (TAIR:AT5G04270.1); Has 5137 Blast hits to 5129 proteins in 251 species: Archae - 0; Bacteria - 0; Metazoa - 2212; Fungi - 755; Plants - 839; Viruses - 0; Other Eukaryotes - 1331 (source: NCBI BLink). & (reliability: 208.0) & (original description: no original description) 0.9448337064728032 5 evm.model.contig_2094.13 (at1g30910 : 156.0) Molybdenum cofactor sulfurase family protein; FUNCTIONS IN: molybdenum ion binding, Mo-molybdopterin cofactor sulfurase activity, pyridoxal phosphate binding, catalytic activity; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Pyruvate kinase, beta-barrel-like (InterPro:IPR011037), MOSC, N-terminal beta barrel (InterPro:IPR005303), Molybdenum cofactor sulfurase, C-terminal (InterPro:IPR005302); BEST Arabidopsis thaliana protein match is: Molybdenum cofactor sulfurase family protein (TAIR:AT5G44720.1); Has 1932 Blast hits to 1913 proteins in 692 species: Archae - 10; Bacteria - 1072; Metazoa - 332; Fungi - 283; Plants - 100; Viruses - 0; Other Eukaryotes - 135 (source: NCBI BLink). & (q655r6|mocos_orysa : 105.0) Molybdenum cofactor sulfurase (EC 4.4.-.-) (MoCo sulfurase) (MOS) - Oryza sativa (Rice) & (reliability: 312.0) & (original description: no original description) 0.9437931301916327 3 evm.model.contig_2275.8 no hits & (original description: no original description) 0.9415997254484125 4 evm.model.contig_2102.3 (at3g44850 : 209.0) Protein kinase superfamily protein; FUNCTIONS IN: protein serine/threonine kinase activity, protein kinase activity, kinase activity, ATP binding; INVOLVED IN: protein amino acid phosphorylation; LOCATED IN: cellular_component unknown; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Protein kinase, ATP binding site (InterPro:IPR017441), Protein kinase, catalytic domain (InterPro:IPR000719), Serine/threonine-protein kinase-like domain (InterPro:IPR017442), Protein kinase-like domain (InterPro:IPR011009), Serine/threonine-protein kinase, active site (InterPro:IPR008271); BEST Arabidopsis thaliana protein match is: Protein kinase superfamily protein (TAIR:AT5G22840.1); Has 38681 Blast hits to 29843 proteins in 1092 species: Archae - 4; Bacteria - 1517; Metazoa - 15700; Fungi - 7143; Plants - 6601; Viruses - 16; Other Eukaryotes - 7700 (source: NCBI BLink). & (reliability: 418.0) & (original description: no original description) 0.9414436596877432 5 evm.model.contig_446.7 (at1g34130 : 659.0) Encodes homolog of yeast STT3, a subunit of oligosaccharyltransferase.; staurosporin and temperature sensitive 3-like b (STT3B); FUNCTIONS IN: oligosaccharyl transferase activity; INVOLVED IN: protein amino acid glycosylation; LOCATED IN: endoplasmic reticulum, plasma membrane, membrane; EXPRESSED IN: 24 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Oligosaccharyl transferase, STT3 subunit (InterPro:IPR003674); BEST Arabidopsis thaliana protein match is: staurosporin and temperature sensitive 3-like A (TAIR:AT5G19690.1); Has 1054 Blast hits to 1026 proteins in 313 species: Archae - 251; Bacteria - 48; Metazoa - 304; Fungi - 138; Plants - 87; Viruses - 0; Other Eukaryotes - 226 (source: NCBI BLink). & (reliability: 1318.0) & (original description: no original description) 0.937013738267133 6 evm.model.contig_4456.2 no hits & (original description: no original description) 0.9339139959155136 14 evm.model.contig_2172.8 (p29677|mppa_soltu : 192.0) Mitochondrial-processing peptidase alpha subunit, mitochondrial precursor (EC 3.4.24.64) (Alpha-MPP) (Ubiquinol-cytochrome-c reductase subunit II) (EC 1.10.2.2) - Solanum tuberosum (Potato) & (at1g51980 : 186.0) Insulinase (Peptidase family M16) protein; FUNCTIONS IN: metalloendopeptidase activity, ATP binding; INVOLVED IN: proteolysis, response to salt stress; LOCATED IN: mitochondrion, plasma membrane, plastid, mitochondrial respiratory chain complex III, membrane; EXPRESSED IN: 26 plant structures; EXPRESSED DURING: 16 growth stages; CONTAINS InterPro DOMAIN/s: Peptidase M16, zinc-binding site (InterPro:IPR001431), Peptidase M16, C-terminal (InterPro:IPR007863), Peptidase M16, N-terminal (InterPro:IPR011765), Metalloenzyme, LuxS/M16 peptidase-like, metal-binding (InterPro:IPR011249), Peptidase M16, core (InterPro:IPR011237); BEST Arabidopsis thaliana protein match is: mitochondrial processing peptidase alpha subunit (TAIR:AT3G16480.1); Has 5945 Blast hits to 5839 proteins in 1469 species: Archae - 10; Bacteria - 3395; Metazoa - 673; Fungi - 538; Plants - 242; Viruses - 3; Other Eukaryotes - 1084 (source: NCBI BLink). & (reliability: 372.0) & (original description: no original description) 0.9335345327944798 17 evm.model.contig_667.4 (at2g21250 : 236.0) NAD(P)-linked oxidoreductase superfamily protein; FUNCTIONS IN: oxidoreductase activity; INVOLVED IN: response to cadmium ion; EXPRESSED IN: cultured cell, leaf; EXPRESSED DURING: seedling growth; CONTAINS InterPro DOMAIN/s: Aldo/keto reductase (InterPro:IPR001395), Aldo/keto reductase subgroup (InterPro:IPR020471), Aldo/keto reductase, conserved site (InterPro:IPR018170); BEST Arabidopsis thaliana protein match is: NAD(P)-linked oxidoreductase superfamily protein (TAIR:AT2G21260.1); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). & (p28475|s6pd_maldo : 205.0) NADP-dependent D-sorbitol-6-phosphate dehydrogenase (EC 1.1.1.200) (Aldose-6-phosphate reductase [NADPH]) (NADP-S6PDH) - Malus domestica (Apple) (Malus sylvestris) & (reliability: 472.0) & (original description: no original description) 0.9328221299195045 9 evm.model.contig_4449.6 (at5g51660 : 178.0) cleavage and polyadenylation specificity factor 160 (CPSF160); FUNCTIONS IN: nucleic acid binding; INVOLVED IN: mRNA cleavage, mRNA polyadenylation; LOCATED IN: nucleus; EXPRESSED IN: 21 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Cleavage/polyadenylation specificity factor, A subunit, C-terminal (InterPro:IPR004871); BEST Arabidopsis thaliana protein match is: damaged DNA binding protein 1A (TAIR:AT4G05420.2); Has 1568 Blast hits to 1022 proteins in 220 species: Archae - 0; Bacteria - 0; Metazoa - 654; Fungi - 429; Plants - 267; Viruses - 0; Other Eukaryotes - 218 (source: NCBI BLink). & (q7xwp1|cpsf1_orysa : 154.0) Probable cleavage and polyadenylation specificity factor 160 kDa subunit (CPSF 160 kDa subunit) - Oryza sativa (Rice) & (reliability: 356.0) & (original description: no original description) 0.9325960559792739 10 evm.model.contig_2032.3 (at4g11820 : 339.0) Encodes a protein with hydroxymethylglutaryl-CoA synthase activity which was characterized by phenotypical complementation of the S. cerevisiae mutant.; MVA1; CONTAINS InterPro DOMAIN/s: Thiolase-like (InterPro:IPR016039), Hydroxymethylglutaryl-coenzyme A synthase C-terminal (InterPro:IPR013746), Hydroxymethylglutaryl-coenzyme A synthase, N-terminal (InterPro:IPR013528), Hydroxymethylglutaryl-CoA synthase, eukaryotic (InterPro:IPR010122), Hydroxymethylglutaryl-coenzyme A synthase, active site (InterPro:IPR000590); Has 2176 Blast hits to 2172 proteins in 850 species: Archae - 228; Bacteria - 1039; Metazoa - 300; Fungi - 184; Plants - 117; Viruses - 0; Other Eukaryotes - 308 (source: NCBI BLink). & (reliability: 678.0) & (original description: no original description) 0.9325403810146379 31 evm.model.contig_2015.6 (p49964|srp19_orysa : 89.0) Signal recognition particle 19 kDa protein (SRP19) - Oryza sativa (Rice) & (at1g48160 : 82.4) signal recognition particle 19 kDa protein, putative / SRP19, putative; FUNCTIONS IN: 7S RNA binding; INVOLVED IN: protein targeting, SRP-dependent cotranslational protein targeting to membrane; LOCATED IN: signal recognition particle, signal recognition particle, endoplasmic reticulum targeting; EXPRESSED IN: 24 plant structures; EXPRESSED DURING: 15 growth stages; CONTAINS InterPro DOMAIN/s: Signal recognition particle, SRP19 subunit (InterPro:IPR002778); Has 500 Blast hits to 499 proteins in 236 species: Archae - 81; Bacteria - 0; Metazoa - 158; Fungi - 144; Plants - 60; Viruses - 0; Other Eukaryotes - 57 (source: NCBI BLink). & (reliability: 164.8) & (original description: no original description) 0.9298195391350662 12 evm.model.contig_4402.7 (gnl|cdd|68872 : 129.0) no description available & (at3g56570 : 128.0) SET domain-containing protein; CONTAINS InterPro DOMAIN/s: RuBisCO-cytochrome methylase, RMS1 (InterPro:IPR011383); BEST Arabidopsis thaliana protein match is: Rubisco methyltransferase family protein (TAIR:AT1G14030.1); Has 25210 Blast hits to 12491 proteins in 636 species: Archae - 52; Bacteria - 1284; Metazoa - 10981; Fungi - 2786; Plants - 1267; Viruses - 743; Other Eukaryotes - 8097 (source: NCBI BLink). & (gnl|cdd|39774 : 82.1) no description available & (reliability: 256.0) & (original description: no original description) 0.9291219851529599 13 evm.model.contig_2150.11 no hits & (original description: no original description) 0.9268676546623306 14 evm.model.contig_600.2 (at1g75330 : 376.0) ornithine carbamoyltransferase (OTC); FUNCTIONS IN: amino acid binding, ornithine carbamoyltransferase activity, carboxyl- or carbamoyltransferase activity; INVOLVED IN: cellular amino acid metabolic process; LOCATED IN: chloroplast, chloroplast stroma; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Aspartate/ornithine carbamoyltransferase, carbamoyl-P binding (InterPro:IPR006132), Aspartate/ornithine carbamoyltransferase (InterPro:IPR006130), Aspartate/ornithine carbamoyltransferase, Asp/Orn-binding domain (InterPro:IPR006131), Ornithine carbamoyltransferase (InterPro:IPR002292); BEST Arabidopsis thaliana protein match is: PYRIMIDINE B (TAIR:AT3G20330.1); Has 16793 Blast hits to 16793 proteins in 2905 species: Archae - 534; Bacteria - 11079; Metazoa - 203; Fungi - 280; Plants - 150; Viruses - 6; Other Eukaryotes - 4541 (source: NCBI BLink). & (q43814|otc_pea : 368.0) Ornithine carbamoyltransferase, chloroplast precursor (EC 2.1.3.3) (OTCase) (Ornithine transcarbamylase) - Pisum sativum (Garden pea) & (reliability: 752.0) & (original description: no original description) 0.9261841555656575 40 evm.model.contig_3423.34 no hits & (original description: no original description) 0.9259286806651343 16 evm.model.contig_558.6 no hits & (original description: no original description) 0.9257748000013957 49 evm.model.contig_3445.7 no hits & (original description: no original description) 0.9257445691118336 37 evm.model.contig_2039.3 no hits & (original description: no original description) 0.9251908497746795 19 evm.model.contig_2044.21 no hits & (original description: no original description) 0.9229083442350294 37 evm.model.contig_3491.10 (at4g22720 : 450.0) Actin-like ATPase superfamily protein; FUNCTIONS IN: metalloendopeptidase activity; INVOLVED IN: proteolysis; LOCATED IN: cellular_component unknown; EXPRESSED IN: 24 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Peptidase M22, glycoprotease (InterPro:IPR000905), Peptidase M22, glycoprotease, subgroup (InterPro:IPR017861); BEST Arabidopsis thaliana protein match is: glycoprotease 1 (TAIR:AT2G45270.1); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). & (reliability: 900.0) & (original description: no original description) 0.9228996692582738 71 evm.model.contig_3542.3 no hits & (original description: no original description) 0.9200397999680665 22 evm.model.contig_589.2 no hits & (original description: no original description) 0.9199193638726249 23 evm.model.contig_522.22 (at1g17760 : 299.0) Encodes a homolog of the mammalian protein CstF77, a polyadenylation factor subunit. RNA 3′-endñprocessing factor of antisense FLC transcript. Mediates silencing of the floral repressor gene FLC.; CSTF77; FUNCTIONS IN: protein binding, mRNA binding, transcription repressor activity; INVOLVED IN: RNA 3'-end processing, mRNA processing, embryo sac development; LOCATED IN: intracellular, nucleus; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 12 growth stages; CONTAINS InterPro DOMAIN/s: RNA-processing protein, HAT helix (InterPro:IPR003107), Tetratricopeptide-like helical (InterPro:IPR011990), Tetratricopeptide repeat-containing (InterPro:IPR013026), Tetratricopeptide repeat (InterPro:IPR019734), Suppressor of forked (InterPro:IPR008847); BEST Arabidopsis thaliana protein match is: crooked neck protein, putative / cell cycle protein, putative (TAIR:AT5G45990.1); Has 2092 Blast hits to 1537 proteins in 234 species: Archae - 0; Bacteria - 14; Metazoa - 771; Fungi - 713; Plants - 343; Viruses - 0; Other Eukaryotes - 251 (source: NCBI BLink). & (reliability: 598.0) & (original description: no original description) 0.9180614137754197 38 evm.model.contig_2282.8 (at1g14240 : 109.0) GDA1/CD39 nucleoside phosphatase family protein; FUNCTIONS IN: hydrolase activity; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: 20 plant structures; EXPRESSED DURING: 11 growth stages; CONTAINS InterPro DOMAIN/s: Nucleoside phosphatase GDA1/CD39 (InterPro:IPR000407); BEST Arabidopsis thaliana protein match is: GDA1/CD39 nucleoside phosphatase family protein (TAIR:AT1G14250.1); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). & (p80595|apy_soltu : 109.0) Apyrase precursor (EC 3.6.1.5) (ATP-diphosphatase) (Adenosine diphosphatase) (ADPase) (ATP-diphosphohydrolase) - Solanum tuberosum (Potato) & (reliability: 218.0) & (original description: no original description) 0.9177915378827733 97 evm.model.contig_3702.1 (at2g38560 : 98.2) Encodes RNA polymerase II transcript elongation factor TFIIS. Complements yeast TFIIS mutation. Mutant plants display essentially normal development, but they flower slightly earlier than the wild type and show clearly reduced seed dormancy.; transcript elongation factor IIS (TFIIS); CONTAINS InterPro DOMAIN/s: Zinc finger, TFIIS-type (InterPro:IPR001222), Transcription elongation factor, TFIIS/CRSP70, N-terminal, sub-type (InterPro:IPR003617), Transcription elongation factor S-II, central domain (InterPro:IPR003618), Transcription factor IIS, N-terminal (InterPro:IPR017923), Transcription elongation factor S-IIM (InterPro:IPR017890), Transcription elongation factor, IIS (InterPro:IPR016492), Transcription elongation factor, TFIIS (InterPro:IPR006289), Transcription elongation factor, TFIIS/elongin A/CRSP70, N-terminal (InterPro:IPR010990); BEST Arabidopsis thaliana protein match is: F-box family protein (TAIR:AT2G42730.1); Has 1858 Blast hits to 1830 proteins in 294 species: Archae - 58; Bacteria - 2; Metazoa - 702; Fungi - 370; Plants - 279; Viruses - 52; Other Eukaryotes - 395 (source: NCBI BLink). & (reliability: 196.4) & (original description: no original description) 0.917337681969262 26 evm.model.contig_2070.7 no hits & (original description: no original description) 0.9165190795324475 54 evm.model.contig_3590.1 no hits & (original description: no original description) 0.91535809526996 29 evm.model.contig_681.3 no hits & (original description: no original description) 0.9152479473059372 35 evm.model.contig_441.8 no hits & (original description: no original description) 0.9150282033408927 48 evm.model.contig_527.9 no hits & (original description: no original description) 0.9149677660520981 33 evm.model.contig_4512.2 (at5g62930 : 99.0) SGNH hydrolase-type esterase superfamily protein; FUNCTIONS IN: hydrolase activity, hydrolase activity, acting on ester bonds, carboxylesterase activity; INVOLVED IN: lipid metabolic process; LOCATED IN: cellular_component unknown; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 14 growth stages; CONTAINS InterPro DOMAIN/s: Esterase, SGNH hydrolase-type, subgroup (InterPro:IPR013831), Lipase, GDSL (InterPro:IPR001087), Esterase, SGNH hydrolase-type (InterPro:IPR013830); BEST Arabidopsis thaliana protein match is: SGNH hydrolase-type esterase superfamily protein (TAIR:AT5G45920.1); Has 689 Blast hits to 688 proteins in 254 species: Archae - 0; Bacteria - 229; Metazoa - 77; Fungi - 158; Plants - 171; Viruses - 0; Other Eukaryotes - 54 (source: NCBI BLink). & (reliability: 198.0) & (original description: no original description) 0.914105382230669 75 evm.model.contig_2203.1 no hits & (original description: no original description) 0.9134276095939031 35 evm.model.contig_478.2 no hits & (original description: no original description) 0.9132535205164702 37 evm.model.contig_3690.2 no hits & (original description: no original description) 0.9131679823564292 39 evm.model.contig_3455.1 (at4g24190 : 514.0) encodes an ortholog of GRP94, an ER-resident HSP90-like protein and is involved in regulation of meristem size and organization. Single and double mutant analyses suggest that SHD may be required for the correct folding and/or complex formation of CLV proteins. Lines carrying recessive mutations in this locus exhibits expanded shoot meristems, disorganized root meristems, and defective pollen tube elongation. Transcript is detected in all tissues examined and is not induced by heat. Endoplasmin supports the protein secretory pathway and has a role in proliferating tissues.; SHEPHERD (SHD); FUNCTIONS IN: unfolded protein binding, ATP binding; INVOLVED IN: in 8 processes; LOCATED IN: in 6 components; EXPRESSED IN: 25 plant structures; EXPRESSED DURING: 16 growth stages; CONTAINS InterPro DOMAIN/s: Chaperone protein htpG (InterPro:IPR001404), Heat shock protein Hsp90, C-terminal (InterPro:IPR020576), Heat shock protein Hsp90, N-terminal (InterPro:IPR020575), Molecular chaperone, heat shock protein, endoplasmin (InterPro:IPR015566), ATPase-like, ATP-binding domain (InterPro:IPR003594), Heat shock protein Hsp90, conserved site (InterPro:IPR019805), Ribosomal protein S5 domain 2-type fold (InterPro:IPR020568); BEST Arabidopsis thaliana protein match is: heat shock protein 90.1 (TAIR:AT5G52640.1); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). & (p36183|enpl_horvu : 514.0) Endoplasmin homolog precursor (GRP94 homolog) - Hordeum vulgare (Barley) & (reliability: 1028.0) & (original description: no original description) 0.9128039082028546 40 evm.model.contig_3484.2 (at4g05420 : 736.0) Structurally similar to damaged DNA binding proteins.DDB1a is part of a 350 KDa nuclear localized DET1 protein complex. This complex may physically interact with histone tails and while bound to chromatin- repress transcription of genes involved in photomorphogenesis.; damaged DNA binding protein 1A (DDB1A); FUNCTIONS IN: protein binding, DNA binding; INVOLVED IN: negative regulation of transcription, negative regulation of photomorphogenesis; LOCATED IN: nucleus, CUL4 RING ubiquitin ligase complex; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 14 growth stages; CONTAINS InterPro DOMAIN/s: WD40 repeat-like-containing domain (InterPro:IPR011046), Cleavage/polyadenylation specificity factor, A subunit, C-terminal (InterPro:IPR004871); BEST Arabidopsis thaliana protein match is: damaged DNA binding protein 1B (TAIR:AT4G21100.1); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). & (q7xwp1|cpsf1_orysa : 81.6) Probable cleavage and polyadenylation specificity factor 160 kDa subunit (CPSF 160 kDa subunit) - Oryza sativa (Rice) & (reliability: 1472.0) & (original description: no original description) 0.912505481755732 84 evm.model.contig_4418.20 no hits & (original description: no original description) 0.9118163945787576 43 evm.model.contig_436.11 (at3g16990 : 117.0) Haem oxygenase-like, multi-helical; CONTAINS InterPro DOMAIN/s: Haem oxygenase-like, multi-helical (InterPro:IPR016084), TENA/THI-4 protein/Coenzyme PQQ biosynthesis protein C (InterPro:IPR004305); Has 259 Blast hits to 259 proteins in 88 species: Archae - 23; Bacteria - 94; Metazoa - 0; Fungi - 34; Plants - 38; Viruses - 0; Other Eukaryotes - 70 (source: NCBI BLink). & (q9swb6|pm36_soybn : 115.0) Seed maturation protein PM36 - Glycine max (Soybean) & (reliability: 234.0) & (original description: no original description) 0.910646713572059 44 evm.model.contig_2121.27 (at5g23880 : 382.0) Encodes a protein similar to the 100kD subunit of cleavage and polyadenylation specificity factor (CPSF), the factor responsible for the recognition of the AAUAAA motif during mRNA polyadenylation. The protein interacts with a portion of a nuclear poly(A) polymerase. It is likely to be a part of the mRNA 3'end formation apparatus.; cleavage and polyadenylation specificity factor 100 (CPSF100); FUNCTIONS IN: protein binding, DNA binding; INVOLVED IN: mRNA cleavage, mRNA polyadenylation, posttranscriptional gene silencing by RNA, embryo development ending in seed dormancy; LOCATED IN: mRNA cleavage and polyadenylation specificity factor complex, nucleus; EXPRESSED IN: 24 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Beta-Casp domain (InterPro:IPR022712), RNA-metabolising metallo-beta-lactamase (InterPro:IPR011108), Beta-lactamase-like (InterPro:IPR001279); BEST Arabidopsis thaliana protein match is: cleavage and polyadenylation specificity factor 73-I (TAIR:AT1G61010.3); Has 1807 Blast hits to 1807 proteins in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink). & (q652p4|cpsf2_orysa : 280.0) Cleavage and polyadenylation specificity factor 100 kDa subunit (CPSF 100 kDa subunit) - Oryza sativa (Rice) & (reliability: 764.0) & (original description: no original description) 0.9105127360666098 45 evm.model.contig_444.22 no hits & (original description: no original description) 0.9101099155273772 49 evm.model.contig_2079.2 (at1g21370 : 243.0) unknown protein; CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF218 (InterPro:IPR003848); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). & (reliability: 486.0) & (original description: no original description) 0.9094581384225099 47 evm.model.contig_522.21 no hits & (original description: no original description) 0.9094196474397451 48 evm.model.contig_2046.2 (at1g18260 : 107.0) HCP-like superfamily protein; FUNCTIONS IN: binding; INVOLVED IN: biological_process unknown; LOCATED IN: endoplasmic reticulum, membrane; EXPRESSED IN: 25 plant structures; EXPRESSED DURING: 15 growth stages; CONTAINS InterPro DOMAIN/s: Tetratricopeptide-like helical (InterPro:IPR011990), Sel1-like (InterPro:IPR006597); BEST Arabidopsis thaliana protein match is: HCP-like superfamily protein (TAIR:AT1G73570.1); Has 24350 Blast hits to 8436 proteins in 1359 species: Archae - 0; Bacteria - 17163; Metazoa - 848; Fungi - 960; Plants - 547; Viruses - 27; Other Eukaryotes - 4805 (source: NCBI BLink). & (reliability: 214.0) & (original description: no original description) 0.908644849625388 49 evm.model.contig_4547.2 no hits & (original description: no original description) 0.9077737164079888 50 evm.model.contig_3581.1 (at4g21710 : 1212.0) Encodes the unique second-largest subunit of DNA-dependent RNA polymerase II; the ortholog of budding yeast RPB2 and a homolog of the E. coli RNA polymerase beta subunit.; NRPB2; CONTAINS InterPro DOMAIN/s: DNA-directed RNA polymerase, subunit 2, domain 6 (InterPro:IPR007120), RNA polymerase Rpb2, domain 7 (InterPro:IPR007641), RNA polymerase, beta subunit, protrusion (InterPro:IPR007644), RNA polymerase Rpb2, domain 3 (InterPro:IPR007645), DNA-directed RNA polymerase, subunit 2 (InterPro:IPR015712), RNA polymerase Rpb2, domain 2 (InterPro:IPR007642), RNA polymerase Rpb2, domain 4 (InterPro:IPR007646), RNA polymerase, beta subunit, conserved site (InterPro:IPR007121), RNA polymerase Rpb2, domain 5 (InterPro:IPR007647); BEST Arabidopsis thaliana protein match is: nuclear RNA polymerase C2 (TAIR:AT5G45140.1); Has 37546 Blast hits to 27868 proteins in 9192 species: Archae - 496; Bacteria - 17572; Metazoa - 623; Fungi - 7193; Plants - 3397; Viruses - 232; Other Eukaryotes - 8033 (source: NCBI BLink). & (q9mus5|rpob_mesvi : 137.0) DNA-directed RNA polymerase beta chain (EC 2.7.7.6) (PEP) (Plastid-encoded RNA polymerase subunit beta) (RNA polymerase subunit beta) - Mesostigma viride & (reliability: 2424.0) & (original description: no original description) 0.9076947566956212 51 evm.model.contig_3820.2 no hits & (original description: no original description) 0.9056342243882896 54 evm.model.contig_724.6 (at2g45730 : 131.0) eukaryotic initiation factor 3 gamma subunit family protein; FUNCTIONS IN: translation initiation factor activity; INVOLVED IN: translational initiation, regulation of translational initiation; LOCATED IN: cellular_component unknown; EXPRESSED IN: 24 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Eukaryotic initiation factor 3, gamma subunit (InterPro:IPR007316), tRNA (adenine-N(1)-)-methyltransferase, non-catalytic TRM6 subunit (InterPro:IPR017423); Has 402 Blast hits to 378 proteins in 203 species: Archae - 0; Bacteria - 4; Metazoa - 118; Fungi - 152; Plants - 45; Viruses - 0; Other Eukaryotes - 83 (source: NCBI BLink). & (reliability: 262.0) & (original description: no original description) 0.9048061236935969 54 evm.model.contig_3445.6 no hits & (original description: no original description) 0.9035582780385438 83 evm.model.contig_4438.23 (at3g58750 : 318.0) Encodes a peroxisomal citrate synthase that is expressed throughout seedling and shoot development.; citrate synthase 2 (CSY2); FUNCTIONS IN: citrate (SI)-synthase activity; INVOLVED IN: fatty acid beta-oxidation, tricarboxylic acid cycle; LOCATED IN: peroxisome; EXPRESSED IN: 25 plant structures; EXPRESSED DURING: 16 growth stages; CONTAINS InterPro DOMAIN/s: Citrate synthase, type II (InterPro:IPR010953), Citrate synthase-like, large alpha subdomain (InterPro:IPR016142), Citrate synthase active site (InterPro:IPR019810), Citrate synthase-like, core (InterPro:IPR016141), Citrate synthase-like (InterPro:IPR002020); BEST Arabidopsis thaliana protein match is: citrate synthase 3 (TAIR:AT2G42790.1); Has 13448 Blast hits to 13446 proteins in 3190 species: Archae - 173; Bacteria - 8550; Metazoa - 303; Fungi - 319; Plants - 178; Viruses - 0; Other Eukaryotes - 3925 (source: NCBI BLink). & (p49299|cysz_cucma : 314.0) Citrate synthase, glyoxysomal precursor (EC 2.3.3.1) (GCS) - Cucurbita maxima (Pumpkin) (Winter squash) & (reliability: 636.0) & (original description: no original description) 0.9035493073097964 57 evm.model.contig_2070.8 no hits & (original description: no original description) 0.9029316070611002 62 evm.model.contig_4445.5 no hits & (original description: no original description) 0.9027628384882702 75 evm.model.contig_3410.4 no hits & (original description: no original description) 0.9023172234121591 60 evm.model.contig_3464.2 (at4g17300 : 185.0) Asparaginyl-tRNA synthetase protein involved in amino acid activation/protein synthesis.; NS1; FUNCTIONS IN: asparagine-tRNA ligase activity; INVOLVED IN: asparaginyl-tRNA aminoacylation, ovule development; LOCATED IN: mitochondrion, chloroplast; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Nucleic acid binding, OB-fold, tRNA/helicase-type (InterPro:IPR004365), Asparaginyl-tRNA synthetase, class IIb (InterPro:IPR004522), Aminoacyl-tRNA synthetase, class II, conserved domain (InterPro:IPR006195), Aspartyl/Asparaginyl-tRNA synthetase, class IIb (InterPro:IPR002312), Nucleic acid-binding, OB-fold-like (InterPro:IPR016027), Aminoacyl-tRNA synthetase, class II (D/K/N) (InterPro:IPR004364), Aminoacyl-tRNA synthetase, class II (D/K/N)-like (InterPro:IPR018150); BEST Arabidopsis thaliana protein match is: Class II aminoacyl-tRNA and biotin synthetases superfamily protein (TAIR:AT1G70980.1); Has 19374 Blast hits to 17086 proteins in 2835 species: Archae - 447; Bacteria - 14373; Metazoa - 505; Fungi - 670; Plants - 294; Viruses - 0; Other Eukaryotes - 3085 (source: NCBI BLink). & (reliability: 370.0) & (original description: no original description) 0.9017557785697756 62 evm.model.contig_2480.3 no hits & (original description: no original description) 0.901536167558734 84 evm.model.contig_2034.9 (at5g42970 : 231.0) encodes subunit 4 of COP9 signalosome complex. sequence is similar to a subunit of the 19S regulatory particle of the 26S proteasome. recessive mutation causes derepression of photomorphogenesis.; CONSTITUTIVE PHOTOMORPHOGENIC 8 (COP8); FUNCTIONS IN: protein binding; INVOLVED IN: cullin deneddylation, negative regulation of photomorphogenesis, G2 phase of mitotic cell cycle, photomorphogenesis; LOCATED IN: signalosome; EXPRESSED IN: 24 plant structures; EXPRESSED DURING: 15 growth stages; CONTAINS InterPro DOMAIN/s: Winged helix-turn-helix transcription repressor DNA-binding (InterPro:IPR011991), Proteasome component (PCI) domain (InterPro:IPR000717); BEST Arabidopsis thaliana protein match is: regulatory particle non-ATPase subunit 5B (TAIR:AT5G64760.2); Has 1807 Blast hits to 1807 proteins in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink). & (reliability: 462.0) & (original description: no original description) 0.9014792965846381 64 evm.model.contig_2130.1 no hits & (original description: no original description) 0.9014727822553045 65 evm.model.contig_471.6 no hits & (original description: no original description) 0.9011240621747076 66 evm.model.contig_653.2 no hits & (original description: no original description) 0.9010906162395158 67 evm.model.contig_3385.8 no hits & (original description: no original description) 0.9010025344798986 68 evm.model.contig_2100.3 no hits & (original description: no original description) 0.9005065648486144 69 evm.model.contig_579.8 no hits & (original description: no original description) 0.8994191088892648 72 evm.model.contig_4447.6 (original description: no original description) 0.8988078054010702 97 evm.model.contig_3426.2 no hits & (original description: no original description) 0.8985006240010333 74 evm.model.contig_4429.7 no hits & (original description: no original description) 0.8977648149637757 75 evm.model.contig_2293.13 no hits & (original description: no original description) 0.8970674800557626 78 evm.model.contig_2083.15 (at3g02870 : 126.0) Encodes a L-galactose-1-phosphate phosphatase, involved in ascorbate biosynthesis.; VTC4; FUNCTIONS IN: 3'(2'),5'-bisphosphate nucleotidase activity, L-galactose-1-phosphate phosphatase activity, inositol or phosphatidylinositol phosphatase activity, inositol-1(or 4)-monophosphatase activity; INVOLVED IN: sulfur metabolic process, L-ascorbic acid biosynthetic process, response to karrikin, response to cold, inositol biosynthetic process; LOCATED IN: plasma membrane; EXPRESSED IN: 25 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Inositol monophosphatase, conserved site (InterPro:IPR020550), Inositol monophosphatase (InterPro:IPR000760), Inositol monophosphatase, Lithium-sensitive (InterPro:IPR020552), Inositol monophosphatase, metal-binding site (InterPro:IPR020583); BEST Arabidopsis thaliana protein match is: myo-inositol monophosphatase like 1 (TAIR:AT1G31190.1); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). & (o49071|impp_mescr : 114.0) Inositol monophosphatase (EC 3.1.3.25) (IMPase) (IMP) (Inositol-1(or 4)-monophosphatase) - Mesembryanthemum crystallinum (Common ice plant) & (reliability: 252.0) & (original description: no original description) 0.8966773968455193 78 evm.model.contig_4536.3 (at5g40440 : 176.0) encodes a mitogen-activated protein kinase kinase; mitogen-activated protein kinase kinase 3 (MKK3); CONTAINS InterPro DOMAIN/s: Protein kinase, ATP binding site (InterPro:IPR017441), Nuclear transport factor 2, Eukaryote (InterPro:IPR018222), Protein kinase, catalytic domain (InterPro:IPR000719), Serine/threonine-protein kinase domain (InterPro:IPR002290), Serine/threonine-protein kinase-like domain (InterPro:IPR017442), Protein kinase-like domain (InterPro:IPR011009), Serine/threonine-protein kinase, active site (InterPro:IPR008271); BEST Arabidopsis thaliana protein match is: MAP kinase kinase 6 (TAIR:AT5G56580.1); Has 122843 Blast hits to 121436 proteins in 4056 species: Archae - 133; Bacteria - 13736; Metazoa - 45702; Fungi - 12099; Plants - 30883; Viruses - 510; Other Eukaryotes - 19780 (source: NCBI BLink). & (q5qn75|m2k1_orysa : 159.0) Mitogen-activated protein kinase kinase 1 (EC 2.7.12.2) (MAP kinase kinase 1) (MAPKK1) (OsMEK1) - Oryza sativa (Rice) & (reliability: 346.0) & (original description: no original description) 0.896311134361632 79 evm.model.contig_2192.4 no hits & (original description: no original description) 0.8959279821434801 80 evm.model.contig_508.2 (at1g55090 : 680.0) carbon-nitrogen hydrolase family protein; FUNCTIONS IN: hydrolase activity, acting on carbon-nitrogen (but not peptide) bonds, NAD+ synthase (glutamine-hydrolyzing) activity, ATP binding; INVOLVED IN: nitrogen compound metabolic process, NAD biosynthetic process; LOCATED IN: cellular_component unknown; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Rossmann-like alpha/beta/alpha sandwich fold (InterPro:IPR014729), Nitrilase/cyanide hydratase and apolipoprotein N-acyltransferase (InterPro:IPR003010), NAD synthase (InterPro:IPR003694), Glutamine-dependent NAD(+) synthetase, GAT domain-containing (InterPro:IPR014445), NAD/GMP synthase (InterPro:IPR022310); Has 5923 Blast hits to 5903 proteins in 2409 species: Archae - 233; Bacteria - 4478; Metazoa - 145; Fungi - 142; Plants - 70; Viruses - 0; Other Eukaryotes - 855 (source: NCBI BLink). & (reliability: 1360.0) & (original description: no original description) 0.8959102263299595 81 evm.model.contig_482.11 (at5g66680 : 223.0) Encodes a protein ortholog of human SOT48 or yeast WBP1, an essential protein subunit of the oligosaccharyltransferase (OST) complex, which is responsible for the transfer in the ER of the N-linked glycan precursor onto Asn residues of candidate proteins.; DEFECTIVE GLYCOSYLATION (DGL1); FUNCTIONS IN: dolichyl-diphosphooligosaccharide-protein glycotransferase activity; INVOLVED IN: plant-type cell wall organization, protein amino acid N-linked glycosylation via asparagine, unidimensional cell growth; LOCATED IN: in 8 components; EXPRESSED IN: 26 plant structures; EXPRESSED DURING: 15 growth stages; CONTAINS InterPro DOMAIN/s: Dolichyl-diphosphooligosaccharide-protein glycosyltransferase 48kDa subunit (InterPro:IPR005013); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). & (reliability: 446.0) & (original description: no original description) 0.8953441236154309 82 evm.model.contig_4419.3 (at1g27980 : 331.0) dihydrosphingosine phosphate lyase (DPL1); FUNCTIONS IN: pyridoxal phosphate binding, carboxy-lyase activity, catalytic activity; INVOLVED IN: sphingolipid catabolic process, cellular amino acid metabolic process; LOCATED IN: endoplasmic reticulum, membrane; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Pyridoxal phosphate-dependent transferase, major domain (InterPro:IPR015424), Pyridoxal phosphate-dependent decarboxylase (InterPro:IPR002129), Pyridoxal phosphate-dependent transferase, major region, subdomain 1 (InterPro:IPR015421); BEST Arabidopsis thaliana protein match is: glutamate decarboxylase (TAIR:AT5G17330.1); Has 6215 Blast hits to 6205 proteins in 1686 species: Archae - 244; Bacteria - 4383; Metazoa - 292; Fungi - 491; Plants - 296; Viruses - 3; Other Eukaryotes - 506 (source: NCBI BLink). & (q52rg7|sgpl_orysa : 316.0) Sphingosine-1-phosphate lyase precursor (EC 4.1.2.27) (SP-lyase) (SPL) (Sphingosine-1-phosphate aldolase) - Oryza sativa (Rice) & (reliability: 662.0) & (original description: no original description) 0.8953361712296595 83 evm.model.contig_2100.7 no hits & (original description: no original description) 0.8952692909512923 91 evm.model.contig_2225.3 no hits & (original description: no original description) 0.8949983979451451 87 evm.model.contig_3421.4 (at2g26930 : 233.0) Encodes a 4-(cytidine 5'-phospho)-2-C-methyl-D-erithritol kinase.; 4-(cytidine 5'-phospho)-2-C-methyl-D-erithritol kinase (CDPMEK); FUNCTIONS IN: 4-(cytidine 5'-diphospho)-2-C-methyl-D-erythritol kinase activity; INVOLVED IN: response to light stimulus; LOCATED IN: chloroplast; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: 4-diphosphocytidyl-2C-methyl-D-erythritol kinase (InterPro:IPR004424), Ribosomal protein S5 domain 2-type fold (InterPro:IPR020568), GHMP kinase (InterPro:IPR006204), Ribosomal protein S5 domain 2-type fold, subgroup (InterPro:IPR014721), GHMP kinase, C-terminal (InterPro:IPR013750); Has 6617 Blast hits to 6617 proteins in 2226 species: Archae - 3; Bacteria - 4583; Metazoa - 0; Fungi - 2; Plants - 69; Viruses - 0; Other Eukaryotes - 1960 (source: NCBI BLink). & (reliability: 466.0) & (original description: no original description) 0.8948411574536063 88 evm.model.contig_4452.1 (at2g41790 : 661.0) Insulinase (Peptidase family M16) family protein; FUNCTIONS IN: metalloendopeptidase activity, zinc ion binding, catalytic activity, metal ion binding; INVOLVED IN: proteolysis; LOCATED IN: cellular_component unknown; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Peptidase M16, zinc-binding site (InterPro:IPR001431), Peptidase M16, C-terminal (InterPro:IPR007863), Peptidase M16, N-terminal (InterPro:IPR011765), Metalloenzyme, LuxS/M16 peptidase-like, metal-binding (InterPro:IPR011249), Peptidase M16, core (InterPro:IPR011237); BEST Arabidopsis thaliana protein match is: Insulinase (Peptidase family M16) family protein (TAIR:AT3G57470.2); Has 9660 Blast hits to 9541 proteins in 2186 species: Archae - 9; Bacteria - 6247; Metazoa - 831; Fungi - 633; Plants - 271; Viruses - 3; Other Eukaryotes - 1666 (source: NCBI BLink). & (reliability: 1322.0) & (original description: no original description) 0.8941783085457441 90 evm.model.contig_3427.8 (at4g34450 : 528.0) coatomer gamma-2 subunit, putative / gamma-2 coat protein, putative / gamma-2 COP, putative; FUNCTIONS IN: clathrin binding, structural molecule activity, binding; INVOLVED IN: intracellular protein transport, vesicle-mediated transport; LOCATED IN: chloroplast, membrane; EXPRESSED IN: 26 plant structures; EXPRESSED DURING: 15 growth stages; CONTAINS InterPro DOMAIN/s: Coatomer, gamma subunit, appendage, Ig-like subdomain (InterPro:IPR013040), Armadillo-like helical (InterPro:IPR011989), Clathrin/coatomer adaptor, adaptin-like, N-terminal (InterPro:IPR002553), Coatomer, gamma subunit (InterPro:IPR017106), Coatomer, gamma subunit , appendage (InterPro:IPR014863), Armadillo-type fold (InterPro:IPR016024), Clathrin alpha-adaptin/coatomer adaptor, appendage, C-terminal subdomain (InterPro:IPR015873), Clathrin/coatomer adaptor, adaptin-like, appendage, C-terminal subdomain (InterPro:IPR009028), Clathrin/coatomer adaptor, adaptin-like, appendage, Ig-like subdomain (InterPro:IPR013041); BEST Arabidopsis thaliana protein match is: structural molecules (TAIR:AT2G16200.1); Has 1647 Blast hits to 1638 proteins in 222 species: Archae - 2; Bacteria - 2; Metazoa - 707; Fungi - 446; Plants - 176; Viruses - 0; Other Eukaryotes - 314 (source: NCBI BLink). & (reliability: 1056.0) & (original description: no original description) 0.8935493609719749 93 evm.model.contig_489.1 no hits & (original description: no original description) 0.8909705951637548 99 evm.model.contig_2044.22 no hits & (original description: no original description) 0.8908385756076318 100