Sequence Description Alias PCC hrr evm.model.contig_558.7 (at1g53800 : 82.4) unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G53250.1); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). & (reliability: 164.8) & (original description: no original description) 0.95558449563743 15 evm.model.contig_4487.8 (at5g64940 : 121.0) Encodes a member of ATH subfamily of ATP-binding cassette (ABC) proteins.; ABC2 homolog 13 (ATH13); FUNCTIONS IN: transporter activity; INVOLVED IN: transport; LOCATED IN: chloroplast, chloroplast envelope; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: ABC-1 (InterPro:IPR004147), Protein kinase-like domain (InterPro:IPR011009); BEST Arabidopsis thaliana protein match is: Protein kinase superfamily protein (TAIR:AT3G07700.2); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). & (reliability: 242.0) & (original description: no original description) 0.9526398592769533 14 evm.model.contig_2056.2 (at1g80410 : 491.0) EMBRYO DEFECTIVE 2753 (EMB2753); FUNCTIONS IN: binding; INVOLVED IN: embryo development ending in seed dormancy; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 14 growth stages; CONTAINS InterPro DOMAIN/s: Tetratricopeptide TPR-1 (InterPro:IPR001440), Tetratricopeptide-like helical (InterPro:IPR011990), Tetratricopeptide repeat-containing (InterPro:IPR013026), N-terminal acetyltransferase A, auxiliary subunit (InterPro:IPR021183), Tetratricopeptide repeat (InterPro:IPR019734). & (reliability: 982.0) & (original description: no original description) 0.9503685871418008 83 evm.model.contig_3545.3 no hits & (original description: no original description) 0.9488452322641491 31 evm.model.contig_3435.4 no hits & (original description: no original description) 0.947197191224585 8 evm.model.contig_2098.2 (at5g27620 : 88.6) core cell cycle genes; cyclin H;1 (CYCH;1); CONTAINS InterPro DOMAIN/s: Cyclin H (InterPro:IPR015432), Cyclin-like (InterPro:IPR011028), Transcription regulator cyclin (InterPro:IPR015429), Cyclin-related (InterPro:IPR013763), Cyclin, N-terminal (InterPro:IPR006671), Cyclin (InterPro:IPR006670); BEST Arabidopsis thaliana protein match is: Cyclin family protein (TAIR:AT5G48640.1); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). & (reliability: 177.2) & (original description: no original description) 0.945496441469189 8 evm.model.contig_2008.5 no hits & (original description: no original description) 0.9439574264512287 9 evm.model.contig_4428.3 (at3g25585 : 181.0) aminoalcoholphosphotransferase (AAPT2) mRNA, complete cds; aminoalcoholphosphotransferase (AAPT2); FUNCTIONS IN: phosphatidyltransferase activity, phosphotransferase activity, for other substituted phosphate groups; INVOLVED IN: phospholipid biosynthetic process; LOCATED IN: membrane; EXPRESSED IN: 24 plant structures; EXPRESSED DURING: 15 growth stages; CONTAINS InterPro DOMAIN/s: Choline/ethanolaminephosphotransferase (InterPro:IPR014472), CDP-alcohol phosphatidyltransferase (InterPro:IPR000462); BEST Arabidopsis thaliana protein match is: aminoalcoholphosphotransferase 1 (TAIR:AT1G13560.1); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). & (reliability: 362.0) & (original description: no original description) 0.9436450152459219 58 evm.model.contig_2059.3 (at5g42400 : 144.0) Encodes ATXR7 (ARABIDOPSIS TRITHORAX-RELATED7), required for histone H3-K4 methylation and for transcriptional activation of Flowering Locus C.; SET domain protein 25 (SDG25); CONTAINS InterPro DOMAIN/s: SET domain (InterPro:IPR001214), GYF (InterPro:IPR003169); BEST Arabidopsis thaliana protein match is: homologue of trithorax (TAIR:AT2G31650.1); Has 5838 Blast hits to 5683 proteins in 501 species: Archae - 3; Bacteria - 461; Metazoa - 2434; Fungi - 507; Plants - 1016; Viruses - 2; Other Eukaryotes - 1415 (source: NCBI BLink). & (reliability: 288.0) & (original description: no original description) 0.9434039282797341 87 evm.model.contig_2085.5 (at5g63890 : 478.0) Encodes histidinol dehydrogenase. Up-regulated in response to UV-B.; histidinol dehydrogenase (HDH); FUNCTIONS IN: histidinol dehydrogenase activity; INVOLVED IN: response to UV, pollen development; LOCATED IN: chloroplast, chloroplast stroma; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 14 growth stages; CONTAINS InterPro DOMAIN/s: Aldehyde/histidinol dehydrogenase (InterPro:IPR016161), Histidinol dehydrogenase, conserved site (InterPro:IPR001692), Histidinol dehydrogenase, prokaryotic-type (InterPro:IPR012131); Has 9146 Blast hits to 9146 proteins in 2211 species: Archae - 179; Bacteria - 4194; Metazoa - 4; Fungi - 211; Plants - 72; Viruses - 0; Other Eukaryotes - 4486 (source: NCBI BLink). & (q5nay4|hisx_orysa : 471.0) Histidinol dehydrogenase, chloroplast precursor (EC 1.1.1.23) (HDH) - Oryza sativa (Rice) & (reliability: 956.0) & (original description: no original description) 0.9430719634501478 35 evm.model.contig_2090.34 (q58fk4|ard2_orysa : 213.0) 1,2-dihydroxy-3-keto-5-methylthiopentene dioxygenase 2 (EC 1.13.-.-) (Aci-reductone dioxygenase 2) (Submergence-induced protein 2A) - Oryza sativa (Rice) & (at5g43850 : 209.0) ARD4; FUNCTIONS IN: acireductone dioxygenase [iron(II)-requiring] activity, metal ion binding; INVOLVED IN: L-methionine salvage from methylthioadenosine, oxidation reduction; LOCATED IN: cellular_component unknown; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Acireductone dioxygenase, ARD (InterPro:IPR004313), Cupin, RmlC-type (InterPro:IPR011051), RmlC-like jelly roll fold (InterPro:IPR014710); BEST Arabidopsis thaliana protein match is: RmlC-like cupins superfamily protein (TAIR:AT4G14710.2); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). & (reliability: 416.0) & (original description: no original description) 0.9382024273529327 39 evm.model.contig_2083.8 no hits & (original description: no original description) 0.934960369354063 37 evm.model.contig_485.2 (at4g21770 : 142.0) Pseudouridine synthase family protein; FUNCTIONS IN: pseudouridine synthase activity; INVOLVED IN: pseudouridine synthesis, RNA modification; LOCATED IN: chloroplast; CONTAINS InterPro DOMAIN/s: Pseudouridine synthase, catalytic domain (InterPro:IPR020103), Pseudouridine synthase, RsuA and RluB/C/D/E/F (InterPro:IPR006145); Has 5935 Blast hits to 5933 proteins in 1806 species: Archae - 0; Bacteria - 4596; Metazoa - 112; Fungi - 68; Plants - 78; Viruses - 0; Other Eukaryotes - 1081 (source: NCBI BLink). & (reliability: 284.0) & (original description: no original description) 0.9327151116211871 44 evm.model.contig_2194.8 (at2g31170 : 261.0) SYCO ARATH; FUNCTIONS IN: cysteine-tRNA ligase activity, nucleotide binding, aminoacyl-tRNA ligase activity, ATP binding; INVOLVED IN: cysteinyl-tRNA aminoacylation, translation, tRNA aminoacylation for protein translation; LOCATED IN: mitochondrion, chloroplast; EXPRESSED IN: 21 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Rossmann-like alpha/beta/alpha sandwich fold (InterPro:IPR014729), Cysteinyl-tRNA synthetase, class Ia (InterPro:IPR002308), Cysteinyl-tRNA synthetase, class Ia, N-terminal (InterPro:IPR015803), Cysteinyl-tRNA synthetase, class Ia, DALR (InterPro:IPR015273), Aminoacyl-tRNA synthetase, class 1a, anticodon-binding (InterPro:IPR009080), Cysteinyl-tRNA synthetase, class Ia, C-terminal (InterPro:IPR015804); BEST Arabidopsis thaliana protein match is: Cysteinyl-tRNA synthetase, class Ia family protein (TAIR:AT5G38830.1); Has 10676 Blast hits to 10676 proteins in 2860 species: Archae - 252; Bacteria - 6117; Metazoa - 332; Fungi - 154; Plants - 139; Viruses - 3; Other Eukaryotes - 3679 (source: NCBI BLink). & (reliability: 522.0) & (original description: no original description) 0.9323723235738425 79 evm.model.contig_3693.8 no hits & (original description: no original description) 0.9302398021589234 76 evm.model.contig_444.15 no hits & (original description: no original description) 0.9294308127364918 92 evm.model.contig_3452.7 (at3g55160 : 117.0) unknown protein; EXPRESSED IN: 11 plant structures; EXPRESSED DURING: 4 anthesis, F mature embryo stage, petal differentiation and expansion stage, E expanded cotyledon stage, D bilateral stage; CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF2428, death-receptor-like (InterPro:IPR019442); Has 357 Blast hits to 330 proteins in 163 species: Archae - 0; Bacteria - 0; Metazoa - 144; Fungi - 118; Plants - 50; Viruses - 0; Other Eukaryotes - 45 (source: NCBI BLink). & (reliability: 234.0) & (original description: no original description) 0.9286822152381045 35 evm.model.contig_546.1 (at4g10320 : 1110.0) tRNA synthetase class I (I, L, M and V) family protein; FUNCTIONS IN: isoleucine-tRNA ligase activity, nucleotide binding, aminoacyl-tRNA ligase activity, zinc ion binding, ATP binding; INVOLVED IN: response to cadmium ion, tRNA aminoacylation for protein translation; LOCATED IN: cytosol; EXPRESSED IN: male gametophyte, guard cell, epidermis, cultured cell, pollen tube; EXPRESSED DURING: L mature pollen stage, M germinated pollen stage; CONTAINS InterPro DOMAIN/s: Aminoacyl-tRNA synthetase, class I, conserved site (InterPro:IPR001412), Isoleucyl-tRNA synthetase (InterPro:IPR018353), Isoleucyl-tRNA synthetase, class Ia (InterPro:IPR002301), Aminoacyl-tRNA synthetase, class 1a, anticodon-binding (InterPro:IPR009080), Rossmann-like alpha/beta/alpha sandwich fold (InterPro:IPR014729), Isoleucyl-tRNA synthetase, class Ia, N-terminal (InterPro:IPR015905), Valyl/Leucyl/Isoleucyl-tRNA synthetase, class I, anticodon-binding (InterPro:IPR013155), Valyl/Leucyl/Isoleucyl-tRNA synthetase, class Ia, editing (InterPro:IPR009008), Aminoacyl-tRNA synthetase, class Ia (InterPro:IPR002300); BEST Arabidopsis thaliana protein match is: tRNA synthetase class I (I, L, M and V) family protein (TAIR:AT5G49030.3); Has 38868 Blast hits to 32849 proteins in 3074 species: Archae - 1055; Bacteria - 22228; Metazoa - 780; Fungi - 735; Plants - 304; Viruses - 0; Other Eukaryotes - 13766 (source: NCBI BLink). & (reliability: 2220.0) & (original description: no original description) 0.9285733496807689 98 evm.model.contig_2025.10 no hits & (original description: no original description) 0.9277189057939473 33 evm.model.contig_3444.1 (original description: no original description) 0.9273071847478797 75 evm.model.contig_579.5 (at3g13070 : 184.0) CBS domain-containing protein / transporter associated domain-containing protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF21 (InterPro:IPR002550), Transporter-associated domain (InterPro:IPR005170), Cystathionine beta-synthase, core (InterPro:IPR000644); BEST Arabidopsis thaliana protein match is: CBS domain-containing protein / transporter associated domain-containing protein (TAIR:AT1G55930.1); Has 15808 Blast hits to 15803 proteins in 2590 species: Archae - 162; Bacteria - 11531; Metazoa - 244; Fungi - 136; Plants - 197; Viruses - 0; Other Eukaryotes - 3538 (source: NCBI BLink). & (reliability: 368.0) & (original description: no original description) 0.9262506666581414 55 evm.model.contig_2033.14 no hits & (original description: no original description) 0.9261601875065276 90 evm.model.contig_522.23 no hits & (original description: no original description) 0.9258135166388555 85 evm.model.contig_3396.10 no hits & (original description: no original description) 0.9224431215997848 82 evm.model.contig_448.13 (at1g79440 : 331.0) Encodes a mitochondrial succinic semialdehyde dehydrogenase (SSADH). Nomenclature according to Kirch, et al (2004).; aldehyde dehydrogenase 5F1 (ALDH5F1); FUNCTIONS IN: 3-chloroallyl aldehyde dehydrogenase activity, NAD or NADH binding, copper ion binding, succinate-semialdehyde dehydrogenase activity; INVOLVED IN: in 6 processes; LOCATED IN: mitochondrion, chloroplast, mitochondrial matrix; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Aldehyde/histidinol dehydrogenase (InterPro:IPR016161), Aldehyde dehydrogenase (InterPro:IPR015590), Aldehyde dehydrogenase, N-terminal (InterPro:IPR016162), Aldehyde dehydrogenase, conserved site (InterPro:IPR016160), Succinic semialdehyde dehydrogenase (InterPro:IPR010102); BEST Arabidopsis thaliana protein match is: aldehyde dehydrogenase 2B4 (TAIR:AT3G48000.1); Has 62487 Blast hits to 62143 proteins in 3037 species: Archae - 481; Bacteria - 36218; Metazoa - 2614; Fungi - 2131; Plants - 1502; Viruses - 0; Other Eukaryotes - 19541 (source: NCBI BLink). & (p17202|badh_spiol : 220.0) Betaine-aldehyde dehydrogenase, chloroplast precursor (EC 1.2.1.8) (BADH) - Spinacia oleracea (Spinach) & (reliability: 662.0) & (original description: no original description) 0.9214401922779684 49 evm.model.contig_2284.20 (at5g53770 : 133.0) Nucleotidyltransferase family protein; FUNCTIONS IN: nucleic acid binding, nucleotidyltransferase activity; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: 6 plant structures; EXPRESSED DURING: 4 anthesis, F mature embryo stage, petal differentiation and expansion stage, E expanded cotyledon stage, D bilateral stage; CONTAINS InterPro DOMAIN/s: Nucleotidyl transferase domain (InterPro:IPR002934), PAP/25A-associated (InterPro:IPR002058); BEST Arabidopsis thaliana protein match is: Nucleotidyltransferase family protein (TAIR:AT4G00060.1); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). & (reliability: 266.0) & (original description: no original description) 0.9192390087798308 89 evm.model.contig_2108.8 no hits & (original description: no original description) 0.9191050977298555 54 evm.model.contig_4464.7 (at3g13230 : 239.0) RNA-binding KH domain-containing protein; FUNCTIONS IN: RNA binding; LOCATED IN: cellular_component unknown; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: K Homology, type 1, subgroup (InterPro:IPR018111), K Homology (InterPro:IPR004087); Has 734 Blast hits to 734 proteins in 323 species: Archae - 189; Bacteria - 0; Metazoa - 158; Fungi - 192; Plants - 83; Viruses - 0; Other Eukaryotes - 112 (source: NCBI BLink). & (reliability: 478.0) & (original description: no original description) 0.9189515282269077 80 evm.model.contig_2285.2 (at2g28305 : 84.0) LONELY GUY 1 (LOG1); CONTAINS InterPro DOMAIN/s: Conserved hypothetical protein CHP00730 (InterPro:IPR005269); BEST Arabidopsis thaliana protein match is: lysine decarboxylase family protein (TAIR:AT2G37210.1); Has 5303 Blast hits to 5301 proteins in 1556 species: Archae - 26; Bacteria - 3660; Metazoa - 10; Fungi - 132; Plants - 396; Viruses - 0; Other Eukaryotes - 1079 (source: NCBI BLink). & (reliability: 167.2) & (original description: no original description) 0.9170650477658312 63 evm.model.contig_526.1 (at1g05000 : 203.0) Phosphotyrosine protein phosphatases superfamily protein; FUNCTIONS IN: phosphatase activity, protein tyrosine phosphatase activity, phosphoprotein phosphatase activity; INVOLVED IN: dephosphorylation; LOCATED IN: cellular_component unknown; EXPRESSED IN: 12 plant structures; EXPRESSED DURING: 8 growth stages; CONTAINS InterPro DOMAIN/s: Protein-tyrosine phosphatase, active site (InterPro:IPR016130), Protein-tyrosine phosphatase, dual specificity phosphatase, eukaryotic (InterPro:IPR020428), Protein-tyrosine phosphatase, SIW14-like (InterPro:IPR004861); BEST Arabidopsis thaliana protein match is: Phosphotyrosine protein phosphatases superfamily protein (TAIR:AT2G32960.1); Has 580 Blast hits to 572 proteins in 119 species: Archae - 0; Bacteria - 14; Metazoa - 1; Fungi - 314; Plants - 145; Viruses - 0; Other Eukaryotes - 106 (source: NCBI BLink). & (reliability: 406.0) & (original description: no original description) 0.9170244417626755 99 evm.model.contig_2015.10 (at1g56050 : 425.0) GTP-binding protein-related; FUNCTIONS IN: GTP binding; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast, chloroplast stroma; EXPRESSED IN: 21 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF933 (InterPro:IPR013029), TGS-like (InterPro:IPR012676), GTP1/OBG (InterPro:IPR006073), Conserved hypothetical protein CHP00092 (InterPro:IPR004396), GTP-binding protein, HSR1-related (InterPro:IPR002917), Beta-grasp fold, ferredoxin-type (InterPro:IPR012675); BEST Arabidopsis thaliana protein match is: GTP binding (TAIR:AT1G30580.1); Has 18400 Blast hits to 18396 proteins in 3002 species: Archae - 377; Bacteria - 10244; Metazoa - 785; Fungi - 603; Plants - 304; Viruses - 0; Other Eukaryotes - 6087 (source: NCBI BLink). & (reliability: 850.0) & (original description: no original description) 0.9158827157764451 87 evm.model.contig_444.33 no hits & (original description: no original description) 0.9142614379298357 78 evm.model.contig_436.10 (at2g39990 : 133.0) translation initiation factor eIF2 p47 subunit homolog; eukaryotic translation initiation factor 2 (EIF2); FUNCTIONS IN: translation initiation factor activity; INVOLVED IN: pollen germination, translational initiation, embryo development; LOCATED IN: eukaryotic translation initiation factor 3 complex, nucleus, membrane, cytoplasm; EXPRESSED IN: 26 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Mov34/MPN/PAD-1 (InterPro:IPR000555); BEST Arabidopsis thaliana protein match is: Mov34/MPN/PAD-1 family protein (TAIR:AT3G11270.2); Has 1103 Blast hits to 1103 proteins in 237 species: Archae - 0; Bacteria - 0; Metazoa - 465; Fungi - 269; Plants - 187; Viruses - 0; Other Eukaryotes - 182 (source: NCBI BLink). & (reliability: 266.0) & (original description: no original description) 0.9140952090569887 79 evm.model.contig_2340.8 no hits & (original description: no original description) 0.9130082906049447 87 evm.model.contig_4436.3 no hits & (original description: no original description) 0.9126612730372079 89 evm.model.contig_2173.3 no hits & (original description: no original description) 0.9121664454342968 92