Sequence Description Alias PCC hrr
evm.model.contig_2626.1 (at5g12040 : 297.0) Nitrilase/cyanide hydratase and apolipoprotein N-acyltransferase family protein; FUNCTIONS IN: hydrolase activity, acting on carbon-nitrogen (but not peptide) bonds, zinc ion binding; INVOLVED IN: nitrogen compound metabolic process; LOCATED IN: chloroplast; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Nitrilase/cyanide hydratase and apolipoprotein N-acyltransferase (InterPro:IPR003010); BEST Arabidopsis thaliana protein match is: Nitrilase/cyanide hydratase and apolipoprotein N-acyltransferase family protein (TAIR:AT4G08790.1); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). & (q3hvn1|agub_soltu : 87.8) N-carbamoylputrescine amidase (EC 3.5.1.53) - Solanum tuberosum (Potato) & (reliability: 594.0) & (original description: no original description) 0.9203080278955696 1
evm.model.contig_512.12 no hits & (original description: no original description) 0.9034153831055314 3
evm.model.contig_2468.8 no hits & (original description: no original description) 0.8974250023988873 9
evm.model.contig_2396.3 no hits & (original description: no original description) 0.8957062362437554 4
evm.model.contig_2169.5 no hits & (original description: no original description) 0.8822750868496936 6
evm.model.contig_815.2 no hits & (original description: no original description) 0.8816614926145366 6
evm.model.contig_477.4 no hits & (original description: no original description) 0.8810639641311899 7
evm.model.contig_2175.3 (at5g13070 : 85.5) MSF1-like family protein; CONTAINS InterPro DOMAIN/s: PRELI/MSF1 (InterPro:IPR006797); Has 1807 Blast hits to 1807 proteins in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink). & (reliability: 171.0) & (original description: no original description) 0.8752497405605391 8
evm.model.contig_472.16 (at3g54210 : 159.0) Ribosomal protein L17 family protein; FUNCTIONS IN: structural constituent of ribosome; INVOLVED IN: translation; LOCATED IN: ribosome, chloroplast; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Ribosomal protein L17 (InterPro:IPR000456); BEST Arabidopsis thaliana protein match is: Ribosomal protein L17 family protein (TAIR:AT5G09770.1); Has 8019 Blast hits to 8019 proteins in 2737 species: Archae - 0; Bacteria - 5501; Metazoa - 121; Fungi - 128; Plants - 120; Viruses - 0; Other Eukaryotes - 2149 (source: NCBI BLink). & (o80363|rk17_tobac : 156.0) 50S ribosomal protein L17, chloroplast precursor (CL17) - Nicotiana tabacum (Common tobacco) & (reliability: 318.0) & (original description: no original description) 0.8735551570766819 9
evm.model.contig_2687.2 (at5g02710 : 112.0) unknown protein; LOCATED IN: chloroplast; EXPRESSED IN: 21 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Uncharacterised protein family UPF0153 (InterPro:IPR005358); Has 240 Blast hits to 240 proteins in 73 species: Archae - 10; Bacteria - 110; Metazoa - 0; Fungi - 0; Plants - 25; Viruses - 0; Other Eukaryotes - 95 (source: NCBI BLink). & (reliability: 224.0) & (original description: no original description) 0.8667435267987225 18
evm.model.contig_2012.1 (at5g66120 : 352.0) 3-dehydroquinate synthase, putative; FUNCTIONS IN: 3-dehydroquinate synthase activity; INVOLVED IN: aromatic amino acid family biosynthetic process; LOCATED IN: chloroplast; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 14 growth stages; CONTAINS InterPro DOMAIN/s: 3-dehydroquinate synthase AroB, subgroup (InterPro:IPR016037), 3-dehydroquinate synthase AroB (InterPro:IPR002658); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). & (reliability: 704.0) & (original description: no original description) 0.8626841411626848 37
evm.model.contig_2027.6 no hits & (original description: no original description) 0.8623665246666801 27
evm.model.contig_601.3 no hits & (original description: no original description) 0.8603277415268069 17
evm.model.contig_3423.39 (at4g29060 : 100.0) embryo defective 2726 (emb2726); FUNCTIONS IN: RNA binding, translation elongation factor activity; INVOLVED IN: translational elongation, response to cadmium ion, embryo development ending in seed dormancy; LOCATED IN: chloroplast; EXPRESSED IN: 24 plant structures; EXPRESSED DURING: 14 growth stages; CONTAINS InterPro DOMAIN/s: Nucleic acid-binding, OB-fold (InterPro:IPR012340), Ubiquitin-associated/translation elongation factor EF1B, N-terminal (InterPro:IPR000449), Ribosomal protein S1, RNA-binding domain (InterPro:IPR003029), Translation elongation factor EFTs/EF1B (InterPro:IPR001816), Translation elongation factor EFTs/EF1B, dimerisation (InterPro:IPR014039), Nucleic acid-binding, OB-fold-like (InterPro:IPR016027), Translation elongation factor Ts, conserved site (InterPro:IPR018101), UBA-like (InterPro:IPR009060); BEST Arabidopsis thaliana protein match is: translation elongation factor Ts (EF-Ts), putative (TAIR:AT4G11120.1); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). & (reliability: 200.0) & (original description: no original description) 0.8587220979221865 16
evm.model.contig_606.9 no hits & (original description: no original description) 0.858480270579768 51
evm.model.contig_3452.5 (at3g04710 : 100.0) ankyrin repeat family protein; FUNCTIONS IN: binding; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 14 growth stages; CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF1685 (InterPro:IPR012881), Tetratricopeptide-like helical (InterPro:IPR011990), Ankyrin repeat-containing domain (InterPro:IPR020683), Tetratricopeptide repeat-containing (InterPro:IPR013026), Tetratricopeptide repeat (InterPro:IPR019734), Ankyrin repeat (InterPro:IPR002110); BEST Arabidopsis thaliana protein match is: Protein of unknown function (DUF1685) (TAIR:AT3G04700.1). & (reliability: 200.0) & (original description: no original description) 0.8553520077908063 33
evm.model.contig_690.2 no hits & (original description: no original description) 0.8479101327155054 41
evm.model.contig_541.4 no hits & (original description: no original description) 0.8460927853748224 41
evm.model.contig_2318.1 no hits & (original description: no original description) 0.8420945776712188 19
evm.model.contig_514.1 (at1g53400 : 88.2) Ubiquitin domain-containing protein; INVOLVED IN: N-terminal protein myristoylation; LOCATED IN: cellular_component unknown; EXPRESSED IN: 24 plant structures; EXPRESSED DURING: 15 growth stages; BEST Arabidopsis thaliana protein match is: Ubiquitin domain-containing protein (TAIR:AT5G45740.1); Has 319 Blast hits to 318 proteins in 112 species: Archae - 0; Bacteria - 0; Metazoa - 144; Fungi - 63; Plants - 89; Viruses - 0; Other Eukaryotes - 23 (source: NCBI BLink). & (reliability: 176.4) & (original description: no original description) 0.8409022175909856 57
evm.model.contig_917.1 no hits & (original description: no original description) 0.8392747579459279 36
evm.model.contig_2090.41 no hits & (original description: no original description) 0.824445977431961 26
evm.model.contig_4465.10 no hits & (original description: no original description) 0.823964976981238 76
evm.model.contig_3571.2 (at3g07270 : 113.0) GTP cyclohydrolase I; CONTAINS InterPro DOMAIN/s: GTP cyclohydrolase I/Nitrile oxidoreductase (InterPro:IPR020602); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). & (reliability: 226.0) & (original description: no original description) 0.8237160476806031 24
evm.model.contig_435.3 no hits & (original description: no original description) 0.8230925092613665 25
evm.model.contig_2187.7 no hits & (original description: no original description) 0.821036574824045 26
evm.model.contig_612.2 (at5g61540 : 138.0) N-terminal nucleophile aminohydrolases (Ntn hydrolases) superfamily protein; FUNCTIONS IN: asparaginase activity, hydrolase activity; INVOLVED IN: glycoprotein catabolic process; EXPRESSED IN: 21 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Peptidase T2, asparaginase 2 (InterPro:IPR000246); BEST Arabidopsis thaliana protein match is: N-terminal nucleophile aminohydrolases (Ntn hydrolases) superfamily protein (TAIR:AT3G16150.1); Has 2748 Blast hits to 2738 proteins in 812 species: Archae - 113; Bacteria - 1407; Metazoa - 453; Fungi - 78; Plants - 148; Viruses - 0; Other Eukaryotes - 549 (source: NCBI BLink). & (reliability: 276.0) & (original description: no original description) 0.8163932368982222 90
evm.model.contig_2094.1 no hits & (original description: no original description) 0.8163635670804389 28
evm.model.contig_2386.1 no hits & (original description: no original description) 0.8156952029510194 53
evm.model.contig_4416.15 no hits & (original description: no original description) 0.814844159530522 30
evm.model.contig_2386.2 no hits & (original description: no original description) 0.8104318175557547 54
evm.model.contig_2041.2 no hits & (original description: no original description) 0.8076260599481918 32
evm.model.contig_436.15 (at3g25920 : 92.4) encodes a plastid ribosomal protein CL15, a constituent of the large subunit of the ribosomal complex; ribosomal protein L15 (RPL15); FUNCTIONS IN: structural constituent of ribosome; INVOLVED IN: translation; LOCATED IN: plastid large ribosomal subunit, chloroplast stroma, chloroplast; EXPRESSED IN: 24 plant structures; EXPRESSED DURING: 15 growth stages; CONTAINS InterPro DOMAIN/s: Ribosomal protein L18e/L15 (InterPro:IPR021131), Ribosomal protein L15, bacterial-type (InterPro:IPR005749), Ribosomal protein L15, conserved site (InterPro:IPR001196); BEST Arabidopsis thaliana protein match is: Ribosomal protein L18e/L15 superfamily protein (TAIR:AT5G64670.1); Has 7744 Blast hits to 7744 proteins in 2658 species: Archae - 15; Bacteria - 5386; Metazoa - 72; Fungi - 86; Plants - 90; Viruses - 0; Other Eukaryotes - 2095 (source: NCBI BLink). & (p31165|rk15_pea : 92.4) 50S ribosomal protein L15, chloroplast precursor (CL15) (Fragment) - Pisum sativum (Garden pea) & (reliability: 184.8) & (original description: no original description) 0.8025085341574622 34
evm.model.contig_2069.2 no hits & (original description: no original description) 0.8022277208174624 35
evm.model.contig_787.2 no hits & (original description: no original description) 0.7956354767673852 37
evm.model.contig_507.2 no hits & (original description: no original description) 0.7895244361996239 70
evm.model.contig_2130.4 no hits & (original description: no original description) 0.7892622229828872 39
evm.model.contig_4506.5 no hits & (original description: no original description) 0.786155715175727 60
evm.model.contig_617.2 (q43843|rpe_soltu : 346.0) Ribulose-phosphate 3-epimerase, chloroplast precursor (EC 5.1.3.1) (Pentose-5-phosphate 3-epimerase) (PPE) (RPE) (R5P3E) (Fragment) - Solanum tuberosum (Potato) & (at5g61410 : 345.0) Arabidopsis thaliana ribulose-5-phosphate-3-epimerase mRNA; D-ribulose-5-phosphate-3-epimerase (RPE); FUNCTIONS IN: ribulose-phosphate 3-epimerase activity, catalytic activity; INVOLVED IN: response to cold, carbohydrate metabolic process, response to nematode, embryo development ending in seed dormancy; LOCATED IN: thylakoid, apoplast, stromule, chloroplast, chloroplast envelope; EXPRESSED IN: 24 plant structures; EXPRESSED DURING: 17 growth stages; CONTAINS InterPro DOMAIN/s: Aldolase-type TIM barrel (InterPro:IPR013785), Ribulose-phosphate 3-epimerase (InterPro:IPR000056), Ribulose-phosphate binding barrel (InterPro:IPR011060); BEST Arabidopsis thaliana protein match is: Aldolase-type TIM barrel family protein (TAIR:AT3G01850.2); Has 1807 Blast hits to 1807 proteins in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink). & (reliability: 690.0) & (original description: no original description) 0.7848729240367122 41
evm.model.contig_2059.14 (at5g56760 : 187.0) Encodes a cytosolic serine O-acetyltransferase involved in sulfur assimilation and cysteine biosynthesis. Expressed in the vascular system.; serine acetyltransferase 1;1 (SERAT1;1); FUNCTIONS IN: serine O-acetyltransferase activity; INVOLVED IN: cysteine biosynthetic process from serine; LOCATED IN: cytosol; EXPRESSED IN: 26 plant structures; EXPRESSED DURING: 15 growth stages; CONTAINS InterPro DOMAIN/s: Hexapeptide transferase, conserved site (InterPro:IPR018357), Serine O-acetyltransferase (InterPro:IPR005881), Trimeric LpxA-like (InterPro:IPR011004), Bacterial transferase hexapeptide repeat (InterPro:IPR001451), Serine acetyltransferase, N-terminal (InterPro:IPR010493); BEST Arabidopsis thaliana protein match is: serine acetyltransferase 2;2 (TAIR:AT3G13110.1); Has 18874 Blast hits to 18857 proteins in 2524 species: Archae - 292; Bacteria - 13784; Metazoa - 5; Fungi - 219; Plants - 250; Viruses - 18; Other Eukaryotes - 4306 (source: NCBI BLink). & (reliability: 374.0) & (original description: no original description) 0.7840971601344312 70
evm.model.contig_4427.7 no hits & (original description: no original description) 0.7830756682096738 96
evm.model.contig_2024.24 no hits & (original description: no original description) 0.7795434675080867 45
evm.model.contig_765.2 no hits & (original description: no original description) 0.771411708465941 100
evm.model.contig_2181.11 (at3g56510 : 118.0) RNA-binding (RRM/RBD/RNP motifs) family protein; FUNCTIONS IN: TATA-binding protein binding, nucleic acid binding; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: RNA recognition motif, RNP-1 (InterPro:IPR000504); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). & (reliability: 236.0) & (original description: no original description) 0.7701622554426928 48
evm.model.contig_2622.8 no hits & (original description: no original description) 0.7676205627200601 75
evm.model.contig_3765.1 no hits & (original description: no original description) 0.7671163187873298 84
evm.model.contig_3401.23 no hits & (original description: no original description) 0.7615030042123839 53
evm.model.contig_2175.4 (at3g43700 : 92.8) BTB-POZ and MATH domain 6 (BPM6); CONTAINS InterPro DOMAIN/s: TRAF-like (InterPro:IPR008974), MATH (InterPro:IPR002083), BTB/POZ fold (InterPro:IPR011333), BTB/POZ (InterPro:IPR013069), Kelch related (InterPro:IPR013089), BTB/POZ-like (InterPro:IPR000210), TRAF-type (InterPro:IPR013322); BEST Arabidopsis thaliana protein match is: BTB-POZ and MATH domain 5 (TAIR:AT5G21010.1); Has 7239 Blast hits to 7013 proteins in 213 species: Archae - 0; Bacteria - 0; Metazoa - 5298; Fungi - 161; Plants - 1386; Viruses - 66; Other Eukaryotes - 328 (source: NCBI BLink). & (reliability: 185.6) & (original description: no original description) 0.7613849483367567 60
evm.model.contig_2027.3 no hits & (original description: no original description) 0.7599773571977707 100
evm.model.contig_3652.1 no hits & (original description: no original description) 0.7592266231120577 57
evm.model.contig_499.5 (original description: no original description) 0.7569670422363202 58
evm.model.contig_711.2 no hits & (original description: no original description) 0.740183289233892 67
evm.model.contig_2109.6 no hits & (original description: no original description) 0.740132564273081 71
evm.model.contig_2173.16 no hits & (original description: no original description) 0.731889478554007 97
evm.model.contig_2027.4 no hits & (original description: no original description) 0.7301665909758285 73
evm.model.contig_2066.2 no hits & (original description: no original description) 0.7290310965560962 99
evm.model.contig_2092.9 (original description: no original description) 0.7272688252545267 76
evm.model.contig_4485.3 (at4g04950 : 215.0) thioredoxin family protein; FUNCTIONS IN: electron carrier activity, protein disulfide oxidoreductase activity; INVOLVED IN: cell redox homeostasis; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Thioredoxin fold (InterPro:IPR012335), Glutaredoxin (InterPro:IPR002109), Thioredoxin-like (InterPro:IPR017936), Thioredoxin domain (InterPro:IPR013766), Thioredoxin-like fold (InterPro:IPR012336), Glutaredoxin-related protein (InterPro:IPR004480); BEST Arabidopsis thaliana protein match is: Thioredoxin superfamily protein (TAIR:AT4G32580.1); Has 26535 Blast hits to 17137 proteins in 2757 species: Archae - 249; Bacteria - 14010; Metazoa - 1647; Fungi - 1426; Plants - 1759; Viruses - 3; Other Eukaryotes - 7441 (source: NCBI BLink). & (reliability: 430.0) & (original description: no original description) 0.7271133623602064 77
evm.model.contig_4426.10 (q42698|ggpps_catro : 268.0) Geranylgeranyl pyrophosphate synthetase, chloroplast precursor (GGPP synthetase) (GGPS) [Includes: Dimethylallyltranstransferase (EC 2.5.1.1); Geranyltranstransferase (EC 2.5.1.10); Farnesyltranstransferase (EC 2.5.1.29)] - Catharanthu & (at4g36810 : 254.0) Encodes a protein with geranylgeranyl pyrophosphate synthase activity involved in isoprenoid biosynthesis. The enzyme appears to be targeted to the chloroplast in epidermal cells and guard cells of leaves, and in etioplasts in roots.; geranylgeranyl pyrophosphate synthase 1 (GGPS1); FUNCTIONS IN: farnesyltranstransferase activity; INVOLVED IN: isoprenoid biosynthetic process; LOCATED IN: etioplast, chloroplast; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Polyprenyl synthetase-related (InterPro:IPR017446), Terpenoid synthase (InterPro:IPR008949), Polyprenyl synthetase (InterPro:IPR000092); BEST Arabidopsis thaliana protein match is: Terpenoid synthases superfamily protein (TAIR:AT2G18620.1); Has 16617 Blast hits to 16612 proteins in 2936 species: Archae - 341; Bacteria - 9385; Metazoa - 291; Fungi - 423; Plants - 452; Viruses - 12; Other Eukaryotes - 5713 (source: NCBI BLink). & (reliability: 508.0) & (original description: no original description) 0.7256247040963181 78
evm.model.contig_2036.11 no hits & (original description: no original description) 0.7173419626965063 82
evm.model.contig_2076.1 no hits & (original description: no original description) 0.7168705594942282 83
evm.model.contig_444.3 (at2g47590 : 285.0) photolyase/blue light photoreceptor PHR2 (PHR2) mRNA,; photolyase/blue-light receptor 2 (PHR2); FUNCTIONS IN: DNA photolyase activity; INVOLVED IN: DNA repair; LOCATED IN: cellular_component unknown; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Rossmann-like alpha/beta/alpha sandwich fold (InterPro:IPR014729), DNA photolyase, N-terminal (InterPro:IPR006050), DNA photolyase, FAD-binding/Cryptochrome, C-terminal (InterPro:IPR005101); BEST Arabidopsis thaliana protein match is: cryptochrome 3 (TAIR:AT5G24850.1); Has 4854 Blast hits to 4851 proteins in 1205 species: Archae - 82; Bacteria - 2283; Metazoa - 348; Fungi - 105; Plants - 417; Viruses - 0; Other Eukaryotes - 1619 (source: NCBI BLink). & (q651u1|cryd_orysa : 218.0) Cryptochrome DASH, chloroplast/mitochondrial precursor - Oryza sativa (Rice) & (reliability: 570.0) & (original description: no original description) 0.7089756657528764 88
evm.model.contig_3693.3 no hits & (original description: no original description) 0.6993784705844417 95
evm.model.contig_2032.12 no hits & (original description: no original description) 0.699202940623567 99
evm.model.contig_527.20 no hits & (original description: no original description) 0.6952030671829533 97
evm.model.contig_2400.3 (q9zrf1|mtdh_fraan : 125.0) Probable mannitol dehydrogenase (EC 1.1.1.255) (NAD-dependent mannitol dehydrogenase) - Fragaria ananassa (Strawberry) & (at4g37970 : 122.0) cinnamyl alcohol dehydrogenase 6 (CAD6); FUNCTIONS IN: oxidoreductase activity, zinc ion binding; INVOLVED IN: oxidation reduction; LOCATED IN: cellular_component unknown; EXPRESSED IN: 9 plant structures; EXPRESSED DURING: petal differentiation and expansion stage; CONTAINS InterPro DOMAIN/s: GroES-like (InterPro:IPR011032), Alcohol dehydrogenase GroES-like (InterPro:IPR013154), Alcohol dehydrogenase, zinc-containing, conserved site (InterPro:IPR002328), Alcohol dehydrogenase, C-terminal (InterPro:IPR013149), Alcohol dehydrogenase superfamily, zinc-containing (InterPro:IPR002085); BEST Arabidopsis thaliana protein match is: cinnamyl alcohol dehydrogenase 9 (TAIR:AT4G39330.1); Has 35385 Blast hits to 35371 proteins in 2946 species: Archae - 789; Bacteria - 23408; Metazoa - 1218; Fungi - 2700; Plants - 2633; Viruses - 3; Other Eukaryotes - 4634 (source: NCBI BLink). & (reliability: 244.0) & (original description: no original description) 0.6934246478802587 99