Sequence Description Alias PCC hrr evm.model.contig_4488.2 no hits & (original description: no original description) 0.9306620976072084 16 evm.model.contig_3607.2 no hits & (original description: no original description) 0.9295128360248084 2 evm.model.contig_2059.1 (at3g63140 : 146.0) Encodes a protein with ribonuclease activity that is involved in plastid rRNA maturation.; chloroplast stem-loop binding protein of 41 kDa (CSP41A); FUNCTIONS IN: mRNA binding, poly(U) RNA binding; INVOLVED IN: rRNA processing; LOCATED IN: in 6 components; EXPRESSED IN: 25 plant structures; EXPRESSED DURING: 15 growth stages; CONTAINS InterPro DOMAIN/s: NAD-dependent epimerase/dehydratase (InterPro:IPR001509), NAD(P)-binding domain (InterPro:IPR016040); BEST Arabidopsis thaliana protein match is: chloroplast RNA binding (TAIR:AT1G09340.1); Has 1047 Blast hits to 1047 proteins in 372 species: Archae - 70; Bacteria - 649; Metazoa - 6; Fungi - 5; Plants - 106; Viruses - 0; Other Eukaryotes - 211 (source: NCBI BLink). & (reliability: 292.0) & (original description: no original description) 0.9227060375104572 40 evm.model.contig_3388.8 (at3g11040 : 138.0) Glycosyl hydrolase family 85 ; FUNCTIONS IN: hydrolase activity, acting on glycosyl bonds, mannosyl-glycoprotein endo-beta-N-acetylglucosaminidase activity; INVOLVED IN: biological_process unknown; LOCATED IN: cytoplasm; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Glycoside hydrolase, family 85 (InterPro:IPR005201), Glycoside hydrolase, catalytic core (InterPro:IPR017853); BEST Arabidopsis thaliana protein match is: Glycosyl hydrolase family 85 (TAIR:AT5G05460.1); Has 486 Blast hits to 477 proteins in 213 species: Archae - 0; Bacteria - 256; Metazoa - 108; Fungi - 38; Plants - 49; Viruses - 0; Other Eukaryotes - 35 (source: NCBI BLink). & (reliability: 276.0) & (original description: no original description) 0.9148292178657328 21 evm.model.contig_2285.2 (at2g28305 : 84.0) LONELY GUY 1 (LOG1); CONTAINS InterPro DOMAIN/s: Conserved hypothetical protein CHP00730 (InterPro:IPR005269); BEST Arabidopsis thaliana protein match is: lysine decarboxylase family protein (TAIR:AT2G37210.1); Has 5303 Blast hits to 5301 proteins in 1556 species: Archae - 26; Bacteria - 3660; Metazoa - 10; Fungi - 132; Plants - 396; Viruses - 0; Other Eukaryotes - 1079 (source: NCBI BLink). & (reliability: 167.2) & (original description: no original description) 0.9103666087094645 39 evm.model.contig_4591.1 no hits & (original description: no original description) 0.9081407030338368 64 evm.model.contig_4478.5 no hits & (original description: no original description) 0.9022737187264226 11 evm.model.contig_479.21 no hits & (original description: no original description) 0.9016524401102486 34 evm.model.contig_3677.3 no hits & (original description: no original description) 0.8991310449895051 41 evm.model.contig_3729.2 (at4g17740 : 286.0) Peptidase S41 family protein; FUNCTIONS IN: serine-type peptidase activity; INVOLVED IN: proteolysis, intracellular signaling pathway; LOCATED IN: thylakoid, thylakoid lumen, mitochondrion, chloroplast thylakoid lumen; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 15 growth stages; CONTAINS InterPro DOMAIN/s: Peptidase S41 (InterPro:IPR005151), PDZ/DHR/GLGF (InterPro:IPR001478), Peptidase S41A, C-terminal peptidase (InterPro:IPR004447); BEST Arabidopsis thaliana protein match is: Peptidase S41 family protein (TAIR:AT3G57680.1); Has 9160 Blast hits to 9150 proteins in 1973 species: Archae - 0; Bacteria - 5658; Metazoa - 14; Fungi - 0; Plants - 153; Viruses - 0; Other Eukaryotes - 3335 (source: NCBI BLink). & (reliability: 572.0) & (original description: no original description) 0.8989685044175653 31 evm.model.contig_4429.12 no hits & (original description: no original description) 0.8934046196328499 23 evm.model.contig_3488.10 no hits & (original description: no original description) 0.8848556514481485 65 evm.model.contig_4398.16 no hits & (original description: no original description) 0.8843866306913621 53 evm.model.contig_3490.1 (at3g14390 : 125.0) Pyridoxal-dependent decarboxylase family protein; FUNCTIONS IN: diaminopimelate decarboxylase activity; INVOLVED IN: lysine biosynthetic process via diaminopimelate; LOCATED IN: chloroplast; EXPRESSED IN: guard cell, cultured cell; CONTAINS InterPro DOMAIN/s: Alanine racemase/group IV decarboxylase, C-terminal (InterPro:IPR009006), Ornithine/DAP/Arg decarboxylase (InterPro:IPR000183), Orn/DAP/Arg decarboxylase 2, N-terminal (InterPro:IPR022644), Orn/DAP/Arg decarboxylase 2, C-terminal (InterPro:IPR022643), Diaminopimelate decarboxylase (InterPro:IPR002986), Orn/DAP/Arg decarboxylase 2, conserved site (InterPro:IPR022657), Orn/DAP/Arg decarboxylase 2, pyridoxal-phosphate binding site (InterPro:IPR022653); BEST Arabidopsis thaliana protein match is: Pyridoxal-dependent decarboxylase family protein (TAIR:AT5G11880.1); Has 13020 Blast hits to 12980 proteins in 2586 species: Archae - 150; Bacteria - 7800; Metazoa - 435; Fungi - 194; Plants - 400; Viruses - 27; Other Eukaryotes - 4014 (source: NCBI BLink). & (reliability: 250.0) & (original description: no original description) 0.882462762620089 51 evm.model.contig_2116.2 no hits & (original description: no original description) 0.8805270904304596 57 evm.model.contig_3486.4 (at3g19830 : 179.0) NTMC2T5.2; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF2404, transmembrane (InterPro:IPR019411), C2 membrane targeting protein (InterPro:IPR018029), C2 calcium/lipid-binding domain, CaLB (InterPro:IPR008973), C2 calcium-dependent membrane targeting (InterPro:IPR000008); BEST Arabidopsis thaliana protein match is: N-terminal-transmembrane-C2 domain type 5.1 (TAIR:AT1G50260.1); Has 1308 Blast hits to 1143 proteins in 177 species: Archae - 0; Bacteria - 2; Metazoa - 350; Fungi - 235; Plants - 647; Viruses - 0; Other Eukaryotes - 74 (source: NCBI BLink). & (reliability: 358.0) & (original description: no original description) 0.8804738460027499 66 evm.model.contig_2121.20 no hits & (original description: no original description) 0.879164600839063 52 evm.model.contig_588.5 no hits & (original description: no original description) 0.8755977227323289 49 evm.model.contig_4427.4 no hits & (original description: no original description) 0.8734256726102031 50 evm.model.contig_503.1 no hits & (original description: no original description) 0.8727202821669983 100 evm.model.contig_2062.19 (at3g24430 : 234.0) encodes a MRP-like protein with a nucleotide-binding domain.; HIGH-CHLOROPHYLL-FLUORESCENCE 101 (HCF101); FUNCTIONS IN: ATP binding; INVOLVED IN: oxidation reduction; LOCATED IN: chloroplast; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Mrp, conserved site (InterPro:IPR000808), Gamma-butyrobetaine dioxygenase/Trimethyllysine dioxygenase, N-terminal (InterPro:IPR010376), Protein of unknown function DUF59 (InterPro:IPR002744), ATPase-like, ParA/MinD (InterPro:IPR019591); BEST Arabidopsis thaliana protein match is: IND1(iron-sulfur protein required for NADH dehydrogenase)-like (TAIR:AT4G19540.1); Has 16372 Blast hits to 16340 proteins in 2775 species: Archae - 600; Bacteria - 10162; Metazoa - 436; Fungi - 428; Plants - 202; Viruses - 0; Other Eukaryotes - 4544 (source: NCBI BLink). & (reliability: 468.0) & (original description: no original description) 0.8723570453530355 36 evm.model.contig_4488.9 (at4g01320 : 332.0) CAAX protease with broad substrate specificity. Localized exclusively to the endoplasmic reticulum.; ATSTE24; FUNCTIONS IN: endopeptidase activity, metalloendopeptidase activity; INVOLVED IN: CAAX-box protein maturation, proteolysis; LOCATED IN: endoplasmic reticulum, vacuole; EXPRESSED IN: 25 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Peptidase M48 (InterPro:IPR001915); Has 2991 Blast hits to 2984 proteins in 996 species: Archae - 162; Bacteria - 1572; Metazoa - 206; Fungi - 172; Plants - 49; Viruses - 0; Other Eukaryotes - 830 (source: NCBI BLink). & (reliability: 664.0) & (original description: no original description) 0.8682388038080106 46 evm.model.contig_2194.6 (original description: no original description) 0.8642264532124094 61 evm.model.contig_2044.13 (at3g18524 : 222.0) Encodes a DNA mismatch repair homolog of human MutS gene, MSH6. MSH2 is involved in maintaining genome stability and repressing recombination of mismatched heteroduplexes.There are four MutS genes in Arabidopsis, MSH2, MSH3, MSH6, and MSH7, which all act as heterodimers and bind to 51-mer duplexes. MSH2 has different binding specificity to different mismatches in combination with MSH3, MSH6, or MSH7.; MUTS homolog 2 (MSH2); FUNCTIONS IN: damaged DNA binding, protein binding, mismatched DNA binding, ATP binding; INVOLVED IN: mismatch repair, negative regulation of reciprocal meiotic recombination; LOCATED IN: plasma membrane; EXPRESSED IN: 21 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: DNA mismatch repair protein MutS, clamp (InterPro:IPR007861), DNA mismatch repair protein MutS, connector (InterPro:IPR007860), DNA mismatch repair protein MutS, core (InterPro:IPR007696), DNA mismatch repair protein MutS, C-terminal (InterPro:IPR000432), DNA mismatch repair protein MutS-like, N-terminal (InterPro:IPR007695), DNA mismatch repair protein, MSH2 (InterPro:IPR011184); BEST Arabidopsis thaliana protein match is: homolog of DNA mismatch repair protein MSH3 (TAIR:AT4G25540.1); Has 13560 Blast hits to 13453 proteins in 2654 species: Archae - 128; Bacteria - 8942; Metazoa - 734; Fungi - 813; Plants - 457; Viruses - 3; Other Eukaryotes - 2483 (source: NCBI BLink). & (q9xgc9|msh2_maize : 172.0) DNA mismatch repair protein MSH2 (MUS1) - Zea mays (Maize) & (reliability: 444.0) & (original description: no original description) 0.8622921845516616 50 evm.model.contig_2284.17 (at3g02660 : 391.0) EMBRYO DEFECTIVE 2768 (emb2768); FUNCTIONS IN: RNA binding, tyrosine-tRNA ligase activity, aminoacyl-tRNA ligase activity, nucleotide binding, ATP binding; INVOLVED IN: tRNA aminoacylation for protein translation, embryo development ending in seed dormancy; LOCATED IN: mitochondrion, chloroplast; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 14 growth stages; CONTAINS InterPro DOMAIN/s: Aminoacyl-tRNA synthetase, class I, conserved site (InterPro:IPR001412), Rossmann-like alpha/beta/alpha sandwich fold (InterPro:IPR014729), Tyrosyl-tRNA synthetase, class Ib, bacterial/mitochondrial (InterPro:IPR002307), RNA-binding S4 (InterPro:IPR002942), Aminoacyl-tRNA synthetase, class Ib (InterPro:IPR002305); Has 9022 Blast hits to 9013 proteins in 2715 species: Archae - 16; Bacteria - 5542; Metazoa - 116; Fungi - 145; Plants - 38; Viruses - 0; Other Eukaryotes - 3165 (source: NCBI BLink). & (reliability: 782.0) & (original description: no original description) 0.8612302062268345 55 evm.model.contig_3583.2 (at1g11870 : 475.0) Seryl-tRNA synthetase targeted to chloroplasts and mitochondria. Its inactivation causes developmental arrest of chloroplasts and mitochondria in Nicotiana benthamiana.; Seryl-tRNA synthetase (SRS); FUNCTIONS IN: serine-tRNA ligase activity; INVOLVED IN: chloroplast organization, mitochondrion organization, seryl-tRNA aminoacylation, tRNA aminoacylation, ovule development; LOCATED IN: mitochondrion, chloroplast; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: tRNA-binding arm (InterPro:IPR010978), Aminoacyl-tRNA synthetase, class II (G/ H/ P/ S), conserved domain (InterPro:IPR002314), Seryl-tRNA synthetase, class IIa, N-terminal (InterPro:IPR015866), Seryl-tRNA synthetase, class IIa (InterPro:IPR002317), Ubiquitin supergroup (InterPro:IPR019955), Aminoacyl-tRNA synthetase, class II, conserved domain (InterPro:IPR006195), Seryl-tRNA synthetase, class IIa, C-terminal (InterPro:IPR018156); BEST Arabidopsis thaliana protein match is: seryl-tRNA synthetase / serine--tRNA ligase (TAIR:AT5G27470.1); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). & (o81983|sys_helan : 229.0) Seryl-tRNA synthetase (EC 6.1.1.11) (Serine--tRNA ligase) (SerRS) - Helianthus annuus (Common sunflower) & (reliability: 950.0) & (original description: no original description) 0.8606540726427514 61 evm.model.contig_3395.9 (at2g35040 : 417.0) AICARFT/IMPCHase bienzyme family protein; FUNCTIONS IN: phosphoribosylaminoimidazolecarboxamide formyltransferase activity, IMP cyclohydrolase activity, catalytic activity; INVOLVED IN: response to cold, purine nucleotide biosynthetic process; LOCATED IN: stromule; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: AICARFT/IMPCHase bienzyme, transformylase domain (InterPro:IPR013982), AICARFT/IMPCHase bienzyme (InterPro:IPR002695), MGS-like (InterPro:IPR011607). & (reliability: 834.0) & (original description: no original description) 0.8599848849956587 63 evm.model.contig_2253.2 (at5g06060 : 105.0) NAD(P)-binding Rossmann-fold superfamily protein; FUNCTIONS IN: oxidoreductase activity, binding, catalytic activity; INVOLVED IN: oxidation reduction, metabolic process; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 14 growth stages; CONTAINS InterPro DOMAIN/s: Short-chain dehydrogenase/reductase, conserved site (InterPro:IPR020904), NAD(P)-binding domain (InterPro:IPR016040), Glucose/ribitol dehydrogenase (InterPro:IPR002347), Short-chain dehydrogenase/reductase SDR (InterPro:IPR002198); BEST Arabidopsis thaliana protein match is: NAD(P)-binding Rossmann-fold superfamily protein (TAIR:AT2G29290.2); Has 1807 Blast hits to 1807 proteins in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink). & (reliability: 210.0) & (original description: no original description) 0.8593734457455229 64 evm.model.contig_4443.24 (at1g30520 : 165.0) Encodes a chloroplast O-succinylbenzoyl-CoA ligase. Involved in phylloquinone biosynthesis. Knock mutant is seedling lethal.; acyl-activating enzyme 14 (AAE14); CONTAINS InterPro DOMAIN/s: AMP-binding, conserved site (InterPro:IPR020845), AMP-dependent synthetase/ligase (InterPro:IPR000873); BEST Arabidopsis thaliana protein match is: AMP-dependent synthetase and ligase family protein (TAIR:AT4G19010.1); Has 73301 Blast hits to 67448 proteins in 3614 species: Archae - 1088; Bacteria - 49563; Metazoa - 3259; Fungi - 3557; Plants - 2200; Viruses - 1; Other Eukaryotes - 13633 (source: NCBI BLink). & (o24145|4cl1_tobac : 120.0) 4-coumarate--CoA ligase 1 (EC 6.2.1.12) (4CL 1) (4-coumaroyl-CoA synthase 1) - Nicotiana tabacum (Common tobacco) & (reliability: 330.0) & (original description: no original description) 0.8588521980216823 66 evm.model.contig_2086.1 (at5g13410 : 221.0) FKBP-like peptidyl-prolyl cis-trans isomerase family protein; FUNCTIONS IN: FK506 binding, peptidyl-prolyl cis-trans isomerase activity; INVOLVED IN: protein folding; LOCATED IN: thylakoid, thylakoid lumen, chloroplast thylakoid membrane, chloroplast thylakoid lumen, chloroplast; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Peptidyl-prolyl cis-trans isomerase, FKBP-type (InterPro:IPR001179); BEST Arabidopsis thaliana protein match is: FKBP-like peptidyl-prolyl cis-trans isomerase family protein (TAIR:AT4G19830.1); Has 1807 Blast hits to 1807 proteins in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink). & (reliability: 442.0) & (original description: no original description) 0.8548773658058261 79