Sequence Description Alias PCC hrr
evm.model.contig_532.7 no hits & (original description: no original description) 0.9566436912183574 3 evm.model.contig_4491.7 no hits & (original description: no original description) 0.9457576180658889 3 evm.model.contig_579.7 (at1g50670 : 171.0) OTU-like cysteine protease family protein; FUNCTIONS IN: cysteine-type peptidase activity; INVOLVED IN: biological_process unknown; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 16 growth stages; CONTAINS InterPro DOMAIN/s: Ovarian tumour, otubain (InterPro:IPR003323); Has 406 Blast hits to 406 proteins in 179 species: Archae - 0; Bacteria - 0; Metazoa - 159; Fungi - 136; Plants - 63; Viruses - 0; Other Eukaryotes - 48 (source: NCBI BLink). & (reliability: 342.0) & (original description: no original description) 0.9422557609272839 22 evm.model.contig_4470.2 no hits & (original description: no original description) 0.9416118325865535 4 evm.model.contig_3558.10 no hits & (original description: no original description) 0.9405296615456872 24 evm.model.contig_693.11 (at4g29830 : 103.0) The protein is composed of repeats of WD motif which is involved in protein complex formation. The gene is involved in flower timing and flower development. This gene is predicted to encode a protein with a DWD motif. It can bind to DDB1a in Y2H assays, and DDB1b in co-IP assays, and may be involved in the formation of a CUL4-based E3 ubiquitin ligase. 
Loss of gene function leads to a redistribution of H3K4me3 and H3K36me2 modifications within genes but not a change in the overall abundance of these modifications within chromatin.; vernalization independence 3 (VIP3); FUNCTIONS IN: protein binding, nucleotide binding; INVOLVED IN: histone H3-K4 methylation, histone H3-K36 methylation, negative regulation of flower development; LOCATED IN: CUL4 RING ubiquitin ligase complex, heterotrimeric G-protein complex; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: WD40 repeat 2 (InterPro:IPR019782), WD40 repeat, conserved site (InterPro:IPR019775), WD40 repeat (InterPro:IPR001680), G-protein beta WD-40 repeat, region (InterPro:IPR020472), WD40 repeat-like-containing domain (InterPro:IPR011046), WD40-repeat-containing domain (InterPro:IPR017986), WD40/YVTN repeat-like-containing domain (InterPro:IPR015943), WD40 repeat, subgroup (InterPro:IPR019781); BEST Arabidopsis thaliana protein match is: WD-40 repeat family protein / small nuclear ribonucleoprotein Prp4p-related (TAIR:AT2G41500.1); Has 81424 Blast hits to 33679 proteins in 849 species: Archae - 94; Bacteria - 9736; Metazoa - 32144; Fungi - 18699; Plants - 10476; Viruses - 0; Other Eukaryotes - 10275 (source: NCBI BLink). 
& (reliability: 206.0) & (original description: no original description) 0.9397514239753367 17 evm.model.contig_2122.6 (at3g02090 : 457.0) MPPBETA; FUNCTIONS IN: metalloendopeptidase activity, zinc ion binding; INVOLVED IN: proteolysis; LOCATED IN: in 11 components; EXPRESSED IN: 27 plant structures; EXPRESSED DURING: 17 growth stages; CONTAINS InterPro DOMAIN/s: Peptidase M16, zinc-binding site (InterPro:IPR001431), Peptidase M16, C-terminal (InterPro:IPR007863), Peptidase M16, N-terminal (InterPro:IPR011765), Metalloenzyme, LuxS/M16 peptidase-like, metal-binding (InterPro:IPR011249), Peptidase M16, core (InterPro:IPR011237); BEST Arabidopsis thaliana protein match is: Insulinase (Peptidase family M16) protein (TAIR:AT1G51980.1); Has 13067 Blast hits to 12610 proteins in 2372 species: Archae - 22; Bacteria - 8565; Metazoa - 1070; Fungi - 780; Plants - 365; Viruses - 3; Other Eukaryotes - 2262 (source: NCBI BLink). & (p29677|mppa_soltu : 202.0) Mitochondrial-processing peptidase alpha subunit, mitochondrial precursor (EC 3.4.24.64) (Alpha-MPP) (Ubiquinol-cytochrome-c reductase subunit II) (EC 1.10.2.2) - Solanum tuberosum (Potato) & (reliability: 914.0) & (original description: no original description) 0.9387046482292959 13 evm.model.contig_2149.18 (at4g38130 : 505.0) Encodes a histone deacetylase that enhances AtERF7-mediated transcriptional repression. Binds SIM3 and ERF7. Expressed in the nucleus in most tissues examined and throughout the life of the plant. Involved in jasmonic acid and ethylene dependent pathogen resistance. The sequence in GenBank has 17 AG dinucleotide repeats missing, which is also missing in Ler shotgun sequence from Cereon. Although it is annotated to be in Columbia, the GB sequence is probably not of Columbia origin. Plays a role in embryogenesis as mutants grown at higher temperatures display abnormalities in the organization of the root and shoot. 
Plant lines expressing an RNAi construct targeted against HDA19 show some resistance to agrobacterium-mediated root transformation.; histone deacetylase 1 (HD1); CONTAINS InterPro DOMAIN/s: Histone deacetylase (InterPro:IPR003084), Histone deacetylase superfamily (InterPro:IPR000286); BEST Arabidopsis thaliana protein match is: histone deacetylase 6 (TAIR:AT5G63110.1); Has 8759 Blast hits to 8549 proteins in 1452 species: Archae - 219; Bacteria - 3192; Metazoa - 1525; Fungi - 536; Plants - 478; Viruses - 0; Other Eukaryotes - 2809 (source: NCBI BLink). & (p56521|hdac_maize : 483.0) Probable histone deacetylase (RPD3 homolog) - Zea mays (Maize) & (reliability: 1010.0) & (original description: no original description) 0.9352936287433019 8 evm.model.contig_534.3 no hits & (original description: no original description) 0.9328728804829481 21 evm.model.contig_2273.22 no hits & (original description: no original description) 0.931018901612725 20 evm.model.contig_3583.3 (at1g18480 : 202.0) Calcineurin-like metallo-phosphoesterase superfamily protein; FUNCTIONS IN: hydrolase activity, protein serine/threonine phosphatase activity; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: 24 plant structures; EXPRESSED DURING: 15 growth stages; CONTAINS InterPro DOMAIN/s: Metallophosphoesterase (InterPro:IPR004843); BEST Arabidopsis thaliana protein match is: Calcineurin-like metallo-phosphoesterase superfamily protein (TAIR:AT1G07010.1); Has 638 Blast hits to 634 proteins in 194 species: Archae - 15; Bacteria - 274; Metazoa - 0; Fungi - 21; Plants - 102; Viruses - 3; Other Eukaryotes - 223 (source: NCBI BLink). 
& (reliability: 404.0) & (original description: no original description) 0.9290594904182776 13 evm.model.contig_3404.14 no hits & (original description: no original description) 0.9288339545748467 23 evm.model.contig_3383.2 (q9fns4|mbb1_chlre : 143.0) PsbB mRNA maturation factor Mbb1, chloroplast precursor - Chlamydomonas reinhardtii & (at3g17040 : 134.0) It is an RNA tetratricopeptide repeat-containing protein required for normal processing of transcripts from the polycistronic chloroplast psbB-psbT-psbH-petB-petD operon coding for proteins of the photosystem II and cytochrome b6/f complexes. Localizes to the chloroplast membrane. Involved in regulating plastidial gene expression and biogenesis.; high chlorophyll fluorescent 107 (HCF107); FUNCTIONS IN: binding; INVOLVED IN: plastid organization, RNA processing, regulation of translation; LOCATED IN: chloroplast envelope; EXPRESSED IN: 21 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: RNA-processing protein, HAT helix (InterPro:IPR003107), Tetratricopeptide-like helical (InterPro:IPR011990), Tetratricopeptide repeat-containing (InterPro:IPR013026), Tetratricopeptide repeat (InterPro:IPR019734); BEST Arabidopsis thaliana protein match is: pre-mRNA splicing factor-related (TAIR:AT4G03430.1). & (reliability: 268.0) & (original description: no original description) 0.9254726822436107 23 evm.model.contig_4407.6 (original description: no original description) 0.9219482374603779 14 evm.model.contig_4455.4 (at3g25860 : 246.0) Nuclear encoded dihydrolipoamide S-acetyltransferase (LTA2) that encodes the Pyruvate Decarboxylase E2 subunit. 
Mutant has embryo defect.; LTA2; FUNCTIONS IN: dihydrolipoyllysine-residue acetyltransferase activity; INVOLVED IN: metabolic process, acetyl-CoA biosynthetic process from pyruvate; LOCATED IN: cytosolic ribosome, chloroplast stroma, chloroplast, membrane, chloroplast envelope; EXPRESSED IN: 24 plant structures; EXPRESSED DURING: 15 growth stages; CONTAINS InterPro DOMAIN/s: 2-oxo acid dehydrogenase, lipoyl-binding site (InterPro:IPR003016), E3 binding (InterPro:IPR004167), 2-oxoacid dehydrogenase acyltransferase, catalytic domain (InterPro:IPR001078), Single hybrid motif (InterPro:IPR011053), Biotin/lipoyl attachment (InterPro:IPR000089); BEST Arabidopsis thaliana protein match is: 2-oxoacid dehydrogenases acyltransferase family protein (TAIR:AT1G34430.1); Has 23844 Blast hits to 20819 proteins in 2344 species: Archae - 101; Bacteria - 13163; Metazoa - 1146; Fungi - 799; Plants - 467; Viruses - 32; Other Eukaryotes - 8136 (source: NCBI BLink). & (reliability: 478.0) & (original description: no original description) 0.9219087163035066 15 evm.model.contig_2126.13 no hits & (original description: no original description) 0.9210177955429192 16 evm.model.contig_818.2 (at3g04650 : 217.0) FAD/NAD(P)-binding oxidoreductase family protein; LOCATED IN: chloroplast; EXPRESSED IN: 19 plant structures; EXPRESSED DURING: 12 growth stages; BEST Arabidopsis thaliana protein match is: FAD/NAD(P)-binding oxidoreductase family protein (TAIR:AT1G56000.1); Has 902 Blast hits to 899 proteins in 231 species: Archae - 14; Bacteria - 382; Metazoa - 7; Fungi - 2; Plants - 133; Viruses - 0; Other Eukaryotes - 364 (source: NCBI BLink). 
& (reliability: 434.0) & (original description: no original description) 0.9200407852752555 57 evm.model.contig_4541.4 no hits & (original description: no original description) 0.919951818911924 19 evm.model.contig_4448.12 no hits & (original description: no original description) 0.919459132135168 64 evm.model.contig_2107.1 (q6k9n6|sucb_orysa : 409.0) Succinyl-CoA ligase [GDP-forming] beta-chain, mitochondrial precursor (EC 6.2.1.4) (Succinyl-CoA synthetase, beta chain) (SCS-beta) - Oryza sativa (Rice) & (at2g20420 : 407.0) ATP citrate lyase (ACL) family protein; FUNCTIONS IN: succinate-CoA ligase (GDP-forming) activity, copper ion binding, ATP binding; INVOLVED IN: response to cadmium ion, metabolic process; LOCATED IN: mitochondrion; EXPRESSED IN: 26 plant structures; EXPRESSED DURING: 16 growth stages; CONTAINS InterPro DOMAIN/s: Succinyl-CoA synthetase, beta subunit (InterPro:IPR005809), Succinyl-CoA synthetase, beta subunit, conserved site (InterPro:IPR017866), ATP-citrate lyase/succinyl-CoA ligase (InterPro:IPR005811), ATP-grasp fold (InterPro:IPR011761), ATP-grasp fold, subdomain 2 (InterPro:IPR013816), ATP-grasp fold, succinyl-CoA synthetase-type (InterPro:IPR013650), Succinyl-CoA synthetase-like (InterPro:IPR016102); BEST Arabidopsis thaliana protein match is: ATP-citrate lyase A-1 (TAIR:AT1G10670.4); Has 9337 Blast hits to 9333 proteins in 2108 species: Archae - 181; Bacteria - 4147; Metazoa - 466; Fungi - 228; Plants - 81; Viruses - 0; Other Eukaryotes - 4234 (source: NCBI BLink). 
& (gnl|cdd|68872 : 84.3) no description available & (reliability: 814.0) & (original description: no original description) 0.919322531079729 68 evm.model.contig_4594.5 no hits & (original description: no original description) 0.9192053163215435 21 evm.model.contig_433.3 no hits & (original description: no original description) 0.9182400335359395 63 evm.model.contig_2189.4 (q6zl94|suca_orysa : 332.0) Probable succinyl-CoA ligase [GDP-forming] subunit alpha, mitochondrial precursor (EC 6.2.1.4) (Succinyl-CoA synthetase subunit alpha) (SCS-alpha) - Oryza sativa (Rice) & (at5g23250 : 327.0) Succinyl-CoA ligase, alpha subunit; FUNCTIONS IN: succinate-CoA ligase (GDP-forming) activity, copper ion binding; INVOLVED IN: metabolic process; LOCATED IN: mitochondrion; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 14 growth stages; CONTAINS InterPro DOMAIN/s: Succinyl-CoA ligase, alpha subunit (InterPro:IPR005810), ATP-citrate lyase/succinyl-CoA ligase (InterPro:IPR005811), NAD(P)-binding domain (InterPro:IPR016040), ATP-citrate lyase/succinyl-CoA ligase, active site (InterPro:IPR017440), CoA-binding (InterPro:IPR003781), Succinyl-CoA synthetase-like (InterPro:IPR016102); BEST Arabidopsis thaliana protein match is: Succinyl-CoA ligase, alpha subunit (TAIR:AT5G08300.1); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). 
& (reliability: 654.0) & (original description: no original description) 0.9173139382713549 26 evm.model.contig_3624.2 (at3g47590 : 174.0) alpha/beta-Hydrolases superfamily protein; CONTAINS InterPro DOMAIN/s: BAAT/Acyl-CoA thioester hydrolase C-terminal (InterPro:IPR014940); BEST Arabidopsis thaliana protein match is: alpha/beta-Hydrolases superfamily protein (TAIR:AT1G29840.1); Has 4006 Blast hits to 4003 proteins in 1107 species: Archae - 84; Bacteria - 2507; Metazoa - 7; Fungi - 75; Plants - 235; Viruses - 7; Other Eukaryotes - 1091 (source: NCBI BLink). & (reliability: 348.0) & (original description: no original description) 0.9171224579457963 41 evm.model.contig_4439.3 no hits & (original description: no original description) 0.9165767586087668 26 evm.model.contig_2049.2 (at2g29360 : 137.0) NAD(P)-binding Rossmann-fold superfamily protein; FUNCTIONS IN: oxidoreductase activity, binding, catalytic activity; INVOLVED IN: oxidation reduction, metabolic process; LOCATED IN: cellular_component unknown; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 14 growth stages; CONTAINS InterPro DOMAIN/s: Short-chain dehydrogenase/reductase, conserved site (InterPro:IPR020904), NAD(P)-binding domain (InterPro:IPR016040), Glucose/ribitol dehydrogenase (InterPro:IPR002347), Short-chain dehydrogenase/reductase SDR (InterPro:IPR002198); BEST Arabidopsis thaliana protein match is: NAD(P)-binding Rossmann-fold superfamily protein (TAIR:AT2G29150.1); Has 124543 Blast hits to 124288 proteins in 3623 species: Archae - 1000; Bacteria - 81334; Metazoa - 5904; Fungi - 6580; Plants - 2878; Viruses - 5; Other Eukaryotes - 26842 (source: NCBI BLink). 
& (q949m2|fabg4_brana : 102.0) 3-oxoacyl-[acyl-carrier-protein] reductase 4 (EC 1.1.1.100) (3-ketoacyl-acyl carrier protein reductase 4) (Fragment) - Brassica napus (Rape) & (reliability: 266.0) & (original description: no original description) 0.9143051858552105 27 evm.model.contig_2369.2 no hits & (original description: no original description) 0.913843799104918 36 evm.model.contig_3420.6 no hits & (original description: no original description) 0.91297637827674 29 evm.model.contig_2141.7 (at5g36160 : 213.0) Tyrosine transaminase family protein; FUNCTIONS IN: 1-aminocyclopropane-1-carboxylate synthase activity, pyridoxal phosphate binding, transferase activity, transferring nitrogenous groups, transaminase activity, catalytic activity; INVOLVED IN: tyrosine catabolic process to phosphoenolpyruvate, cellular amino acid and derivative metabolic process, biosynthetic process; LOCATED IN: cellular_component unknown; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: 1-aminocyclopropane-1-carboxylate synthase (InterPro:IPR001176), Aminotransferase, class I/classII (InterPro:IPR004839), Pyridoxal phosphate-dependent transferase, major domain (InterPro:IPR015424), Tyrosine transaminase (InterPro:IPR021178), Tyrosine/nicotianamine aminotransferase (InterPro:IPR005958), Pyridoxal phosphate-dependent transferase, major region, subdomain 1 (InterPro:IPR015421); BEST Arabidopsis thaliana protein match is: Tyrosine transaminase family protein (TAIR:AT5G53970.1); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). 
& (reliability: 414.0) & (original description: no original description) 0.9127614112699324 75 evm.model.contig_4408.18 no hits & (original description: no original description) 0.9123946028418006 31 evm.model.contig_4408.4 (at2g26060 : 174.0) embryo defective 1345 (emb1345); CONTAINS InterPro DOMAIN/s: WD40 repeat 2 (InterPro:IPR019782), WD40 repeat, conserved site (InterPro:IPR019775), WD40 repeat (InterPro:IPR001680), G-protein beta WD-40 repeat, region (InterPro:IPR020472), WD40 repeat-like-containing domain (InterPro:IPR011046), WD40-repeat-containing domain (InterPro:IPR017986), WD40/YVTN repeat-like-containing domain (InterPro:IPR015943), WD40 repeat, subgroup (InterPro:IPR019781); BEST Arabidopsis thaliana protein match is: Transducin/WD40 repeat-like superfamily protein (TAIR:AT4G32990.1); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). & (reliability: 348.0) & (original description: no original description) 0.9096504327653412 35 evm.model.contig_3436.5 (at3g18524 : 456.0) Encodes a DNA mismatch repair homolog of human MutS gene, MSH6. MSH2 is involved in maintaining genome stability and repressing recombination of mismatched heteroduplexes. There are four MutS genes in Arabidopsis, MSH2, MSH3, MSH6, and MSH7, which all act as heterodimers and bind to 51-mer duplexes. 
MSH2 has different binding specificity to different mismatches in combination with MSH3, MSH6, or MSH7.; MUTS homolog 2 (MSH2); FUNCTIONS IN: damaged DNA binding, protein binding, mismatched DNA binding, ATP binding; INVOLVED IN: mismatch repair, negative regulation of reciprocal meiotic recombination; LOCATED IN: plasma membrane; EXPRESSED IN: 21 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: DNA mismatch repair protein MutS, clamp (InterPro:IPR007861), DNA mismatch repair protein MutS, connector (InterPro:IPR007860), DNA mismatch repair protein MutS, core (InterPro:IPR007696), DNA mismatch repair protein MutS, C-terminal (InterPro:IPR000432), DNA mismatch repair protein MutS-like, N-terminal (InterPro:IPR007695), DNA mismatch repair protein, MSH2 (InterPro:IPR011184); BEST Arabidopsis thaliana protein match is: homolog of DNA mismatch repair protein MSH3 (TAIR:AT4G25540.1); Has 13560 Blast hits to 13453 proteins in 2654 species: Archae - 128; Bacteria - 8942; Metazoa - 734; Fungi - 813; Plants - 457; Viruses - 3; Other Eukaryotes - 2483 (source: NCBI BLink). 
& (q9xgc9|msh2_maize : 412.0) DNA mismatch repair protein MSH2 (MUS1) - Zea mays (Maize) & (reliability: 912.0) & (original description: no original description) 0.9092630024536341 34 evm.model.contig_2094.15 no hits & (original description: no original description) 0.9085054827307587 35 evm.model.contig_2185.8 (p23957|vatl_avesa : 150.0) Vacuolar ATP synthase 16 kDa proteolipid subunit (EC 3.6.3.14) - Avena sativa (Oat) & (at4g34720 : 146.0) vacuolar H+-pumping ATPase 16 kDa proteolipid (ava-p1); AVA-P1; FUNCTIONS IN: ATPase activity, proton-transporting ATPase activity, rotational mechanism; INVOLVED IN: proton transport, ATP synthesis coupled proton transport; LOCATED IN: vacuole; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 15 growth stages; CONTAINS InterPro DOMAIN/s: ATPase, F0/V0 complex, subunit C (InterPro:IPR002379), ATPase, V0 complex, proteolipid subunit C, eukaryotic (InterPro:IPR011555), ATPase, V0 complex, proteolipid subunit C (InterPro:IPR000245); BEST Arabidopsis thaliana protein match is: vacuolar-type H(+)-ATPase C3 (TAIR:AT4G38920.1); Has 2722 Blast hits to 2495 proteins in 678 species: Archae - 169; Bacteria - 703; Metazoa - 633; Fungi - 468; Plants - 344; Viruses - 0; Other Eukaryotes - 405 (source: NCBI BLink). 
& (reliability: 292.0) & (original description: no original description) 0.9077885998277557 37 evm.model.contig_2069.4 no hits & (original description: no original description) 0.9072971671637627 40 evm.model.contig_3491.8 (at2g43760 : 110.0) molybdopterin biosynthesis MoaE family protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: Mo-molybdopterin cofactor biosynthetic process; LOCATED IN: cellular_component unknown; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Molybdopterin biosynthesis MoaE (InterPro:IPR003448); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). & (reliability: 220.0) & (original description: no original description) 0.9065046469278509 39 evm.model.contig_2130.6 (at1g17745 : 104.0) encodes a 3-Phosphoglycerate dehydrogenase; D-3-phosphoglycerate dehydrogenase; CONTAINS InterPro DOMAIN/s: D-3-phosphoglycerate dehydrogenase (InterPro:IPR006236), D-isomer specific 2-hydroxyacid dehydrogenase, catalytic domain (InterPro:IPR006139), D-isomer specific 2-hydroxyacid dehydrogenase, NAD-binding (InterPro:IPR006140), D-3-phosphogylcerate Dehydrogenase (InterPro:IPR015508), NAD(P)-binding domain (InterPro:IPR016040), Amino acid-binding ACT (InterPro:IPR002912); BEST Arabidopsis thaliana protein match is: D-3-phosphoglycerate dehydrogenase (TAIR:AT4G34200.1). 
& (q9zri8|fdh_horvu : 80.9) Formate dehydrogenase, mitochondrial precursor (EC 1.2.1.2) (NAD-dependent formate dehydrogenase) (FDH) - Hordeum vulgare (Barley) & (reliability: 208.0) & (original description: no original description) 0.9059249799592833 94 evm.model.contig_498.7 (at3g18630 : 219.0) Encodes a uracil-DNA glycosylase (UDG) involved in a base excision DNA repair pathway in mitochondria.; uracil dna glycosylase (UNG); FUNCTIONS IN: uracil DNA N-glycosylase activity; INVOLVED IN: DNA repair, base-excision repair; LOCATED IN: mitochondrion; EXPRESSED IN: 15 plant structures; EXPRESSED DURING: 8 growth stages; CONTAINS InterPro DOMAIN/s: Uracil-DNA glycosylase (InterPro:IPR002043), Uracil-DNA glycosylase-like (InterPro:IPR005122); BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G10550.1); Has 5606 Blast hits to 5606 proteins in 2219 species: Archae - 2; Bacteria - 4117; Metazoa - 124; Fungi - 141; Plants - 47; Viruses - 234; Other Eukaryotes - 941 (source: NCBI BLink). & (reliability: 438.0) & (original description: no original description) 0.9039964001992016 41 evm.model.contig_3606.1 no hits & (original description: no original description) 0.9031871088881744 42 evm.model.contig_2126.7 (at5g12470 : 136.0) Protein of unknown function (DUF3411); FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: mitochondrion, chloroplast, plastid, chloroplast inner membrane, chloroplast envelope; EXPRESSED IN: 24 plant structures; EXPRESSED DURING: 14 growth stages; CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF3411 (InterPro:IPR021825); BEST Arabidopsis thaliana protein match is: Protein of unknown function (DUF399 and DUF3411) (TAIR:AT2G40400.2); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). 
& (reliability: 260.0) & (original description: no original description) 0.9009559207442777 43 evm.model.contig_2126.12 no hits & (original description: no original description) 0.8986520962855541 44 evm.model.contig_2048.10 (at3g23580 : 461.0) Encodes one of the 3 ribonucleotide reductase (RNR) small subunit genes (RNR2A). Functionally redundant with the ribonucleotide reductase TSO2. mRNA was shown to specifically accumulate during the S-phase of the cell cycle in synchronized tobacco BY2 cells. Critical for cell cycle progression, DNA damage repair and plant development.; ribonucleotide reductase 2A (RNR2A); CONTAINS InterPro DOMAIN/s: Ribonucleotide reductase-related (InterPro:IPR012348), Ribonucleotide reductase (InterPro:IPR000358), Ferritin/ribonucleotide reductase-like (InterPro:IPR009078); BEST Arabidopsis thaliana protein match is: Ferritin/ribonucleotide reductase-like family protein (TAIR:AT3G27060.1); Has 9602 Blast hits to 9597 proteins in 2376 species: Archae - 34; Bacteria - 4358; Metazoa - 261; Fungi - 240; Plants - 185; Viruses - 729; Other Eukaryotes - 3795 (source: NCBI BLink). & (p49730|rir2_tobac : 453.0) Ribonucleoside-diphosphate reductase small chain (EC 1.17.4.1) (Ribonucleotide reductase small subunit) (Ribonucleoside-diphosphate reductase R2 subunit) - Nicotiana tabacum (Common tobacco) & (reliability: 922.0) & (original description: no original description) 0.897134555194872 45 evm.model.contig_4400.3 no hits & (original description: no original description) 0.8967071990465935 50 evm.model.contig_4416.9 (at1g67320 : 211.0) DNA primase, large subunit family; FUNCTIONS IN: DNA primase activity; INVOLVED IN: DNA replication, synthesis of RNA primer; EXPRESSED IN: 21 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: DNA primase, large subunit, eukaryotic (InterPro:IPR016558), DNA primase, large subunit, eukaryotic/archaeal (InterPro:IPR007238). 
& (reliability: 422.0) & (original description: no original description) 0.8966273440610942 47 evm.model.contig_2109.1 no hits & (original description: no original description) 0.8946727262187496 48 evm.model.contig_441.29 no hits & (original description: no original description) 0.8936709993253127 51 evm.model.contig_4408.20 (at3g13930 : 231.0) Dihydrolipoamide acetyltransferase, long form protein; FUNCTIONS IN: dihydrolipoyllysine-residue acetyltransferase activity, copper ion binding; INVOLVED IN: pyruvate metabolic process, metabolic process; LOCATED IN: mitochondrion, chloroplast, chloroplast envelope; EXPRESSED IN: 25 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: 2-oxo acid dehydrogenase, lipoyl-binding site (InterPro:IPR003016), Dihydrolipoamide acetyltransferase, long form (InterPro:IPR006257), E3 binding (InterPro:IPR004167), 2-oxoacid dehydrogenase acyltransferase, catalytic domain (InterPro:IPR001078), Single hybrid motif (InterPro:IPR011053), Biotin/lipoyl attachment (InterPro:IPR000089); BEST Arabidopsis thaliana protein match is: Dihydrolipoamide acetyltransferase, long form protein (TAIR:AT1G54220.2); Has 21425 Blast hits to 19790 proteins in 2331 species: Archae - 106; Bacteria - 12026; Metazoa - 730; Fungi - 474; Plants - 369; Viruses - 0; Other Eukaryotes - 7720 (source: NCBI BLink). 
& (gnl|cdd|68872 : 103.0) no description available & (reliability: 462.0) & (original description: no original description) 0.8933333211149712 85 evm.model.contig_489.1 no hits & (original description: no original description) 0.8932388100894717 68 evm.model.contig_3482.2 (at3g45830 : 159.0) unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: 24 plant structures; EXPRESSED DURING: 15 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G02290.1); Has 499 Blast hits to 438 proteins in 100 species: Archae - 0; Bacteria - 7; Metazoa - 236; Fungi - 15; Plants - 108; Viruses - 2; Other Eukaryotes - 131 (source: NCBI BLink). & (reliability: 318.0) & (original description: no original description) 0.8926126534860077 56 evm.model.contig_2198.2 no hits & (original description: no original description) 0.8917521003905583 57 evm.model.contig_2290.5 no hits & (original description: no original description) 0.8910683697473477 98 evm.model.contig_2172.1 no hits & (original description: no original description) 0.8909841030244012 60 evm.model.contig_3674.1 no hits & (original description: no original description) 0.8906911198430109 61 evm.model.contig_2021.3 no hits & (original description: no original description) 0.8904164021114894 81 evm.model.contig_2494.4 no hits & (original description: no original description) 0.8901338597004204 64 evm.model.contig_4541.3 no hits & (original description: no original description) 0.8890114848653067 66 evm.model.contig_2183.3 (at5g48870 : 108.0) SAD1 encodes a polypeptide similar to multifunctional Sm-like snRNP proteins that are required for mRNA splicing, export, and degradation. 
Mutation in this gene increases plant sensitivity to drought stress and ABA in seed germination, root growth, and the expression of some stress-responsive genes.; SUPERSENSITIVE TO ABA AND DROUGHT 1 (SAD1); CONTAINS InterPro DOMAIN/s: Like-Sm ribonucleoprotein (LSM) domain (InterPro:IPR001163), Like-Sm ribonucleoprotein (LSM) domain, eukaryotic/archaea-type (InterPro:IPR006649), Like-Sm ribonucleoprotein (LSM)-related domain (InterPro:IPR010920); Has 1807 Blast hits to 1807 proteins in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink). & (reliability: 216.0) & (original description: no original description) 0.8889994726862663 67 evm.model.contig_527.12 no hits & (original description: no original description) 0.8871149219266451 69 evm.model.contig_554.1 no hits & (original description: no original description) 0.885596211435452 79 evm.model.contig_2501.11 no hits & (original description: no original description) 0.885579290169574 72 evm.model.contig_2122.18 no hits & (original description: no original description) 0.8853593660406218 73 evm.model.contig_3488.6 no hits & (original description: no original description) 0.8833272689729224 86 evm.model.contig_2082.6 no hits & (original description: no original description) 0.8825572909996173 92 evm.model.contig_4408.6 no hits & (original description: no original description) 0.8819428489364862 78 evm.model.contig_3081.1 no hits & (original description: no original description) 0.8809852789306076 79 evm.model.contig_31.1 no hits & (original description: no original description) 0.8807702787124161 80 evm.model.contig_2273.25 no hits & (original description: no original description) 0.8803377373495752 82 evm.model.contig_4413.3 no hits & (original description: no original description) 0.8801319031719986 83 evm.model.contig_461.2 no hits & (original description: no original description) 0.8800858359312428 84 evm.model.contig_3475.3 
(q00268|pcna1_dauca : 227.0) Proliferating cell nuclear antigen (PCNA) (Cyclin) - Daucus carota (Carrot) & (at1g07370 : 226.0) Encodes putative proliferating cell nuclear antigen involved in cell cycle regulation.; proliferating cellular nuclear antigen 1 (PCNA1); CONTAINS InterPro DOMAIN/s: Proliferating cell nuclear antigen, PCNA (InterPro:IPR000730), Proliferating cell nuclear antigen, PCNA, C-terminal (InterPro:IPR022649), Proliferating cell nuclear antigen, PCNA, conserved site (InterPro:IPR022659), Proliferating cell nuclear antigen, PCNA, N-terminal (InterPro:IPR022648); BEST Arabidopsis thaliana protein match is: proliferating cell nuclear antigen 2 (TAIR:AT2G29570.1); Has 1857 Blast hits to 1845 proteins in 456 species: Archae - 391; Bacteria - 0; Metazoa - 315; Fungi - 169; Plants - 159; Viruses - 71; Other Eukaryotes - 752 (source: NCBI BLink). & (reliability: 452.0) & (original description: no original description) 0.8799466580454192 85 evm.model.contig_4409.1 (at5g08710 : 103.0) Regulator of chromosome condensation (RCC1) family protein; CONTAINS InterPro DOMAIN/s: Regulator of chromosome condensation/beta-lactamase-inhibitor protein II (InterPro:IPR009091), Regulator of chromosome condensation, RCC1 (InterPro:IPR000408); BEST Arabidopsis thaliana protein match is: Regulator of chromosome condensation (RCC1) family protein (TAIR:AT5G63860.1); Has 22297 Blast hits to 6326 proteins in 481 species: Archae - 85; Bacteria - 2816; Metazoa - 7462; Fungi - 1554; Plants - 2765; Viruses - 0; Other Eukaryotes - 7615 (source: NCBI BLink). 
& (reliability: 192.6) & (original description: no original description) 0.8793058843564037 86 evm.model.contig_2354.1 no hits & (original description: no original description) 0.8783233045672472 88 evm.model.contig_2179.3 no hits & (original description: no original description) 0.877664618779799 91 evm.model.contig_490.1 (at4g02060 : 555.0) Member of the minichromosome maintenance complex, involved in DNA replication initiation. Abundant in proliferating and endocycling tissues. Localized in the nucleus during G1, S and G2 phases of the cell cycle, and are released into the cytoplasmic compartment during mitosis. Binds chromatin.; PROLIFERA (PRL); CONTAINS InterPro DOMAIN/s: Nucleic acid-binding, OB-fold-like (InterPro:IPR016027), Nucleic acid-binding, OB-fold (InterPro:IPR012340), ATPase, AAA+ type, core (InterPro:IPR003593), DNA-dependent ATPase MCM (InterPro:IPR001208), DNA-dependent ATPase MCM, conserved site (InterPro:IPR018525), MCM protein 7 (InterPro:IPR008050); BEST Arabidopsis thaliana protein match is: minichromosome maintenance (MCM2/3/5) family protein (TAIR:AT5G44635.1). 
& (q43704|mcm3_maize : 239.0) DNA replication licensing factor MCM3 homolog (Replication origin activator) (ROA protein) (Fragment) - Zea mays (Maize) & (reliability: 1110.0) & (original description: no original description) 0.8775935565702919 92 evm.model.contig_4660.1 no hits & (original description: no original description) 0.8774312630612285 93 evm.model.contig_2100.2 (q6z437|mpk3_orysa : 259.0) Mitogen-activated protein kinase 3 (EC 2.7.11.24) (MAP kinase 3) (OsMAP3) (MAP kinase 2) (OsMAPK2) - Oryza sativa (Rice) & (at1g10210 : 258.0) Encodes ATMPK1.; mitogen-activated protein kinase 1 (ATMPK1); CONTAINS InterPro DOMAIN/s: Protein kinase, ATP binding site (InterPro:IPR017441), Protein kinase, catalytic domain (InterPro:IPR000719), Serine/threonine-protein kinase domain (InterPro:IPR002290), Serine/threonine-protein kinase-like domain (InterPro:IPR017442), Protein kinase-like domain (InterPro:IPR011009), Serine/threonine-protein kinase, active site (InterPro:IPR008271); BEST Arabidopsis thaliana protein match is: mitogen-activated protein kinase homolog 2 (TAIR:AT1G59580.2); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). & (reliability: 516.0) & (original description: no original description) 0.8772695823725827 94 evm.model.contig_4637.1 (at5g44200 : 144.0) Encodes a nuclear cap-binding protein that forms a heterodimeric complex with ABH1 (ATCBP80) and is likely to participate in RNA metabolism. 
Its mRNA is ubiquitously expressed. Loss of function mutations suggest a role in processing of pri-miRNA and mRNA splicing.; CAP-binding protein 20 (CBP20); FUNCTIONS IN: RNA binding, RNA cap binding; INVOLVED IN: RNA metabolic process, RNA splicing, via endonucleolytic cleavage and ligation, primary microRNA processing; LOCATED IN: mRNA cap binding complex; EXPRESSED IN: 24 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: RNA recognition motif, RNP-1 (InterPro:IPR000504), Nucleotide-binding, alpha-beta plait (InterPro:IPR012677); BEST Arabidopsis thaliana protein match is: RNA-binding (RRM/RBD/RNP motifs) family protein (TAIR:AT3G46020.1); Has 9575 Blast hits to 9549 proteins in 2190 species: Archae - 96; Bacteria - 7682; Metazoa - 239; Fungi - 361; Plants - 280; Viruses - 2; Other Eukaryotes - 915 (source: NCBI BLink). & (reliability: 288.0) & (original description: no original description) 0.8768917561712715 96 evm.model.contig_3467.7 (at1g67630 : 101.0) DNA polymerase alpha 2 (POLA2); FUNCTIONS IN: DNA binding, DNA-directed DNA polymerase activity; INVOLVED IN: DNA replication; LOCATED IN: mitochondrion; EXPRESSED IN: 21 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: DNA polymerase alpha, subunit B N-terminal (InterPro:IPR013627), DNA polymerase alpha, subunit B (InterPro:IPR016722), DNA polymerase alpha/epsilon, subunit B (InterPro:IPR007185); Has 415 Blast hits to 412 proteins in 190 species: Archae - 0; Bacteria - 0; Metazoa - 175; Fungi - 140; Plants - 46; Viruses - 0; Other Eukaryotes - 54 (source: NCBI BLink). & (reliability: 202.0) & (original description: no original description) 0.8768159332817475 97 evm.model.contig_2081.2 no hits & (original description: no original description) 0.8767455185944077 98