Sequence	Description	Alias	PCC	hrr
evm.model.contig_2494.14	no hits & (original description: no original description)		0.8939138074273092	11
evm.model.contig_2622.8	no hits & (original description: no original description)		0.8789688667342814	3
evm.model.contig_601.3	no hits & (original description: no original description)		0.8681865447603044	12
evm.model.contig_477.4	no hits & (original description: no original description)		0.859544100536785	13
evm.model.contig_2090.41	no hits & (original description: no original description)		0.8575982455297453	11
evm.model.contig_2386.2	no hits & (original description: no original description)		0.8525909681857803	21
evm.model.contig_2169.5	no hits & (original description: no original description)		0.8517869109999745	25
evm.model.contig_541.4	no hits & (original description: no original description)		0.8507319178987114	38
evm.model.contig_917.1	no hits & (original description: no original description)		0.8499205637531934	30
evm.model.contig_3528.2	(at4g38240 : 116.0) Encodes N-acetyl glucosaminyl transferase I, the first enzyme in the pathway of complex glycan biosynthesis.; COMPLEX GLYCAN LESS 1 (CGL1); CONTAINS InterPro DOMAIN/s: Glycosyl transferase, family 13 (InterPro:IPR004139). & (reliability: 232.0) & (original description: no original description)		0.8494123994455415	19
evm.model.contig_2300.1	(at5g25150 : 83.6) Encodes a putative TATA-binding-protein associated factor TAF5. TAFs are subunits of the general transcription factor IID (TFIID).; TBP-associated factor 5 (TAF5); FUNCTIONS IN: transcription regulator activity, nucleotide binding; INVOLVED IN: regulation of transcription; LOCATED IN: nucleus; EXPRESSED IN: guard cell, root, inflorescence, cultured cell, leaf; CONTAINS InterPro DOMAIN/s: WD40 repeat 2 (InterPro:IPR019782), WD40 repeat, conserved site (InterPro:IPR019775), WD40 repeat (InterPro:IPR001680), G-protein beta WD-40 repeat, region (InterPro:IPR020472), WD40 repeat-like-containing domain (InterPro:IPR011046), WD40-repeat-containing domain (InterPro:IPR017986), WD40/YVTN repeat-like-containing domain (InterPro:IPR015943), WD40 repeat, subgroup (InterPro:IPR019781), TFIID subunit, WD40-associated region (InterPro:IPR007582); BEST Arabidopsis thaliana protein match is: Transducin/WD40 repeat-like superfamily protein (TAIR:AT3G49660.1); Has 114463 Blast hits to 42274 proteins in 991 species: Archae - 68; Bacteria - 11258; Metazoa - 46869; Fungi - 25620; Plants - 15010; Viruses - 3; Other Eukaryotes - 15635 (source: NCBI BLink). & (q39336|gblp_brana : 81.3) Guanine nucleotide-binding protein subunit beta-like protein - Brassica napus (Rape) & (reliability: 167.2) & (original description: no original description)		0.8460684910716576	11
evm.model.contig_2092.9	(original description: no original description)		0.8454474988893166	12
evm.model.contig_2012.1	(at5g66120 : 352.0) 3-dehydroquinate synthase, putative; FUNCTIONS IN: 3-dehydroquinate synthase activity; INVOLVED IN: aromatic amino acid family biosynthetic process; LOCATED IN: chloroplast; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 14 growth stages; CONTAINS InterPro DOMAIN/s: 3-dehydroquinate synthase AroB, subgroup (InterPro:IPR016037), 3-dehydroquinate synthase AroB (InterPro:IPR002658); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). & (reliability: 704.0) & (original description: no original description)		0.8393344680303658	55
evm.model.contig_2099.4	no hits & (original description: no original description)		0.8362614188612927	31
evm.model.contig_507.2	no hits & (original description: no original description)		0.8332103896933123	36
evm.model.contig_612.2	(at5g61540 : 138.0) N-terminal nucleophile aminohydrolases (Ntn hydrolases) superfamily protein; FUNCTIONS IN: asparaginase activity, hydrolase activity; INVOLVED IN: glycoprotein catabolic process; EXPRESSED IN: 21 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Peptidase T2, asparaginase 2 (InterPro:IPR000246); BEST Arabidopsis thaliana protein match is: N-terminal nucleophile aminohydrolases (Ntn hydrolases) superfamily protein (TAIR:AT3G16150.1); Has 2748 Blast hits to 2738 proteins in 812 species: Archae - 113; Bacteria - 1407; Metazoa - 453; Fungi - 78; Plants - 148; Viruses - 0; Other Eukaryotes - 549 (source: NCBI BLink). & (reliability: 276.0) & (original description: no original description)		0.8275580830381561	73
evm.model.contig_636.3	no hits & (original description: no original description)		0.8245657711474773	29
evm.model.contig_3652.1	no hits & (original description: no original description)		0.823961834406522	18
evm.model.contig_2027.6	no hits & (original description: no original description)		0.8216229427321292	46
evm.model.contig_2272.5	no hits & (original description: no original description)		0.8182888394183326	20
evm.model.contig_2069.2	no hits & (original description: no original description)		0.8126957163510862	27
evm.model.contig_3560.5	no hits & (original description: no original description)		0.8095873258417459	22
evm.model.contig_2027.4	no hits & (original description: no original description)		0.8092270931180999	23
evm.model.contig_2099.3	no hits & (original description: no original description)		0.8061562848386467	35
evm.model.contig_2396.3	no hits & (original description: no original description)		0.8027630989603824	43
evm.model.contig_4427.7	no hits & (original description: no original description)		0.8021015801988753	74
evm.model.contig_765.1	no hits & (original description: no original description)		0.8004387544211461	43
evm.model.contig_2059.14	(at5g56760 : 187.0) Encodes a cytosolic serine O-acetyltransferase involved in sulfur assimilation and cysteine biosynthesis. Expressed in the vascular system.; serine acetyltransferase 1;1 (SERAT1;1); FUNCTIONS IN: serine O-acetyltransferase activity; INVOLVED IN: cysteine biosynthetic process from serine; LOCATED IN: cytosol; EXPRESSED IN: 26 plant structures; EXPRESSED DURING: 15 growth stages; CONTAINS InterPro DOMAIN/s: Hexapeptide transferase, conserved site (InterPro:IPR018357), Serine O-acetyltransferase (InterPro:IPR005881), Trimeric LpxA-like (InterPro:IPR011004), Bacterial transferase hexapeptide repeat (InterPro:IPR001451), Serine acetyltransferase, N-terminal (InterPro:IPR010493); BEST Arabidopsis thaliana protein match is: serine acetyltransferase 2;2 (TAIR:AT3G13110.1); Has 18874 Blast hits to 18857 proteins in 2524 species: Archae - 292; Bacteria - 13784; Metazoa - 5; Fungi - 219; Plants - 250; Viruses - 18; Other Eukaryotes - 4306 (source: NCBI BLink). & (reliability: 374.0) & (original description: no original description)		0.7999233420692131	56
evm.model.contig_2069.9	no hits & (original description: no original description)		0.7970629592607527	64
evm.model.contig_2032.26	(at5g61450 : 160.0) P-loop containing nucleoside triphosphate hydrolases superfamily protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; EXPRESSED IN: 24 plant structures; EXPRESSED DURING: 12 growth stages; Has 1807 Blast hits to 1807 proteins in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink). & (reliability: 320.0) & (original description: no original description)		0.7968340327630791	30
evm.model.contig_2468.8	no hits & (original description: no original description)		0.7942323663733556	93
evm.model.contig_2185.3	no hits & (original description: no original description)		0.7899030034522273	43
evm.model.contig_2687.2	(at5g02710 : 112.0) unknown protein; LOCATED IN: chloroplast; EXPRESSED IN: 21 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Uncharacterised protein family UPF0153 (InterPro:IPR005358); Has 240 Blast hits to 240 proteins in 73 species: Archae - 10; Bacteria - 110; Metazoa - 0; Fungi - 0; Plants - 25; Viruses - 0; Other Eukaryotes - 95 (source: NCBI BLink). & (reliability: 224.0) & (original description: no original description)		0.789111721954048	81
evm.model.contig_4622.1	no hits & (original description: no original description)		0.7890725310829368	60
evm.model.contig_4416.15	no hits & (original description: no original description)		0.7879068429434426	40
evm.model.contig_512.12	no hits & (original description: no original description)		0.7875864449260952	59
evm.model.contig_4427.2	no hits & (original description: no original description)		0.7843736064761535	93
evm.model.contig_435.3	no hits & (original description: no original description)		0.7823070626671073	43
evm.model.contig_3571.2	(at3g07270 : 113.0) GTP cyclohydrolase I; CONTAINS InterPro DOMAIN/s: GTP cyclohydrolase I/Nitrile oxidoreductase (InterPro:IPR020602); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). & (reliability: 226.0) & (original description: no original description)		0.780685560990957	45
evm.model.contig_2076.2	(at3g05170 : 162.0) Phosphoglycerate mutase family protein; FUNCTIONS IN: catalytic activity; INVOLVED IN: metabolic process; LOCATED IN: cellular_component unknown; EXPRESSED IN: 17 plant structures; EXPRESSED DURING: 6 growth stages; CONTAINS InterPro DOMAIN/s: Histidine phosphatase superfamily, clade-1 (InterPro:IPR013078), Phosphoglycerate/bisphosphoglycerate mutase, active site (InterPro:IPR001345); BEST Arabidopsis thaliana protein match is: Phosphoglycerate mutase family protein (TAIR:AT1G08940.1); Has 1056 Blast hits to 1046 proteins in 414 species: Archae - 2; Bacteria - 582; Metazoa - 32; Fungi - 206; Plants - 86; Viruses - 0; Other Eukaryotes - 148 (source: NCBI BLink). & (reliability: 312.0) & (original description: no original description)		0.7796957381714082	46
evm.model.contig_2027.3	no hits & (original description: no original description)		0.7773429114446496	75
evm.model.contig_2070.18	(at1g18490 : 87.4) Protein of unknown function (DUF1637); FUNCTIONS IN: cysteamine dioxygenase activity; INVOLVED IN: oxidation reduction; LOCATED IN: cellular_component unknown; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF1637 (InterPro:IPR012864); BEST Arabidopsis thaliana protein match is: Protein of unknown function (DUF1637) (TAIR:AT5G39890.1); Has 360 Blast hits to 360 proteins in 93 species: Archae - 0; Bacteria - 0; Metazoa - 102; Fungi - 0; Plants - 224; Viruses - 0; Other Eukaryotes - 34 (source: NCBI BLink). & (reliability: 174.8) & (original description: no original description)		0.7709033068682639	52
evm.model.contig_4506.5	no hits & (original description: no original description)		0.7693149323769521	85
evm.model.contig_2173.16	no hits & (original description: no original description)		0.7646356843294662	60
evm.model.contig_2179.2	no hits & (original description: no original description)		0.7628720456438688	59
evm.model.contig_2216.2	no hits & (original description: no original description)		0.7613849483367567	60
evm.model.contig_3401.22	no hits & (original description: no original description)		0.7592910477272344	61
evm.model.contig_3425.3	(at4g14700 : 165.0) Encodes origin of replication complex 1a subunit. The protein contains a PHD domain, binds methylated DNA and appears to function as a transcriptional activator.; origin recognition complex 1 (ORC1A); CONTAINS InterPro DOMAIN/s: ATPase, AAA-type, core (InterPro:IPR003959), Zinc finger, PHD-type, conserved site (InterPro:IPR019786), Zinc finger, PHD-type (InterPro:IPR001965), Origin recognition complex, subunit 1 (InterPro:IPR020793), ATPase, AAA+ type, core (InterPro:IPR003593), Bromo adjacent homology (BAH) domain (InterPro:IPR001025), Zinc finger, FYVE/PHD-type (InterPro:IPR011011), Zinc finger, PHD-finger (InterPro:IPR019787); BEST Arabidopsis thaliana protein match is: origin of replication complex 1B (TAIR:AT4G12620.1); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). & (reliability: 330.0) & (original description: no original description)		0.7514260662030215	66
evm.model.contig_2626.1	(at5g12040 : 297.0) Nitrilase/cyanide hydratase and apolipoprotein N-acyltransferase family protein; FUNCTIONS IN: hydrolase activity, acting on carbon-nitrogen (but not peptide) bonds, zinc ion binding; INVOLVED IN: nitrogen compound metabolic process; LOCATED IN: chloroplast; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Nitrilase/cyanide hydratase and apolipoprotein N-acyltransferase (InterPro:IPR003010); BEST Arabidopsis thaliana protein match is: Nitrilase/cyanide hydratase and apolipoprotein N-acyltransferase family protein (TAIR:AT4G08790.1); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). & (q3hvn1|agub_soltu : 87.8) N-carbamoylputrescine amidase (EC 3.5.1.53) - Solanum tuberosum (Potato) & (reliability: 594.0) & (original description: no original description)		0.7500130234189869	67
evm.model.contig_2130.4	no hits & (original description: no original description)		0.7487516871702841	68
evm.model.contig_802.1	(original description: no original description)		0.7430597648929832	72
evm.model.contig_2024.24	no hits & (original description: no original description)		0.7419541273574676	86
evm.model.contig_4426.10	(q42698|ggpps_catro : 268.0) Geranylgeranyl pyrophosphate synthetase, chloroplast precursor (GGPP synthetase) (GGPS) [Includes: Dimethylallyltranstransferase (EC 2.5.1.1); Geranyltranstransferase (EC 2.5.1.10); Farnesyltranstransferase (EC 2.5.1.29)] - Catharanthu & (at4g36810 : 254.0) Encodes a protein with geranylgeranyl pyrophosphate synthase activity involved in isoprenoid biosynthesis. The enzyme appears to be targeted to the chloroplast in epidermal cells and guard cells of leaves, and in etioplasts in roots.; geranylgeranyl pyrophosphate synthase 1 (GGPS1); FUNCTIONS IN: farnesyltranstransferase activity; INVOLVED IN: isoprenoid biosynthetic process; LOCATED IN: etioplast, chloroplast; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Polyprenyl synthetase-related (InterPro:IPR017446), Terpenoid synthase (InterPro:IPR008949), Polyprenyl synthetase (InterPro:IPR000092); BEST Arabidopsis thaliana protein match is: Terpenoid synthases superfamily protein (TAIR:AT2G18620.1); Has 16617 Blast hits to 16612 proteins in 2936 species: Archae - 341; Bacteria - 9385; Metazoa - 291; Fungi - 423; Plants - 452; Viruses - 12; Other Eukaryotes - 5713 (source: NCBI BLink). & (reliability: 508.0) & (original description: no original description)		0.740842627486796	76
evm.model.contig_815.2	no hits & (original description: no original description)		0.7392688153570006	79
evm.model.contig_2348.1	(original description: no original description)		0.7388681953372798	80
evm.model.contig_2022.4	no hits & (original description: no original description)		0.7381418862774543	88
evm.model.contig_2109.6	no hits & (original description: no original description)		0.7364229757911608	85
evm.model.contig_2066.2	no hits & (original description: no original description)		0.7354852118370683	90
evm.model.contig_2171.4	no hits & (original description: no original description)		0.734776427601801	91
evm.model.contig_690.1	(at3g55480 : 157.0) protein affected trafficking 2 (PAT2); FUNCTIONS IN: protein transporter activity, binding; INVOLVED IN: intracellular protein transport, vesicle-mediated transport, endocytosis, protein transport; LOCATED IN: membrane coat, Golgi apparatus; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Adaptor protein complex AP-3, beta subunit (InterPro:IPR017108), Armadillo-like helical (InterPro:IPR011989), Armadillo-type fold (InterPro:IPR016024), Clathrin/coatomer adaptor, adaptin-like, N-terminal (InterPro:IPR002553); BEST Arabidopsis thaliana protein match is: Adaptin family protein (TAIR:AT4G23460.1); Has 2490 Blast hits to 1771 proteins in 223 species: Archae - 0; Bacteria - 20; Metazoa - 1008; Fungi - 721; Plants - 224; Viruses - 0; Other Eukaryotes - 517 (source: NCBI BLink). & (reliability: 314.0) & (original description: no original description)		0.7333844308844721	92
evm.model.contig_787.2	no hits & (original description: no original description)		0.7311015522815648	94
evm.model.contig_4416.4	no hits & (original description: no original description)		0.7307345995022319	95
evm.model.contig_787.3	no hits & (original description: no original description)		0.7297014233918867	96
evm.model.contig_2292.6	(at5g64070 : 298.0) Encodes a phosphatidylinositol 4-OH kinase, PI-4Kbeta1. Arabidopsis contains 12 PI-4Ks in three separate families: PI-4Kalpha, PI-4kbeta, and PI-4Kgamma. PI-4Kbeta1 is 83% identical to PI-4kbeta2 encoded by At5g09350. Interacts with the RabA4b GTPase. Important for polarized root hair growth as the loss of this gene and its close relative PI-4kbeta2, leads to the formation of abnormal root hairs.; phosphatidylinositol 4-OH kinase beta1 (PI-4KBETA1); FUNCTIONS IN: 1-phosphatidylinositol 4-kinase activity; INVOLVED IN: phosphoinositide biosynthetic process, root hair cell tip growth, pollen tube growth; LOCATED IN: cytosol, nucleus, membrane; EXPRESSED IN: male gametophyte, root hair tip, cultured cell, pollen tube; EXPRESSED DURING: L mature pollen stage, M germinated pollen stage; CONTAINS InterPro DOMAIN/s: Phosphatidylinositol 3-/4-kinase, catalytic (InterPro:IPR000403), Phosphatidylinositol Kinase (InterPro:IPR015433), Armadillo-type fold (InterPro:IPR016024), Phosphatidylinositol 3/4-kinase, conserved site (InterPro:IPR018936), Protein kinase-like domain (InterPro:IPR011009); BEST Arabidopsis thaliana protein match is: phosphatidylinositol 4-OH kinase beta2 (TAIR:AT5G09350.1); Has 1807 Blast hits to 1807 proteins in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink). & (p42347|pi3k1_soybn : 122.0) Phosphatidylinositol 3-kinase, root isoform (EC 2.7.1.137) (PI3-kinase) (PtdIns-3-kinase) (PI3K) (SPI3K-5) - Glycine max (Soybean) & (reliability: 596.0) & (original description: no original description)		0.7246432856169404	99