Sequence Description Alias PCC hrr evm.model.contig_4416.8 (at2g32000 : 337.0) DNA topoisomerase, type IA, core; FUNCTIONS IN: DNA topoisomerase activity, DNA topoisomerase type I activity, DNA binding, nucleic acid binding; INVOLVED IN: DNA topological change, DNA unwinding involved in replication, DNA metabolic process; LOCATED IN: endomembrane system, chromosome; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Toprim domain, subgroup (InterPro:IPR006154), DNA topoisomerase, type IA, central region, subdomain 1 (InterPro:IPR013824), DNA topoisomerase, type IA, core (InterPro:IPR000380), Toprim domain (InterPro:IPR006171), DNA topoisomerase, type IA, DNA-binding (InterPro:IPR003602), DNA topoisomerase, type IA, domain 2 (InterPro:IPR003601), DNA topoisomerase, type IA, central (InterPro:IPR013497), DNA topoisomerase, type IA, central region, subdomain 3 (InterPro:IPR013826); BEST Arabidopsis thaliana protein match is: topoisomerase 3alpha (TAIR:AT5G63920.1); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). & (p46942|db10_nicsy : 193.0) ATP-dependent RNA helicase-like protein DB10 (EC 3.6.1.-) - Nicotiana sylvestris (Wood tobacco) & (reliability: 674.0) & (original description: no original description) 0.9502413346891319 17 evm.model.contig_2090.30 (p56317|clpp_chlvu : 257.0) ATP-dependent Clp protease proteolytic subunit (EC 3.4.21.92) (Endopeptidase Clp) - Chlorella vulgaris (Green alga) & (atcg00670 : 207.0) Encodes the only ClpP (caseinolytic protease) encoded within the plastid genome. Contains a highly conserved catalytic triad of Ser-type proteases (Ser-His-Asp). Part of the 350 kDa chloroplast Clp complex. The name reflects nomenclature described in Adam et. al (2001).; plastid-encoded CLP P (PCLPP); FUNCTIONS IN: serine-type peptidase activity; INVOLVED IN: proteolysis; LOCATED IN: chloroplast thylakoid membrane, chloroplastic endopeptidase Clp complex, plastid stroma, chloroplast, chloroplast stroma; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Peptidase S14, ClpP, active site (InterPro:IPR018215), Peptidase S14, ClpP (InterPro:IPR001907); BEST Arabidopsis thaliana protein match is: CLP protease proteolytic subunit 2 (TAIR:AT1G12410.1). & (reliability: 414.0) & (original description: no original description) 0.9474386255231204 3 evm.model.contig_2538.3 no hits & (original description: no original description) 0.944386381851087 19 evm.model.contig_3401.16 (at5g06060 : 206.0) NAD(P)-binding Rossmann-fold superfamily protein; FUNCTIONS IN: oxidoreductase activity, binding, catalytic activity; INVOLVED IN: oxidation reduction, metabolic process; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 14 growth stages; CONTAINS InterPro DOMAIN/s: Short-chain dehydrogenase/reductase, conserved site (InterPro:IPR020904), NAD(P)-binding domain (InterPro:IPR016040), Glucose/ribitol dehydrogenase (InterPro:IPR002347), Short-chain dehydrogenase/reductase SDR (InterPro:IPR002198); BEST Arabidopsis thaliana protein match is: NAD(P)-binding Rossmann-fold superfamily protein (TAIR:AT2G29290.2); Has 1807 Blast hits to 1807 proteins in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink). & (q75kh3|grdh_orysa : 115.0) Glucose and ribitol dehydrogenase homolog (EC 1.1.1.-) - Oryza sativa (Rice) & (reliability: 412.0) & (original description: no original description) 0.9434169264872779 27 evm.model.contig_2201.1 (p46279|rpb7_soybn : 140.0) DNA-directed RNA polymerase II 19 kDa polypeptide (EC 2.7.7.6) (RNA polymerase II subunit 5) - Glycine max (Soybean) & (at5g59180 : 133.0) Non-catalytic subunit specific to DNA-directed RNA polymerase II; the ortholog of budding yeast RPB7; NRPB7; FUNCTIONS IN: DNA-directed RNA polymerase activity, RNA binding; INVOLVED IN: transcription; LOCATED IN: DNA-directed RNA polymerase II, core complex; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Nucleic acid-binding, OB-fold (InterPro:IPR012340), Ribosomal protein S1, RNA-binding domain (InterPro:IPR003029), RNA polymerase Rpb7, N-terminal (InterPro:IPR005576); BEST Arabidopsis thaliana protein match is: RNA polymerase Rpb7-like, N-terminal domain (TAIR:AT4G14660.1); Has 1807 Blast hits to 1807 proteins in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink). & (reliability: 266.0) & (original description: no original description) 0.9432825177252306 57 evm.model.contig_4430.5 (at4g01940 : 105.0) Encodes a protein containing the NFU domain that may be involved in iron-sulfur cluster assembly. Part of a five member gene family, more closely related to NFU2 and 3 than to NFU4 and 5. Targeted to the chloroplast.; NFU domain protein 1 (NFU1); CONTAINS InterPro DOMAIN/s: NIF system FeS cluster assembly, NifU, C-terminal (InterPro:IPR001075); BEST Arabidopsis thaliana protein match is: NIFU-like protein 2 (TAIR:AT5G49940.1); Has 4565 Blast hits to 4561 proteins in 1155 species: Archae - 11; Bacteria - 2225; Metazoa - 159; Fungi - 160; Plants - 186; Viruses - 3; Other Eukaryotes - 1821 (source: NCBI BLink). & (q84lk7|nifu1_orysa : 85.5) NifU-like protein 1, chloroplast precursor (OsNifu1) - Oryza sativa (Rice) & (reliability: 210.0) & (original description: no original description) 0.9417194101390306 35 evm.model.contig_4506.3 (at3g02320 : 263.0) N2,N2-dimethylguanosine tRNA methyltransferase; FUNCTIONS IN: RNA binding, tRNA (guanine-N2-)-methyltransferase activity; INVOLVED IN: tRNA processing; LOCATED IN: cellular_component unknown; EXPRESSED IN: 15 plant structures; EXPRESSED DURING: 6 growth stages; CONTAINS InterPro DOMAIN/s: N2,N2-dimethylguanosine tRNA methyltransferase (InterPro:IPR002905); BEST Arabidopsis thaliana protein match is: N2,N2-dimethylguanosine tRNA methyltransferase (TAIR:AT5G15810.1); Has 1019 Blast hits to 973 proteins in 363 species: Archae - 257; Bacteria - 68; Metazoa - 198; Fungi - 156; Plants - 105; Viruses - 0; Other Eukaryotes - 235 (source: NCBI BLink). & (reliability: 526.0) & (original description: no original description) 0.9401102108693262 21 evm.model.contig_2345.4 (at5g09810 : 108.0) Member of Actin gene family.Mutants are defective in germination and root growth.; actin 7 (ACT7); FUNCTIONS IN: protein binding, structural constituent of cytoskeleton; INVOLVED IN: in 9 processes; LOCATED IN: mitochondrion, nucleolus, cell wall, cytoskeleton, plasma membrane; EXPRESSED IN: 29 plant structures; EXPRESSED DURING: 14 growth stages; CONTAINS InterPro DOMAIN/s: Actin, conserved site (InterPro:IPR004001), Actin/actin-like (InterPro:IPR004000), Actin/actin-like conserved site (InterPro:IPR020902); BEST Arabidopsis thaliana protein match is: actin 3 (TAIR:AT3G53750.1); Has 15241 Blast hits to 14839 proteins in 3047 species: Archae - 8; Bacteria - 21; Metazoa - 5732; Fungi - 5247; Plants - 1603; Viruses - 2; Other Eukaryotes - 2628 (source: NCBI BLink). & (p30165|act2_pea : 108.0) Actin-2 - Pisum sativum (Garden pea) & (reliability: 216.0) & (original description: no original description) 0.939792516653037 81 evm.model.contig_3656.2 (at2g29690 : 465.0) Encode a functional anthranilate synthase protein. Expressed at a constitutive basal level. Expression was not induced by wounding nor bacterial pathogen infiltration. Involved in aromatic amino acid biosynthesis.; anthranilate synthase 2 (ASA2); FUNCTIONS IN: anthranilate synthase activity; INVOLVED IN: tryptophan biosynthetic process, aromatic amino acid family biosynthetic process; LOCATED IN: chloroplast, anthranilate synthase complex; EXPRESSED IN: 21 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Anthranilate synthase component I, N-terminal (InterPro:IPR006805), Chorismate binding, C-terminal (InterPro:IPR015890), ADC synthase (InterPro:IPR005801), Anthranilate synthase component I (InterPro:IPR019999), Anthranilate synthase component I, PabB-like (InterPro:IPR005256); BEST Arabidopsis thaliana protein match is: anthranilate synthase alpha subunit 1 (TAIR:AT5G05730.1); Has 16475 Blast hits to 16472 proteins in 2614 species: Archae - 246; Bacteria - 11051; Metazoa - 5; Fungi - 314; Plants - 193; Viruses - 0; Other Eukaryotes - 4666 (source: NCBI BLink). & (reliability: 930.0) & (original description: no original description) 0.9386757006546298 36 evm.model.contig_2121.2 no hits & (original description: no original description) 0.9373033300089554 10 evm.model.contig_2042.5 no hits & (original description: no original description) 0.9344781801451448 41 evm.model.contig_4464.4 no hits & (original description: no original description) 0.9338270857128045 90 evm.model.contig_3383.12 no hits & (original description: no original description) 0.9336464039188707 87 evm.model.contig_2075.1 (at5g52820 : 508.0) WD-40 repeat family protein / notchless protein, putative; CONTAINS InterPro DOMAIN/s: WD40 repeat 2 (InterPro:IPR019782), WD40 repeat, conserved site (InterPro:IPR019775), NLE (InterPro:IPR012972), WD40 repeat (InterPro:IPR001680), G-protein, beta subunit (InterPro:IPR001632), G-protein beta WD-40 repeat, region (InterPro:IPR020472), WD40 repeat-like-containing domain (InterPro:IPR011046), WD40-repeat-containing domain (InterPro:IPR017986), WD40/YVTN repeat-like-containing domain (InterPro:IPR015943), WD40 repeat, subgroup (InterPro:IPR019781); BEST Arabidopsis thaliana protein match is: Transducin/WD40 repeat-like superfamily protein (TAIR:AT3G49660.1); Has 1807 Blast hits to 1807 proteins in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink). & (p93340|gblp_nicpl : 111.0) Guanine nucleotide-binding protein subunit beta-like protein - Nicotiana plumbaginifolia (Leadwort-leaved tobacco) & (reliability: 1016.0) & (original description: no original description) 0.9246771881320668 97 evm.model.contig_4624.1 (at1g48850 : 470.0) embryo defective 1144 (EMB1144); FUNCTIONS IN: chorismate synthase activity; INVOLVED IN: aromatic amino acid family biosynthetic process, embryo development ending in seed dormancy; LOCATED IN: nucleolus, chloroplast; EXPRESSED IN: 25 plant structures; EXPRESSED DURING: 15 growth stages; CONTAINS InterPro DOMAIN/s: Chorismate synthase, conserved site (InterPro:IPR020541), Chorismate synthase (InterPro:IPR000453); BEST Arabidopsis thaliana protein match is: RNA 3'-terminal phosphate cyclase/enolpyruvate transferase, alpha/beta (TAIR:AT1G48860.1); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). & (reliability: 940.0) & (original description: no original description) 0.9231553701574993 22 evm.model.contig_4420.1 (q9xhm1|if38_medtr : 393.0) Eukaryotic translation initiation factor 3 subunit 8 (eIF3 p110) (eIF3c) - Medicago truncatula (Barrel medic) & (at3g56150 : 366.0) member of eIF3c - eukaryotic initiation factor 3c; eukaryotic translation initiation factor 3C (EIF3C); FUNCTIONS IN: translation initiation factor activity; INVOLVED IN: translational initiation; LOCATED IN: cytosol, nucleus; EXPRESSED IN: 25 plant structures; EXPRESSED DURING: 15 growth stages; CONTAINS InterPro DOMAIN/s: Winged helix-turn-helix transcription repressor DNA-binding (InterPro:IPR011991), Proteasome component (PCI) domain (InterPro:IPR000717), Eukaryotic translation initiation factor 3 subunit 8, N-terminal (InterPro:IPR008905); BEST Arabidopsis thaliana protein match is: eukaryotic translation initiation factor 3 subunit C2 (TAIR:AT3G22860.1). & (reliability: 732.0) & (original description: no original description) 0.9229944650102424 33 evm.model.contig_3558.3 (at1g29880 : 539.0) glycyl-tRNA synthetase / glycine--tRNA ligase; FUNCTIONS IN: glycine-tRNA ligase activity, nucleotide binding, aminoacyl-tRNA ligase activity, ATP binding; INVOLVED IN: response to cadmium ion, glycyl-tRNA aminoacylation; LOCATED IN: mitochondrion; EXPRESSED IN: 24 plant structures; EXPRESSED DURING: 14 growth stages; CONTAINS InterPro DOMAIN/s: Aminoacyl-tRNA synthetase, class II (G/ H/ P/ S), conserved domain (InterPro:IPR002314), Glycyl-tRNA synthetase, alpha2 dimer (InterPro:IPR002315), S15/NS1, RNA-binding (InterPro:IPR009068), Glycyl-tRNA synthetase, alpha2 dimer, C-terminal (InterPro:IPR018160), Anticodon-binding (InterPro:IPR004154), WHEP-TRS (InterPro:IPR000738), Aminoacyl-tRNA synthetase, class II, conserved domain (InterPro:IPR006195); BEST Arabidopsis thaliana protein match is: tRNA synthetase class II (G, H, P and S) family protein (TAIR:AT1G29870.1); Has 6392 Blast hits to 4464 proteins in 1181 species: Archae - 269; Bacteria - 3324; Metazoa - 242; Fungi - 171; Plants - 75; Viruses - 0; Other Eukaryotes - 2311 (source: NCBI BLink). & (reliability: 1078.0) & (original description: no original description) 0.9204409040767194 57 evm.model.contig_2070.4 no hits & (original description: no original description) 0.919250738799087 30 evm.model.contig_2065.4 (at5g65687 : 160.0) Major facilitator superfamily protein; FUNCTIONS IN: carbohydrate transmembrane transporter activity, sugar:hydrogen symporter activity; INVOLVED IN: transmembrane transport; LOCATED IN: plasma membrane, membrane; CONTAINS InterPro DOMAIN/s: Major facilitator superfamily (InterPro:IPR020846), Major facilitator superfamily MFS-1 (InterPro:IPR011701), Major facilitator superfamily, general substrate transporter (InterPro:IPR016196); BEST Arabidopsis thaliana protein match is: Major facilitator superfamily protein (TAIR:AT2G22730.1); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). & (reliability: 320.0) & (original description: no original description) 0.919169249852899 32 evm.model.contig_2111.8 no hits & (original description: no original description) 0.9186767664733484 58 evm.model.contig_3383.5 no hits & (original description: no original description) 0.9158889366029098 41 evm.model.contig_3468.10 (at5g42540 : 250.0) Encodes a protein with similarity to yeast 5'-3'exonucleases and can functionally complement the yeast mutations. In Arabidopsis XRN2 acts as a suppressor of posttranscriptional gene silencing.; exoribonuclease 2 (XRN2); CONTAINS InterPro DOMAIN/s: 5'-3' exoribonuclease 2 (InterPro:IPR017151), Zinc finger, CCHC-type (InterPro:IPR001878), Putative 5-3 exonuclease (InterPro:IPR004859); BEST Arabidopsis thaliana protein match is: 5'-3' exoribonuclease 3 (TAIR:AT1G75660.1). & (reliability: 500.0) & (original description: no original description) 0.9110959171154833 48 evm.model.contig_473.4 (at5g08470 : 255.0) an AAA-ATPase that is the probable Arabidopsis orthologue of one of the AAA-ATPases involved in peroxisome biogenesis in yeasts and mammals.; peroxisome 1 (PEX1); FUNCTIONS IN: nucleoside-triphosphatase activity, ATPase activity, binding, nucleotide binding, ATP binding; INVOLVED IN: protein import into peroxisome matrix, fatty acid beta-oxidation, response to stress; LOCATED IN: peroxisome; EXPRESSED IN: 14 plant structures; EXPRESSED DURING: 6 growth stages; CONTAINS InterPro DOMAIN/s: ATPase, AAA+ type, core (InterPro:IPR003593), ATPase, AAA-type, core (InterPro:IPR003959), Aspartate decarboxylase-like fold (InterPro:IPR009010), ATPase, AAA-type, conserved site (InterPro:IPR003960), Peroxisome biogenesis factor 1, N-terminal (InterPro:IPR015342); BEST Arabidopsis thaliana protein match is: ATPase, AAA-type, CDC48 protein (TAIR:AT3G53230.1); Has 45022 Blast hits to 26157 proteins in 3042 species: Archae - 1897; Bacteria - 15808; Metazoa - 7500; Fungi - 5597; Plants - 4311; Viruses - 29; Other Eukaryotes - 9880 (source: NCBI BLink). & (p54774|cdc48_soybn : 198.0) Cell division cycle protein 48 homolog (Valosin-containing protein homolog) (VCP) - Glycine max (Soybean) & (reliability: 510.0) & (original description: no original description) 0.9104749557873356 50 evm.model.contig_3502.4 no hits & (original description: no original description) 0.9092390193648242 56 evm.model.contig_2355.1 no hits & (original description: no original description) 0.9081158477915315 57 evm.model.contig_2015.12 no hits & (original description: no original description) 0.9032110605643946 65 evm.model.contig_3387.3 (at2g37600 : 125.0) Ribosomal protein L36e family protein; FUNCTIONS IN: structural constituent of ribosome; INVOLVED IN: translation; LOCATED IN: ribosome, cytosolic large ribosomal subunit; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Ribosomal protein L36e (InterPro:IPR000509); BEST Arabidopsis thaliana protein match is: Ribosomal protein L36e family protein (TAIR:AT3G53740.4); Has 756 Blast hits to 755 proteins in 263 species: Archae - 0; Bacteria - 0; Metazoa - 355; Fungi - 140; Plants - 140; Viruses - 0; Other Eukaryotes - 121 (source: NCBI BLink). & (p52866|rl36_dauca : 103.0) 60S ribosomal protein L36 - Daucus carota (Carrot) & (reliability: 250.0) & (original description: no original description) 0.9011516116929325 86 evm.model.contig_3750.1 no hits & (original description: no original description) 0.8983643624599496 86 evm.model.contig_470.1 no hits & (original description: no original description) 0.895836264386397 82 evm.model.contig_2286.25 no hits & (original description: no original description) 0.8910794593702411 96