ExPASy logo ExPASy Home page Site Map Search ExPASy Contact us PROSITE HAMAP Swiss-Prot
Search for

UniRule User Manual for the Web View

UniRule is a format describing rules used by the UniProt Knowledgebase (UniProtKB) automated annotation projects. It defines annotation that can be propagated to confirmed rule matches (UniProtKB entries). It also includes cases and specific conditions to restrict propagation of the annotation to suitable subsets of member sequences (e.g.: to a taxonomic range, to a metabolic pathway or only if a certain feature exists). These conditional elements are defined using: The rules can be displayed in a user-friendly Web View which consists of the following three main sections and associated sub-sections.
[General rule information] [Propagated annotation] [Additional information]

General rule information

This Header section contains information about the rule itself.
[Accession][Dates][Data Class] [Predictors][Name and function]

Accession

This field indicates the accession number of the rule. It can be in the form MF_xxxxx for the HAMAP families or in the form PRUxxxxx for the ProRule database.

Dates

This field is composed of two lines. The first line indicates the rule creation date; the second corresponds to the last rule revision date.

Data Class

The possible values for this field are:

Predictors

The line(s) Predictors indicates the motif identifier(s) used to trigger the application of the rule. The trigger can be either:

Name and function

These fields are optional for Protein rules and mandatory for Domain and Site rules. They provide respectively the name and the function of the protein, domain or site.

Propagated annotation

This section contains annotation that can be propagated to rule members.
[Identifier][Comments] [Cross-references][Keywords] [Gene Ontology][Features]

Identifier, protein and gene names/ Description

The name and the content of this section depend on the type of rule.
For Protein rules it corresponds to: For Domain and Site rules this is an optional field. It then contains only the part of the description which is common to all rule members preceeded by a plus (+).

Comments

This section contains all applicable comment lines of a UniProtKB entry (see: the CC line section of the UniProt Knowledgebase User Manual).

Cross-references

This section can be used:
  1. To indicate cross-references to domain databases within a UniProtKB entry; currently PROSITE, Pfam, PRINTS, TIGRFAMs and PIRSF (see: the DR line section of the UniProt Knowledgebase User Manual).

    In this case the format is:
    Database Name identifier1; identifier2; number of expected hits;
            Pfam  	 PF02033; RBFA; 1;
            TIGRFAMs TIGR00082; rbfA; 1;
            PROSITE  PS01319; RBFA; 1;
            
  2. To indicate which other rule(s) must be applied to completly annotate the protein or the domain.

    Two main cases can be distinguished:

Gene Ontology

This section contains Cross-references to the Gene Ontology database (GO, http://www.geneontology.org/).

Keywords

This section contains all applicable keywords of a UniProtKB entry (see: the KW line section of the UniProt Knowledgebase User Manual).

Features

This section contains:
  1. Template feature line(s)
    It defines the template for all the subsequent Feature lines. The format is:
    From: template name
    where template name must be one of the following values:
  2. Applicable feature lines that may be applied to UniProtKB entries (e.g. ACT_SITE, METAL, see the FT line section of the UniProt Knowledgebase User Manual).

Conditions may be used in feature lines. They usually correspond to pattern constraints, or to the presence of a specific amino acid.
e.g.
Key 	    	From 	    	To 	      	Description 	  	Condition
DISULFID 	  60 	    	80 	      	By similarity 	  	 C-x*-C
Optional label can be used to indicate the presence of a feature which is not mandatory in the matched sequences.
e.g.
Key  	     	       From  	     	To  	       	Description  	   	Condition
BINDING (Optional) 	153 	       153 	      	ATP (By similarity) 	   [RQ]
Multiple FT lines that should be applied either all together or not at all are grouped within an "FTGroup", to force the common presence of all sites.
e.g.
Key 	    From    To 	   Description 	  	                    Condition 	     FTGroup
ACT_SITE      42    42 	   Charge relay system (By similarity) 	  	H 	  	1
ACT_SITE      91    91 	   Charge relay system (By similarity) 	  	D 	  	1
ACT_SITE     186   186 	   Charge relay system (By similarity) 	  	S               1
This group can then be referenced by case statements in any other annotation section to be propagated.
For instance:
case  <FTGroup:1>
   Protein name 	+ (EC 3.4.21.-)
end case

Additional information

[Size range] [Related UniRules] [Template] [Scope] [Fusion] [Duplicate] [Plasmid encoded] [Repeats] [Topology] [Example] [Comments on the rule] [Member sequences]

ExPASy logo ExPASy Home page Site Map Search ExPASy Contact us PROSITE HAMAP Swiss-Prot
 Hosted by ca flag CBR Canada Mirror sites: Australia  Brazil  China  Korea  Switzerland