>Prot1
HDSGTVVEIGESVTNFEVGDRVAIDPNIY
>Prot2
EFCGTVVETKSSV--FEVGDRVSIFPNIY
>Prot3
-NSGVVAEVGSSVTNVKVGDRVTVEPNIY
>Prot4
EFCGIVMEAGSSVTNIEVADKITGDPNIS
>Prot5
EDSGTVVEIGESVTNFEVGDRVAIDPNIY
>Prot6
-NSGVVAEIGSSVTNVKVGDRVTVSPNIY
Consider the pattern [AG]-x-C-x(4)-{DE}. It is read as: Ala or Gly, any amino acid, Cys, any four amino acids,
then anything but Asp or Glu.
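As a quick illustration, the example pattern can be translated by hand into a regular expression. This is only a sketch: the two test peptides are made up for the example, and the regex `.` accepts any character, not just valid amino acid letters.

```python
import re

# Hand translation of [AG]-x-C-x(4)-{DE}:
#   [AG] -> [AG]   x -> .   C -> C   x(4) -> .{4}   {DE} -> [^DE]
EXAMPLE_REGEX = re.compile(r"[AG].C.{4}[^DE]")

# Hypothetical peptides, invented for this illustration.
print(bool(EXAMPLE_REGEX.fullmatch("ATCLLLLK")))  # True: ends in Lys
print(bool(EXAMPLE_REGEX.fullmatch("ATCLLLLD")))  # False: ends in Asp
```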
Pattern rules:
- A single amino acid character means that only that amino acid is allowed in that position. For example, D-A matches the two amino acids Asp and Ala in sequence.
- A group within [ and ] is a positive group: any of the amino acids within [...] is allowed in that position. For example, [AG] allows Ala or Gly in a single position.
- A group within { and } is a negative group: none of the amino acids within {...} is allowed in that position. For example, {DE} allows anything but Asp or Glu.
- The x character matches any amino acid in a single position.
- A repetition is written with ( and ) after an element and can contain a single number, for example ...-D(4)-..., which corresponds to four repetitions of Asp. A repetition may also be a range, for example ...-x(0,3)-..., which matches between zero and three amino acids of any kind.
- Each group or amino acid is delimited by the - character.
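The rules above map fairly directly onto regular expression syntax. The following is a minimal sketch of such a translation, covering only the rules listed here; the function name `pattern_to_regex` is a hypothetical helper chosen for this example, not part of any existing library.

```python
import re

# One pattern element: x, a single letter, [..] or {..},
# optionally followed by a repetition such as (4) or (0,3).
ELEMENT = re.compile(
    r"(x|[A-Z]|\[[A-Z]+\]|\{[A-Z]+\})(?:\((\d+)(?:,(\d+))?\))?"
)

def pattern_to_regex(pattern: str) -> str:
    """Translate a dash-delimited pattern into Python regex syntax."""
    parts = []
    for element in pattern.split("-"):
        m = ELEMENT.fullmatch(element)
        if m is None:
            raise ValueError(f"cannot parse element: {element!r}")
        core, low, high = m.groups()
        if core == "x":
            atom = "."                        # any amino acid
        elif core.startswith("{"):
            atom = "[^" + core[1:-1] + "]"    # negative group
        else:
            atom = core                       # single letter or [..] group
        if low is not None:
            # (n) becomes {n}; (n,m) becomes {n,m}
            atom += "{" + low + ("," + high if high is not None else "") + "}"
        parts.append(atom)
    return "".join(parts)

print(pattern_to_regex("[AG]-x-C-x(4)-{DE}"))  # [AG].C.{4}[^DE]
```

The resulting string can be passed to `re.search` to scan a sequence. For aligned sequences, gap characters (`-`) would need to be removed first, since the regex `.` would otherwise match them as if they were residues.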