The MEROPS Specificity Search

What the search can NOT tell you!

Despite all the study of specificity of endopeptidases, it is still difficult if not impossible to predict where a native protein will be cleaved by a given enzyme. Amongst other factors, the tertiary structure of the substrate is important, and data obtained with small peptide substrates commonly do not relate directly to those for proteins. However, this search will allow the user to find cleavage sites for peptides, polypeptides and proteins in our collection of data from the literature. A description of the standard terminology for the specificity subsites of peptidases can be found here.

How to use the search

Please specify a peptide bond cleavage by selecting symbols to populate one or more of the boxes located on each side of the bond to be cleaved (marked ?). The symbols define the structure of the substrate, and their meanings are described fully below. The final specification will contain three elements. These have default settings that assume endopeptidase cleavage, so positions that contain unspecified amino acids may be left blank, but for an exopeptidase, one or two free termini must be indicated with TER. The three elements are:

1. N-terminal element

A single symbol is needed, and the system will use XXX by default. Other symbols that are valid for this element include: BLK, PEP, TER. More specific symbols for individual N-terminal blocking groups exist if needed (ABZ, AC, AHX, BOC, BZ, CPP, DAA, DNP, DNS, FA, FOR, GAB, GLP, MCA, MCC, MSU, PZ, SUC, TOS, Z).

2. Middle element

This will consist of one to eight of the symbols for amino acid residues. The default is XAA. Valid symbols are selected from those for normal amino acids (ALA, ARG, ASN, ASP, CYS, GLN, GLU, GLY, HIS, ILE, LEU, LYS, MET, PHE, PRO, SER, THR, TRP, TYR, VAL), modified or abnormal amino acids (CHA, CYA, DAL, DAR, DFH, DHA, DHB, DLY, DPA, DPH, DPR, DSE, DVA, GLT, HYP, LYD, NAL, NLE, NPH, NVA, PIP, THE, TYS), those for sets of amino acids (ACI, ALI, ARO, ASX, BAS, GLX, HYD), or XAA for any single amino acid.

3. C-terminal element

A single symbol is needed, and the system will use XXX by default. Other symbols that are valid for this element include: BLK, PEP, TER, and more specific symbols for individual C-terminal blocking groups exist if needed (AMC, BZN, EDD, MNA, NAE, NAN, NAO, NAP, NBA, NH2, OET, OME, ONA, OPH, P4P, PAB, PNA, SBZ).

Complete list of symbols

We use the standard three-letter symbols for natural L-amino acids: ALA, ARG, ASN, ASP, CYS, GLN, GLU, GLY, HIS, ILE, LEU, LYS, MET, PHE, PRO, SER, THR, TRP, TYR, VAL. Additional symbols are:

Symbol Definition May be N-terminal May be C-terminal
ABZ o-aminobenzoyl Yes No
AC acetyl Yes No
Ac2 N[alpha]N[epsilon]-diacetyl Yes No
ACI acidic amino acid No No
AHX 2-aminohexanoic acid Yes No
ALI aliphatic amino acid No No
AMC 7-(4-methyl)coumarylamide No Yes
ARO aromatic amino acid No No
ASX aspartate or asparagine No No
BAS basic amino acid No No
BLK any blocking group (N- or C-terminal) Yes Yes
BOC t-butyloxycarbonyl Yes No
BZ benzoyl Yes No
BZN benzylamide No Yes
CHA D-cyclohexylalanine No No
CPP carboxyphenylpropionyl Yes No
CYA cysteic acid No No
DAL D-alanine No No
DAP meso-2,6-diaminopimelyl No No
DAR D-arginine No No
DFH dehydrophenylalanine No No
DGl D-Glu No No
DHA dehydroalanine No No
DHB dehydrobutyrine No No
DLa D-lactate No No
DLY D-lysine No No
DLZ (epsilon-benzyloxycarbonyl)-D-lysine No No
DNP 2,4-dinitrophenyl Yes No
DNS 5-dimethylaminonaphthyl-1-sulfonyl; "dansyl" Yes No
DPA (2,4-dinitrophenyl)-L-2,3-diaminopropionyl No No
DPH D-phenylalanine No No
DPR D-proline No No
DSE D-serine No No
DVA D-valine No No
EDD N-(2,4-dinitrophenyl)ethylenediamine No Yes
FA furylacryloyl Yes No
FMK 7-amino-4-trifluoromethylcoumarin No Yes
FOR formyl Yes No
GAB gamma-amino-n-butyric acid No No
GLP pyrrolidone carboxylyl; pyroglutamyl Yes No
GLT glutaryl Yes No
GLX glutamate or glutamine No No
HYD hydrophobic amino acid No No
HYP hydroxyproline No No
LYD lysine(2,4-dinitrophenyl) No No
MCA (7-methoxy-coumarin-4-yl)acetyl Yes No
MCC 7-methoxycoumarin-3-carboxylyl Yes No
MCH methoxycarbonyl-cyclo-hexylglycyl Yes No
MES methane-sulfonyl Yes No
MNA 2-(4-methoxy)naphthylamide No Yes
MSU methoxysuccinyl Yes No
NAE 2,4-dinitroanilinoethylamide No Yes
NAL beta-naphthylalanine No No
NAN 4-nitroanilide No Yes
NAP 2-naphthylamide No Yes
NBA nitrobenzylamide No Yes
NH2 amide No Yes
NLE norleucine No No
NPH 4-nitrophenylalanine No No
NVA norvaline No No
OET ethyl ester No Yes
OME methyl ester No Yes
ONA 4-nitrophenyl ester No Yes
OPH phenyl ester No Yes
P4P 4-pyridinium alkoxy No Yes
PAB p-aminobenzoate No Yes
PEP peptide (any length) Yes Yes
PIP pipecolyl No No
PIN piperidine amide No No
PZ (4-phenylazo)benzyloxycarbonyl Yes No
SBZ thiobenzyl ester No Yes
SUC 3-carboxypropionyl (succinyl) Yes No
TER free N- or C-terminus Yes Yes
TOS toluene-p-sulfonyl; "tosyl" Yes No
TYS Tyr(SO3H) No No
XAA any single amino acid No No
XXX any group(s) (any number) Yes Yes
Z benzyloxycarbonyl Yes No