IUB ambiguity codes

Symbols used for describing an oligonucleotide sequence are

A = adenosine
(for the nucleoside - the base is adenine, the nucleotide is adenylic acid)
C = cytidine
(for the nucleoside - the base is cytosine, the nucleotide is cytidylic acid)
G = guanosine
(for the nucleoside - the base is guanine, the nucleotide is guanylic acid)
T = thymidine
(for the nucleoside - the base is thymine, the nucleotide is thymidylic acid)

Symbols for describing mixed base position in an oligonucleotide sequence (ambiguity codes) are


R = A + G	Y = C + T	K = G + T
M = A + C	S = C + G
W = A + T

V = A + C + G	B = C + G + T 	
H = A + C + T		
D = A + G + T

N = A + G + C + T

An additional symbol often used for describing an oligonucleotide sequence

I = inosine
(or hypoxanthosine for the nucleoside - the base is hypoxanthine, the nucleotide is inosinic acid)