IUPAClite Monosaccharide and Glycan Notation¶
GlycReSoft uses glypy’s IUPAClite notation for monosaccharides and glycan compositions.
A monosaccharide is denoted using IUPAC notation, omitting ring shape, anomeric state, chirality,
and modification positions (optionally). For example, a-D-Manp would be written Man, or
b-D-Glcp2NAc would be written GlcNAc.
You can also use generic base types like Hex or Pen for example, to denote a six or five
carbon monosaccharide. The notation is composable, so you can specify an arbitrarily modified
monosaccharide, like HexNAc(S) to specify a sulfated HexNAc, using the parenthesized convention
that separates substituent groups, or dHexN for a deoxy-Hexosamine.
You can also define “floating” substituent groups by prefixing their full lowercase
names with an @-sign, like @sulfate for sulfate or @acetyl for an acetyl group.
A glycan composition is written as one or more <monosaccharide>:<count> occurrences separated by
a “; “ (semi-colon + space), enclosed in “{ }”. A few examples are shown below:
{Hex:5; HexNAc:4; Neu5Ac:1}
{Hex:5; HexNAc:4; Neu5Ac:2}
{Fuc:1; Hex:5; HexNAc:4; Neu5Ac:2}
{Fuc:2; Hex:6; HexNAc:5; Neu5Ac:1}
{Fuc:1; Hex:6; HexNAc:5; Neu5Ac:2}
