Building a Glycan Hypothesis from glySpace

The glycoinformatics community has developed a federation of databases called glySpace, which together make up the “namespace of all glycan structures”. glySpace uses semantic web technologies to describe and relate these structures, and gateways such as [GlyTouCan] provide queryable access to the data.

You can build a glycan composition hypothesis from the set of annotated glycan structures in glySpace for N-glycans and O-glycans, with or without taxonomic filters.

glycresoft build-hypothesis glycan-glyspace

glycresoft build-hypothesis glycan-glyspace [OPTIONS] DATABASE_CONNECTION

Options

-r, --reduction <string>

Reducing end modification

-d, --derivatization <string>

Chemical derivatization to apply

-n, --name <string>

The name for the hypothesis to be created

-m, --motif-class <choice>

Specify a glycan structure family to search for

Choices: [n-linked; o-linked]

-t, --target-taxon <string>

Only select structures annotated with this taxonomy

-i, --include-children

Include child taxa of --target-taxon. Has no effect if --target-taxon is not given. [default: False]

-s, --detatch-substituent <substituent>

Substituent type to detach from all monosaccharides (may be specified more than once)

Arguments

DATABASE_CONNECTION

Required argument <string>. A connection URI for a database, or a path on the file system.

For more information on reductions and derivatizations, please see Glycan Modifications.
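
As a sketch of how the taxonomic options combine, the script below assembles a command that selects N-glycans annotated with a taxon or any of its child taxa. It assumes that --target-taxon accepts NCBI Taxonomy identifiers other than the one used in this page's examples; 40674 (Mammalia), the hypothesis name, and the RUN switch are choices made for this illustration, not part of glycresoft. By default the script only prints the command so it can be reviewed before anything is queried.

```shell
#!/usr/bin/env bash
# Sketch: taxon-filtered hypothesis that also includes child taxa.
# Assumption: 40674 (NCBI Taxonomy ID for Mammalia) is a valid --target-taxon value.
set -euo pipefail

taxon=40674
database="glyspace-glycans.db"        # placeholder path, as in the usage examples
name="Mammalian N-Linked Glycans"     # hypothetical hypothesis name

# Build the argument list as an array so the quoted name stays a single argument.
cmd=(glycresoft build-hypothesis glycan-glyspace
     -m n-linked -t "$taxon" -i -n "$name" "$database")

# Print the command for review; set RUN=1 in the environment to execute it.
printf '%q ' "${cmd[@]}"; echo
if [ "${RUN:-0}" = "1" ]; then
    "${cmd[@]}"
fi
```

Without -i, only structures annotated directly with the given taxon would be selected.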

Usage Example

# Get all human N-glycans
$ glycresoft build-hypothesis glycan-glyspace -m n-linked -t 9606 glyspace-glycans.db -n "Human N-Linked Glycans"
Building Glycan Hypothesis Human N-Linked Glycans (2)
14:35:34 - glycresoft:log:175 - INFO - Begin N Glycan Glyspace Hypothesis Serializer
{'composition_cache': None,
 'derivatization': None,
 'engine': Engine(sqlite:///glyspace-glycans.db),
 'filter_functions': [TaxonomyFilter({'9606'})],
 'loader': None,
 'reduction': None,
 'seen': {},
 'simplify': True,
 'start_time': datetime.datetime(2017, 8, 31, 14, 35, 34, 164000),
 'status': 'started',
 'transformer': None,
 'uuid': '471041e33f23485c9a570a5b4ea6e0d2'}
14:35:34 - glycresoft:log:175 - INFO - Querying GlySpace
14:35:54 - glycresoft:log:175 - INFO - Translating Response
14:36:19 - glycresoft:log:175 - INFO - Stored 976 glycan structures and 195 glycan compositions
14:38:24 - glycresoft:log:175 - INFO - Hypothesis Completed
14:38:24 - glycresoft:log:175 - INFO - End N Glycan Glyspace Hypothesis Serializer
14:38:24 - glycresoft:log:175 - INFO - Started at 2017-08-31 14:35:34.164000.
Ended at 2017-08-31 14:38:24.535000.
Total time elapsed: 0:02:50.371000
NGlycanGlyspaceHypothesisSerializer completed successfully.

# Get all human O-glycans
$ glycresoft build-hypothesis glycan-glyspace -m o-linked -t 9606 glyspace-glycans.db -n "Human O-Linked Glycans"
Building Glycan Hypothesis Human O-Linked Glycans
15:33:49 - glycresoft:log:175 - INFO - Begin O Glycan Glyspace Hypothesis Serializer
{'composition_cache': None,
 'derivatization': None,
 'engine': Engine(sqlite:///glyspace-glycans.db),
 'filter_functions': [TaxonomyFilter({'9606'})],
 'loader': None,
 'reduction': None,
 'seen': {},
 'simplify': True,
 'start_time': datetime.datetime(2017, 8, 31, 15, 33, 49, 601000),
 'status': 'started',
 'transformer': None,
 'uuid': '714d52c2525c499286f673e294672b9e'}
15:33:49 - glycresoft:log:175 - INFO - Querying GlySpace
15:33:55 - glycresoft:log:175 - INFO - Translating Response
15:34:01 - glycresoft:log:175 - INFO - Stored 315 glycan structures and 95 glycan compositions
15:34:01 - glycresoft:log:175 - INFO - Hypothesis Completed
15:34:01 - glycresoft:log:175 - INFO - End O Glycan Glyspace Hypothesis Serializer
15:34:01 - glycresoft:log:175 - INFO - Started at 2017-08-31 15:33:49.601000.
Ended at 2017-08-31 15:34:01.737000.
Total time elapsed: 0:00:12.136000
OGlycanGlyspaceHypothesisSerializer completed successfully.
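
The two runs above write to the same database file and differ only in the motif class and name, so they can be driven from one loop. This is a minimal sketch under that assumption; the RUN switch is a convention of this example rather than a glycresoft feature, and by default the commands are only printed.

```shell
#!/usr/bin/env bash
# Sketch: build human N-linked and O-linked hypotheses in one pass.
set -euo pipefail

database="glyspace-glycans.db"
taxon=9606   # taxon identifier used in the usage examples above

for motif in n-linked o-linked; do
    # Pick the hypothesis name that matches the motif class.
    case "$motif" in
        n-linked) name="Human N-Linked Glycans" ;;
        o-linked) name="Human O-Linked Glycans" ;;
    esac

    cmd=(glycresoft build-hypothesis glycan-glyspace
         -m "$motif" -t "$taxon" "$database" -n "$name")

    # Print each command; set RUN=1 in the environment to execute it.
    printf '%q ' "${cmd[@]}"; echo
    if [ "${RUN:-0}" = "1" ]; then
        "${cmd[@]}"
    fi
done
```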

Bibliography

[GlyTouCan] Aoki-Kinoshita, K., Agravat, S., Aoki, N. P., Arpinar, S., Cummings, R. D., Fujita, A., … Narimatsu, H. (2015). GlyTouCan 1.0 – The international glycan structure repository. Nucleic Acids Research, gkv1041. https://doi.org/10.1093/nar/gkv1041