ms-index¶
Data Conversion¶
mzML Conversion¶
ms-index mzml¶
Convert source into mzML format written to output, applying a collection of optional data transformations along the way.
ms-index mzml [OPTIONS] SOURCE OUTPUT
Options
- -r, --ms1-filter <func>¶
- -rn, --msn-filter <func>¶
- -p, --pick-peaks¶
Enable peak picking, centroiding profile data
- -f, --reprofile¶
Enable reprofiling, converting all MS1 spectra into smoothed profile data
- -z, --compress¶
Compress the output file using gzip
- -c, --correct-precursor-mz¶
Adjust the precursor m/z of each MSn scan to the nearest peak m/z in the precursor
- --update-metadata, --no-update-metadata¶
Whether or not to add the conversion program’s metadata to the mzML file.
- -X, --option <tuple>¶
Pass an arbitrary key-value pair through to configure the writer (May specify more than once)
Arguments
- SOURCE¶
Required argument <string>
- OUTPUT¶
Required argument <path>
MGF Conversion¶
ms-index mgf¶
Convert a mass spectrometry data file to MGF. MGF can only represent centroid spectra and generally does not contain any MS1 information.
ms-index mgf [OPTIONS] SOURCE OUTPUT
Options
- -z, --compress¶
Compress the output file using gzip
- -rn, --msn-filter <func>¶
Arguments
- SOURCE¶
Required argument <string>
- OUTPUT¶
Required argument <path>
mzMLb Conversion¶
This feature requires h5py
to be installed, and hdf5plugin
for
the faster non-zlib
based compressors to be available.
ms-index mzmlb¶
Convert source into mzML format written to output, applying a collection of optional data transformations along the way.
ms-index mzmlb [OPTIONS] SOURCE [OUTPUT]
Options
- -r, --ms1-filter <func>¶
- -rn, --msn-filter <func>¶
- -p, --pick-peaks¶
Enable peak picking, centroiding profile data
- -f, --reprofile¶
Enable reprofiling, converting all MS1 spectra into smoothed profile data
- -c, --correct-precursor-mz¶
Adjust the precursor m/z of each MSn scan to the nearest peak m/z in the precursor
- --update-metadata, --no-update-metadata¶
Whether or not to add the conversion program’s metadata to the mzML file.
- -z, --compression <choice>¶
The compressor to use
Choices: [gzip; blosc; blosc:lz4;blosc:lz4hc; blosc:zlib; blosc:zstd;zlib]
- -X, --option <tuple>¶
Pass an arbitrary key-value pair through to configure the writer (May specify more than once)
Arguments
- SOURCE¶
Required argument <string>
- OUTPUT¶
Optional argument <path>
Indexing¶
ms-index byte-index¶
Build an external byte offset index for a MS data file, saving time when opening the file with indexing enabled.
Supported Formats: mzML, mzXML, MGF
ms-index byte-index [OPTIONS] [PATHS]...
Arguments
- PATHS¶
Optional argument(s) <path>
ms-index metadata-index¶
Build an external scan metadata index for a mass spectrometry data file.
This extended index is saved in a separate JSON file that can be loaded with
ExtendedScanIndex
. It includes the scan time of all scans, the precursor
mass of MSn scans, as well as the relationships between precursor and product ion
scans, as well as other details. See ExtendedScanIndex
for more information
ms-index metadata-index [OPTIONS] [PATHS]...
Options
- -D, --deconvoluted¶
Whether or not to assume this file has been processed specifically by ms_deisotope’s deconvolution algorithm
- -p, --processes <int>¶
Number of worker processes to use. Defaults to 4 or the number of CPUs, whichever is lower
Arguments
- PATHS¶
Optional argument(s) <path>
ms-index msms-intervals¶
Construct an interval tree spanning time and m/z domains where MSn spectra were acquired in the LC-MS map.
The interval tree is serialized to JSON.
ms-index msms-intervals [OPTIONS] [PATHS]...
Options
- -o, --output <path>¶
- -p, --processes <int>¶
Number of worker processes to use. Defaults to 4 or the number of CPUs, whichever is lower
Arguments
- PATHS¶
Optional argument(s) <path>
Analytical Utilities¶
ms-index describe¶
Produces a minimal textual description of a mass spectrometry data file.
ms-index describe [OPTIONS] PATH
Options
- -d, --diagnostics¶
Run more diagnostics, greatly increasing runtime but producing additional informatoin
Arguments
- PATH¶
Required argument <path>
ms-index charge-states¶
Count the different precursor charge states in a mass spectrometry data file.
This command will construct a metadata index if it is not found.
ms-index charge-states [OPTIONS] PATH
Arguments
- PATH¶
Required argument <path>
ms-index precursor-clustering¶
Cluster precursor masses in a mass spectrometry data file.
This command will construct a metadata index if it is not found.
ms-index precursor-clustering [OPTIONS] PATH
Arguments
- PATH¶
Required argument <path>
ms-index spectrum-clustering¶
Cluster spectra by precursor mass and cosine similarity.
Spectrum clusters are written out to a text file recording cluster precursor mass, within-cluster similarity, and the source file and scan ID for each cluster member.
ms-index spectrum-clustering [OPTIONS] [PATHS]...
Options
- -m, --precursor-error-tolerance <float>¶
- -t, --similarity-threshold <float>¶
- -o, --output <path>¶
- -c, --cache-size <int>¶
The number of scans to cache in memory when not using –in-memory. If you are clustering multiple files, this number will be apply to each file separately.
- -M, --in-memory¶
Whether to load the entire dataset into memory for better performance
- -D, --deconvoluted¶
Whether to assume the spectra are deconvoluted or not
Arguments
- PATHS¶
Optional argument(s) <path>