# MAGMa

This code directory is adapted from https://github.com/samgoldman97/mist/tree/nmi_paper_v1/src/mist/magma


MAGMa is an algorithm that takes a molecule as input and returns a list of fragments of that parent molecule.

In this project, MAGMa is used to label the fragment peaks of spectral datasets
with chemical formulae and their corresponding SMILES strings, which serve as an
extra training signal for models. The fragmentation code is heavily inspired
by the [original source code](https://github.com/NLeSC/MAGMa).
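
As a rough illustration of what labeling peaks means here, the sketch below assigns candidate fragments to peaks by mass. The `Fragment` record, the `label_peaks` helper, and the 10 ppm tolerance are all hypothetical and are not taken from this repository. Adduct masses and hydrogen shifts, which a real fragmentation engine has to handle, are ignored.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Fragment:
    """Hypothetical candidate fragment; not the repository's data structure."""
    formula: str
    smiles: str
    mass: float  # monoisotopic mass in Da


def label_peaks(peak_mzs, fragments, ppm_tol=10.0):
    """Map each peak m/z to the fragments whose mass lies within ppm_tol of it."""
    labels = {}
    for mz in peak_mzs:
        tol = mz * ppm_tol * 1e-6  # absolute tolerance in Da for this peak
        labels[mz] = [f for f in fragments if abs(f.mass - mz) <= tol]
    return labels


# Illustrative values only: two candidate fragments and two observed peaks.
fragments = [
    Fragment("C6H6", "c1ccccc1", 78.04695),
    Fragment("C2H4O", "CC=O", 44.02621),
]
labels = label_peaks([78.0470, 50.0], fragments)
```

A peak with no candidate inside the tolerance window simply gets an empty list, so unlabeled peaks stay visible to downstream code.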

`run_magma.py` can be run directly and requires the following arguments:

- **--spectra-dir**: Path to the directory containing the SIRIUS program outputs.
  As an initial cleaning step, only peaks that SIRIUS has preserved are used.
  The program can be adapted to use other spectra input sources.
- **--output-dir**: Path to the directory where the MAGMa output files are saved.
- **--lowest-penalty-filter**: If this flag is set, only the candidate chemical
  formulae and SMILES with the lowest penalty score (as assigned by the MAGMa
  fragmentation engine) are selected when labeling spectrum peaks.
- **--spec-labels**: TSV file containing the SMILES for all spectra being used.
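
To make the arguments above concrete, here is a minimal sketch of an `argparse` parser that accepts the four documented flags, plus a small helper showing what a lowest-penalty filter might look like. Only the flag names come from this README. The `build_parser` and `keep_lowest_penalty` names, the `(formula, smiles, penalty)` tuple layout, the choice to mark three flags as required, and the example paths are assumptions, and the actual `run_magma.py` may differ.

```python
import argparse


def build_parser():
    """Hypothetical parser mirroring the flags documented above."""
    parser = argparse.ArgumentParser(
        description="Label spectrum peaks with MAGMa fragments."
    )
    parser.add_argument("--spectra-dir", required=True,
                        help="Directory containing the SIRIUS outputs.")
    parser.add_argument("--output-dir", required=True,
                        help="Directory in which to save MAGMa output files.")
    parser.add_argument("--spec-labels", required=True,
                        help="TSV file with the SMILES for each spectrum.")
    parser.add_argument("--lowest-penalty-filter", action="store_true",
                        help="Keep only candidates with the lowest penalty.")
    return parser


def keep_lowest_penalty(candidates):
    """Keep only the candidates tied for the minimum penalty score.

    Each candidate is an assumed (formula, smiles, penalty) tuple.
    """
    if not candidates:
        return []
    best = min(penalty for _, _, penalty in candidates)
    return [c for c in candidates if c[2] == best]


# Example invocation with placeholder paths.
args = build_parser().parse_args([
    "--spectra-dir", "sirius_out",
    "--output-dir", "magma_out",
    "--spec-labels", "labels.tsv",
    "--lowest-penalty-filter",
])

# Illustrative candidates for one peak; the penalties are made up.
candidates = [
    ("C6H6", "c1ccccc1", 1.0),
    ("C6H6", "C1=CC=CC=C1", 1.0),
    ("C5H6", "C1C=CC=C1", 3.0),
]
selected = keep_lowest_penalty(candidates) if args.lowest_penalty_filter else candidates
```

Ties are kept deliberately in this sketch, since several fragments can share the same lowest penalty for a given peak.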