Spaces:
Sleeping
Sleeping
File size: 1,272 Bytes
7b7a7b6 |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 |
# MAGMa
This code directory is adapted from https://github.com/samgoldman97/mist/tree/nmi_paper_v1/src/mist/magma
MAGMa is an algorithm which takes as input a molecule and provides as output a list of fragment molecules of the parent.
In this project, MAGMa is used to label the fragment peaks of spectra datasets
with chemical formulae and corresponding smiles, to be used as an extra
training signal for models. The fragmentation code utilized is heavily inspired
by the [original source code](https://github.com/NLeSC/MAGMa).
`run_magma.py` can be run directly and requires the following arguments:
- **--spectra-dir**: The directory path containing the SIRIUS program outputs.
To subset spectra, we use only peaks that have been preserved by SIRIUS as
an initial cleaning step. The program can be adapted to use other spectra
input sources.
- **--output-dir**: The chosen output directory path to save the magma output files
- **--lowest-penalty-filter**: If flag set, when selecting candidate chemical formulae and smiles to label spectra peaks, only candidates with the lowest penalty score (as assigned by the Magma fragmentation engine) will be selected
- **--spec-labels**: TSV file containing all the smiles for the spectra being used.
|