Skip to content

Questions on DSCF pretraining code and COREL implementation #1

@audreyeternal

Description

@audreyeternal

Thank you for providing the foundation model DSCF for spectrometry.

I have a few questions regarding the code and pretraining:

  1. Is there pretraining and inference code available? The loss function described in Eq. (1) of the paper doesn’t seem to be implemented in the current code.

  2. The paper states:

“COREL involves sampling and synthesizing spectral data from the spectral dictionary and using spectral component filtering methods to separate the spectral components present in complex systems.”

Just to confirm: does the objective of the pre-training (COREL) is to take in a mixture spectrum (possibly with a random mask) and aim to separate components?
If MultiDec_1d_model is the pretraining model, how should the outplanes parameter be set? The default seems to be 1, which reduces it to a standard MAE task.

  1. From the code, it looks like spectral component abundances in mixtures are randomly sampled. Could this affect performance due to mismatch with real-world abundance distributions?

Thanks in advance!

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions