2023-08-16

This is PMA's DBGI daily open-notebook.

Today is 2023.08.16

Todo today

Have a look at the DBGI discussion forum

- https://github.com/orgs/digital-botanical-gardens-initiative/discussions

Doing

Paused

Done

Notes

For KG conversion: KGX, https://github.com/biolink/kgx

Will be KGHub compatible ...

We should widen the toy_dataset for enpkg so that it also contains:

  • neg mode MS files
  • files acquired on a different platform (e.g. ToF)
  • Blanks + QC files

Think about edge cases

  • corrupted metadata
  • corrupted mzML
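
A minimal sketch of how the corrupted-mzML case could be caught early, assuming that "corrupted" mostly means truncated or otherwise malformed XML. The function name check_mzml is hypothetical, and this only tests well-formedness and the presence of spectrum elements, not conformance to the mzML schema.

```python
import xml.etree.ElementTree as ET
from pathlib import Path


def check_mzml(path):
    """Return (ok, message) for a basic well-formedness check of an mzML file."""
    path = Path(path)
    if not path.is_file() or path.stat().st_size == 0:
        return False, "missing or empty file"
    n_spectra = 0
    try:
        # iterparse streams the file, so large files are not held in memory at once.
        for _event, elem in ET.iterparse(str(path), events=("end",)):
            # Tags are namespaced, e.g. "{http://psi.hupo.org/ms/mzml}spectrum".
            if elem.tag.rsplit("}", 1)[-1] == "spectrum":
                n_spectra += 1
            elem.clear()  # drop the element's content once it has been counted
    except ET.ParseError as err:
        # Truncated or malformed XML ends up here.
        return False, f"XML parse error: {err}"
    if n_spectra == 0:
        return False, "no <spectrum> elements found"
    return True, f"{n_spectra} spectra"
```

A truncated file and an intact one added to the toy dataset would exercise both branches of this check.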

TF-IDF: https://en.wikipedia.org/wiki/Tf%E2%80%93idf

https://en.wikipedia.org/wiki/Jaro%E2%80%93Winkler_distance

Send to Luca, for the Rust MGF handling lib mascot-rs (https://docs.rs/mascot-rs/latest/mascot_rs/):

  • MGF files in neg mode
  • MGF file from ToF (check Audrey's files)

polars (a pandas equivalent)

Think about a metadata validator interface.

It should propose a metadata template with:

  • the X required metadata fields
  • a list of validated but optional metadata fields
  • a form to request the addition of a metadata field that is not yet in the lists above
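
A rough sketch of the validation logic behind such an interface. The field names in REQUIRED_FIELDS and OPTIONAL_FIELDS are hypothetical placeholders (the actual lists are still to be defined), and validate_metadata and metadata_template are illustrative names, not existing enpkg functions.

```python
# Hypothetical field names, for illustration only; the real lists are to be defined.
REQUIRED_FIELDS = {"sample_id", "sample_type", "ionization_mode"}
OPTIONAL_FIELDS = {"instrument", "collection_date"}


def validate_metadata(record):
    """Report missing required fields and unknown fields for one record (a dict)."""
    # A field counts as present only if it has a non-empty value.
    present = {key for key, value in record.items() if value not in (None, "")}
    return {
        "missing_required": sorted(REQUIRED_FIELDS - present),
        # Unknown fields are candidates for the "request a new field" form.
        "unknown_fields": sorted(set(record) - REQUIRED_FIELDS - OPTIONAL_FIELDS),
        "valid": REQUIRED_FIELDS <= present,
    }


def metadata_template():
    """Return an empty template: required fields first, then optional ones."""
    return {name: "" for name in sorted(REQUIRED_FIELDS) + sorted(OPTIONAL_FIELDS)}
```

Unknown fields are reported rather than rejected, so that they can feed the addition-request form instead of blocking a submission.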

Todo tomorrow

Today I learned that