2023-08-16
This is PMA's DBGI daily open-notebook.
Today is 2023.08.16
Todo today
Have a look at the DBGI discussion forum
- https://github.com/orgs/digital-botanical-gardens-initiative/discussions
Doing
Paused
Done
Notes
For KG conversion, see KGX: https://github.com/biolink/kgx
Will be KGHub compatible ...
We should widen the toy_dataset for enpkg so that it also contains:
- neg mode MS files
- files acquired on a different platform (e.g. ToF)
- Blanks + QC files
Think about edge cases
- corrupted metadata
- corrupted mzML
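A first check for the corrupted mzML case could look like the sketch below. It only relies on mzML being XML and tests whether the file parses from start to end (which catches truncated or empty files). It does not validate against the mzML schema or verify checksums. The function name is hypothetical.

```python
import xml.etree.ElementTree as ET


def is_well_formed_xml(path: str) -> bool:
    """Return True if the file at `path` parses as XML from start to end.

    Assumption: well-formedness only, not mzML schema validity.
    A missing or unreadable file still raises OSError.
    """
    try:
        # Stream through the file so large mzML files are not loaded at once.
        for _event, elem in ET.iterparse(path):
            elem.clear()  # drop element content already seen
    except ET.ParseError:
        return False
    return True
```

Files that fail this check could go into the toy dataset as the "corrupted mzML" examples.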
TF-IDF: https://en.wikipedia.org/wiki/Tf%E2%80%93idf
Jaro-Winkler distance: https://en.wikipedia.org/wiki/Jaro%E2%80%93Winkler_distance
Send to Luca, for the Rust MGF handling lib mascot-rs (https://docs.rs/mascot-rs/latest/mascot_rs/):
- MGF files in neg mode
- MGF file from ToF (check Audrey's files)
polars (pandas equivalent)
Think about a metadata validator interface.
It should propose a metadata template with:
- the X required metadata fields
- a list of validated but optional metadata fields
- a form to request the addition of a metadata field that is not already in the lists above
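A minimal sketch of what the core of such a validator could look like, to help think about the interface. All field names below are hypothetical placeholders, not the actual DBGI or enpkg metadata fields. Fields that appear in neither list are reported separately, since they are the candidates for the "request addition" form.

```python
from dataclasses import dataclass, field

# Hypothetical field names, for illustration only.
REQUIRED_FIELDS = {"sample_id", "sample_type", "ionization_mode"}
OPTIONAL_FIELDS = {"instrument", "collector", "notes"}


@dataclass
class ValidationReport:
    missing_required: list[str] = field(default_factory=list)
    unknown_fields: list[str] = field(default_factory=list)

    @property
    def is_valid(self) -> bool:
        return not self.missing_required and not self.unknown_fields


def make_template() -> dict[str, str]:
    """Empty template: required fields first, then validated optional ones."""
    names = sorted(REQUIRED_FIELDS) + sorted(OPTIONAL_FIELDS)
    return {name: "" for name in names}


def validate(record: dict[str, str]) -> ValidationReport:
    """Check one metadata record against the required and optional lists."""
    # A required field counts as present only if its value is non-empty.
    filled = {key for key, value in record.items() if str(value).strip()}
    missing = sorted(REQUIRED_FIELDS - filled)
    # Anything outside both lists would go through the request form.
    unknown = sorted(set(record) - REQUIRED_FIELDS - OPTIONAL_FIELDS)
    return ValidationReport(missing, unknown)
```

For example, `validate({"sample_id": "S1", "sample_type": "blank", "colour": "green"})` reports `ionization_mode` as missing and `colour` as an unknown field.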