3 Performing Taxonomically Informed Metabolite Annotation

Author

Adriano Rutz

Published

July 29, 2025

This vignette describes how Taxonomically Informed Metabolite Annotation is performed. If you followed all previous steps successfully, this should be a piece of cake, you deserve it!

tima::tima_full()
#> + par_def_ann_spe dispatched
#> ✔ par_def_ann_spe completed [28ms, 2.14 kB]
#> + par_def_wei_ann dispatched
#> ✔ par_def_wei_ann completed [0ms, 5.05 kB]
#> + par_def_pre_ann_gnp dispatched
#> ✔ par_def_pre_ann_gnp completed [1ms, 1.42 kB]
#> + par_def_pre_lib_sop_mer dispatched
#> ✔ par_def_pre_lib_sop_mer completed [1ms, 3.40 kB]
#> + yaml_paths dispatched
#> ✔ yaml_paths completed [1ms, 11.52 kB]
#> + par_def_pre_lib_sop_lot dispatched
#> ✔ par_def_pre_lib_sop_lot completed [0ms, 494 B]
#> + par_def_ann_mas dispatched
#> ✔ par_def_ann_mas completed [1ms, 6.09 kB]
#> + par_def_pre_lib_sop_hmd dispatched
#> ✔ par_def_pre_lib_sop_hmd completed [1ms, 492 B]
#> + par_def_fil_ann dispatched
#> ✔ par_def_fil_ann completed [1ms, 1.34 kB]
#> + par_def_pre_lib_sop_clo dispatched
#> ✔ par_def_pre_lib_sop_clo completed [0ms, 523 B]
#> + par_def_pre_lib_spe dispatched
#> ✔ par_def_pre_lib_spe completed [1ms, 1.57 kB]
#> + par_def_pre_fea_com dispatched
#> ✔ par_def_pre_fea_com completed [1ms, 358 B]
#> + par_def_cre_com dispatched
#> ✔ par_def_cre_com completed [1ms, 375 B]
#> + par_def_cre_edg_spe dispatched
#> ✔ par_def_cre_edg_spe completed [1ms, 1.42 kB]
#> + par_def_pre_fea_edg dispatched
#> ✔ par_def_pre_fea_edg completed [1ms, 706 B]
#> + par_def_pre_fea_tab dispatched
#> ✔ par_def_pre_fea_tab completed [1ms, 860 B]
#> + par_def_pre_lib_rt dispatched
#> ✔ par_def_pre_lib_rt completed [0ms, 2.05 kB]
#> + par_def_pre_ann_spe dispatched
#> ✔ par_def_pre_ann_spe completed [0ms, 1.46 kB]
#> + par_def_pre_ann_sir dispatched
#> ✔ par_def_pre_ann_sir completed [0ms, 1.93 kB]
#> + par_def_pre_tax dispatched
#> ✔ par_def_pre_tax completed [0ms, 1.51 kB]
#> + par_def_pre_lib_sop_ecm dispatched
#> ✔ par_def_pre_lib_sop_ecm completed [0ms, 492 B]
#> + paths dispatched
#> ✔ paths completed [2ms, 2.52 kB]
#> + lib_spe_exp_gnp_pre_sop dispatched
#> INFO [2025-07-29 12:25:29] Directory data/interim/libraries/sop created.
#> ✔ lib_spe_exp_gnp_pre_sop completed [479ms, 1.42 MB]
#> + lib_spe_exp_mb_pre_sop dispatched
#> ✔ lib_spe_exp_mb_pre_sop completed [283ms, 480.97 kB]
#> + lib_spe_exp_mer_pre_sop dispatched
#> ✔ lib_spe_exp_mer_pre_sop completed [393ms, 1.19 MB]
#> + lib_spe_is_wik_pre_sop dispatched
#> ✔ lib_spe_is_wik_pre_sop completed [684ms, 37.90 MB]
#> + lib_spe_exp_mb_pre_pos dispatched
#> INFO [2025-07-29 12:25:31] Directory data/interim/libraries/spectra/exp created.
#> ✔ lib_spe_exp_mb_pre_pos completed [764ms, 19.41 MB]
#> + par_pre_par dispatched
#> ✔ par_pre_par completed [0ms, 1.38 kB]
#> + lib_spe_exp_mer_pre_neg dispatched
#> ✔ lib_spe_exp_mer_pre_neg completed [782ms, 31.54 MB]
#> + lib_spe_is_wik_pre_neg dispatched
#> INFO [2025-07-29 12:25:33] Directory data/interim/libraries/spectra/is created.
#> Downloading  50% ■■■■■■■■■■■■■■■■                  1s
#> Downloading 100% ■■■■■■■■■■■■■■■■■■■■■■■■■■■■■■■   0s
#> ✔ lib_spe_is_wik_pre_neg completed [10.2s, 687.33 MB]
#> + par_pre_par2 dispatched
#> ✔ par_pre_par2 completed [0ms, 21.17 kB]
#> + lib_spe_is_wik_pre_pos dispatched
#> Downloading  44% ■■■■■■■■■■■■■■                    1s
#> Downloading 100% ■■■■■■■■■■■■■■■■■■■■■■■■■■■■■■■   0s
#> ✔ lib_spe_is_wik_pre_pos completed [12.8s, 863.95 MB]
#> + lib_sop_lot dispatched
#> INFO [2025-07-29 12:25:59] Downloading 230106_frozen_metadata.csv.gz from https://doi.org/10.5281/zenodo.5794106 (The LOTUS Initiative for Open Natural Products Research: frozen dataset union wikidata (with metadata); unique identifier: https://doi.org/10.5281/zenodo.7534071)
#> INFO [2025-07-29 12:25:59] Directory data/source/libraries/sop created.
#> ✔ lib_sop_lot completed [1m 24.5s, 92.98 MB]
#> + lib_sop_hmd dispatched
#> INFO [2025-07-29 12:27:22] Directory data/source/libraries/sop/hmdb created.
#> Downloading  11% ■■■■                              8s
#> Downloading  36% ■■■■■■■■■■■■                      6s
#> Downloading  72% ■■■■■■■■■■■■■■■■■■■■■■■           2s
#> Downloading 100% ■■■■■■■■■■■■■■■■■■■■■■■■■■■■■■■   0s
#> ✔ lib_sop_hmd completed [8.3s, 96.48 MB]
#> + lib_spe_exp_gnp_pre_neg dispatched
#> ✔ lib_spe_exp_gnp_pre_neg completed [2.5s, 154.12 MB]
#> + lib_spe_exp_mer_pre_pos dispatched
#> ✔ lib_spe_exp_mer_pre_pos completed [1.6s, 84.94 MB]
#> + lib_sop_ecm dispatched
#> ✔ lib_sop_ecm completed [771ms, 1.33 MB]
#> + lib_spe_exp_mb_pre_neg dispatched
#> ✔ lib_spe_exp_mb_pre_neg completed [539ms, 7.06 MB]
#> + lib_spe_exp_gnp_pre_pos dispatched
#> ✔ lib_spe_exp_gnp_pre_pos completed [7s, 481.27 MB]
#> + par_fin_par dispatched
#> ✔ par_fin_par completed [1ms, 307 B]
#> + par_fin_par2 dispatched
#> ✔ par_fin_par2 completed [2ms, 2.96 kB]
#> + par_usr_pre_lib_sop_mer dispatched
#> TRACE [2025-07-29 12:27:44] Loading default params
#> TRACE [2025-07-29 12:27:44] All params
#> TRACE [2025-07-29 12:27:44] Small params
#> TRACE [2025-07-29 12:27:44] Advanced params
#> TRACE [2025-07-29 12:27:44] Changing params
#> TRACE [2025-07-29 12:27:44] Changing filenames
#> TRACE [2025-07-29 12:27:46] Exporting params
#> INFO [2025-07-29 12:27:46] Directory params/user created.
#> ✔ par_usr_pre_lib_sop_mer completed [1.4s, 1.55 kB]
#> + par_usr_pre_lib_sop_lot dispatched
#> TRACE [2025-07-29 12:27:46] Loading default params
#> TRACE [2025-07-29 12:27:46] All params
#> TRACE [2025-07-29 12:27:46] Small params
#> TRACE [2025-07-29 12:27:46] Advanced params
#> TRACE [2025-07-29 12:27:46] Changing params
#> TRACE [2025-07-29 12:27:46] Changing filenames
#> TRACE [2025-07-29 12:27:47] Exporting params
#> ✔ par_usr_pre_lib_sop_lot completed [1.3s, 174 B]
#> + par_usr_pre_tax dispatched
#> TRACE [2025-07-29 12:27:47] Loading default params
#> TRACE [2025-07-29 12:27:47] All params
#> TRACE [2025-07-29 12:27:47] Small params
#> TRACE [2025-07-29 12:27:47] Advanced params
#> TRACE [2025-07-29 12:27:47] Changing params
#> TRACE [2025-07-29 12:27:47] Changing filenames
#> TRACE [2025-07-29 12:27:49] Exporting params
#> ✔ par_usr_pre_tax completed [1.3s, 438 B]
#> + par_usr_pre_ann_gnp dispatched
#> TRACE [2025-07-29 12:27:49] Loading default params
#> TRACE [2025-07-29 12:27:49] All params
#> TRACE [2025-07-29 12:27:49] Small params
#> TRACE [2025-07-29 12:27:49] Advanced params
#> TRACE [2025-07-29 12:27:49] Changing params
#> TRACE [2025-07-29 12:27:49] Changing filenames
#> TRACE [2025-07-29 12:27:50] Exporting params
#> ✔ par_usr_pre_ann_gnp completed [1.3s, 708 B]
#> + par_usr_pre_lib_sop_hmd dispatched
#> TRACE [2025-07-29 12:27:50] Loading default params
#> TRACE [2025-07-29 12:27:50] All params
#> TRACE [2025-07-29 12:27:50] Small params
#> TRACE [2025-07-29 12:27:50] Advanced params
#> TRACE [2025-07-29 12:27:50] Changing params
#> TRACE [2025-07-29 12:27:50] Changing filenames
#> TRACE [2025-07-29 12:27:52] Exporting params
#> ✔ par_usr_pre_lib_sop_hmd completed [1.3s, 178 B]
#> + par_usr_cre_com dispatched
#> TRACE [2025-07-29 12:27:52] Loading default params
#> TRACE [2025-07-29 12:27:52] All params
#> TRACE [2025-07-29 12:27:52] Small params
#> TRACE [2025-07-29 12:27:52] Advanced params
#> TRACE [2025-07-29 12:27:52] Changing params
#> TRACE [2025-07-29 12:27:52] Changing filenames
#> TRACE [2025-07-29 12:27:53] Exporting params
#> ✔ par_usr_cre_com completed [1.3s, 200 B]
#> + par_usr_pre_lib_sop_clo dispatched
#> TRACE [2025-07-29 12:27:53] Loading default params
#> TRACE [2025-07-29 12:27:53] All params
#> TRACE [2025-07-29 12:27:53] Small params
#> TRACE [2025-07-29 12:27:53] Advanced params
#> TRACE [2025-07-29 12:27:53] Changing params
#> TRACE [2025-07-29 12:27:53] Changing filenames
#> TRACE [2025-07-29 12:27:55] Exporting params
#> ✔ par_usr_pre_lib_sop_clo completed [1.3s, 205 B]
#> + par_usr_cre_edg_spe dispatched
#> TRACE [2025-07-29 12:27:55] Loading default params
#> TRACE [2025-07-29 12:27:55] All params
#> TRACE [2025-07-29 12:27:55] Small params
#> TRACE [2025-07-29 12:27:55] Advanced params
#> TRACE [2025-07-29 12:27:55] Changing params
#> TRACE [2025-07-29 12:27:55] Changing filenames
#> TRACE [2025-07-29 12:27:56] Exporting params
#> ✔ par_usr_cre_edg_spe completed [1.3s, 452 B]
#> + par_usr_pre_fea_com dispatched
#> TRACE [2025-07-29 12:27:56] Loading default params
#> TRACE [2025-07-29 12:27:56] All params
#> TRACE [2025-07-29 12:27:56] Small params
#> TRACE [2025-07-29 12:27:56] Advanced params
#> TRACE [2025-07-29 12:27:56] Changing params
#> TRACE [2025-07-29 12:27:56] Changing filenames
#> TRACE [2025-07-29 12:27:58] Exporting params
#> ✔ par_usr_pre_fea_com completed [1.5s, 200 B]
#> + par_usr_pre_fea_edg dispatched
#> TRACE [2025-07-29 12:27:58] Loading default params
#> TRACE [2025-07-29 12:27:58] All params
#> TRACE [2025-07-29 12:27:58] Small params
#> TRACE [2025-07-29 12:27:58] Advanced params
#> TRACE [2025-07-29 12:27:58] Changing params
#> TRACE [2025-07-29 12:27:58] Changing filenames
#> TRACE [2025-07-29 12:27:59] Exporting params
#> ✔ par_usr_pre_fea_edg completed [1.3s, 328 B]
#> + par_usr_pre_lib_sop_ecm dispatched
#> TRACE [2025-07-29 12:27:59] Loading default params
#> TRACE [2025-07-29 12:27:59] All params
#> TRACE [2025-07-29 12:27:59] Small params
#> TRACE [2025-07-29 12:27:59] Advanced params
#> TRACE [2025-07-29 12:27:59] Changing params
#> TRACE [2025-07-29 12:27:59] Changing filenames
#> TRACE [2025-07-29 12:28:01] Exporting params
#> ✔ par_usr_pre_lib_sop_ecm completed [1.3s, 176 B]
#> + par_usr_fil_ann dispatched
#> TRACE [2025-07-29 12:28:01] Loading default params
#> TRACE [2025-07-29 12:28:01] All params
#> TRACE [2025-07-29 12:28:01] Small params
#> TRACE [2025-07-29 12:28:01] Advanced params
#> TRACE [2025-07-29 12:28:01] Changing params
#> TRACE [2025-07-29 12:28:01] Changing filenames
#> TRACE [2025-07-29 12:28:02] Exporting params
#> ✔ par_usr_fil_ann completed [1.3s, 668 B]
#> + par_usr_pre_fea_tab dispatched
#> TRACE [2025-07-29 12:28:02] Loading default params
#> TRACE [2025-07-29 12:28:02] All params
#> TRACE [2025-07-29 12:28:02] Small params
#> TRACE [2025-07-29 12:28:02] Advanced params
#> TRACE [2025-07-29 12:28:02] Changing params
#> TRACE [2025-07-29 12:28:02] Changing filenames
#> TRACE [2025-07-29 12:28:04] Exporting params
#> ✔ par_usr_pre_fea_tab completed [1.3s, 274 B]
#> + par_usr_pre_lib_rt dispatched
#> TRACE [2025-07-29 12:28:04] Loading default params
#> TRACE [2025-07-29 12:28:04] All params
#> TRACE [2025-07-29 12:28:04] Small params
#> TRACE [2025-07-29 12:28:04] Advanced params
#> TRACE [2025-07-29 12:28:04] Changing params
#> TRACE [2025-07-29 12:28:04] Changing filenames
#> TRACE [2025-07-29 12:28:05] Exporting params
#> ✔ par_usr_pre_lib_rt completed [1.3s, 440 B]
#> + par_usr_ann_spe dispatched
#> TRACE [2025-07-29 12:28:05] Loading default params
#> TRACE [2025-07-29 12:28:05] All params
#> TRACE [2025-07-29 12:28:05] Small params
#> TRACE [2025-07-29 12:28:05] Advanced params
#> TRACE [2025-07-29 12:28:05] Changing params
#> TRACE [2025-07-29 12:28:05] Changing filenames
#> TRACE [2025-07-29 12:28:07] Exporting params
#> ✔ par_usr_ann_spe completed [1.3s, 1.03 kB]
#> + par_usr_pre_ann_spe dispatched
#> TRACE [2025-07-29 12:28:07] Loading default params
#> TRACE [2025-07-29 12:28:07] All params
#> TRACE [2025-07-29 12:28:07] Small params
#> TRACE [2025-07-29 12:28:07] Advanced params
#> TRACE [2025-07-29 12:28:07] Changing params
#> TRACE [2025-07-29 12:28:07] Changing filenames
#> TRACE [2025-07-29 12:28:08] Exporting params
#> ✔ par_usr_pre_ann_spe completed [1.3s, 731 B]
#> + par_usr_pre_lib_spe dispatched
#> TRACE [2025-07-29 12:28:08] Loading default params
#> TRACE [2025-07-29 12:28:08] All params
#> TRACE [2025-07-29 12:28:08] Small params
#> TRACE [2025-07-29 12:28:08] Advanced params
#> TRACE [2025-07-29 12:28:08] Changing params
#> TRACE [2025-07-29 12:28:08] Changing filenames
#> TRACE [2025-07-29 12:28:10] Exporting params
#> ✔ par_usr_pre_lib_spe completed [1.3s, 298 B]
#> + par_usr_pre_ann_sir dispatched
#> TRACE [2025-07-29 12:28:10] Loading default params
#> TRACE [2025-07-29 12:28:10] All params
#> TRACE [2025-07-29 12:28:10] Small params
#> TRACE [2025-07-29 12:28:10] Advanced params
#> TRACE [2025-07-29 12:28:10] Changing params
#> TRACE [2025-07-29 12:28:10] Changing filenames
#> TRACE [2025-07-29 12:28:11] Exporting params
#> ✔ par_usr_pre_ann_sir completed [1.3s, 900 B]
#> + par_usr_ann_mas dispatched
#> TRACE [2025-07-29 12:28:11] Loading default params
#> TRACE [2025-07-29 12:28:11] All params
#> TRACE [2025-07-29 12:28:11] Small params
#> TRACE [2025-07-29 12:28:11] Advanced params
#> TRACE [2025-07-29 12:28:11] Changing params
#> TRACE [2025-07-29 12:28:11] Changing filenames
#> TRACE [2025-07-29 12:28:13] Exporting params
#> ✔ par_usr_ann_mas completed [1.3s, 2.68 kB]
#> + par_usr_wei_ann dispatched
#> TRACE [2025-07-29 12:28:13] Loading default params
#> TRACE [2025-07-29 12:28:13] All params
#> TRACE [2025-07-29 12:28:13] Small params
#> TRACE [2025-07-29 12:28:13] Advanced params
#> TRACE [2025-07-29 12:28:13] Changing params
#> TRACE [2025-07-29 12:28:13] Changing filenames
#> TRACE [2025-07-29 12:28:14] Exporting params
#> ✔ par_usr_wei_ann completed [1.3s, 1.76 kB]
#> + par_pre_lib_sop_mer dispatched
#> ✔ par_pre_lib_sop_mer completed [1ms, 558 B]
#> + par_pre_lib_sop_lot dispatched
#> ✔ par_pre_lib_sop_lot completed [1ms, 186 B]
#> + par_pre_tax dispatched
#> ✔ par_pre_tax completed [1ms, 327 B]
#> + par_pre_ann_gnp dispatched
#> ✔ par_pre_ann_gnp completed [0ms, 336 B]
#> + par_pre_lib_sop_hmd dispatched
#> ✔ par_pre_lib_sop_hmd completed [0ms, 191 B]
#> + par_cre_com dispatched
#> ✔ par_cre_com completed [1ms, 191 B]
#> + par_pre_lib_sop_clo dispatched
#> ✔ par_pre_lib_sop_clo completed [0ms, 213 B]
#> + par_cre_edg_spe dispatched
#> ✔ par_cre_edg_spe completed [1ms, 389 B]
#> + par_pre_fea_com dispatched
#> ✔ par_pre_fea_com completed [0ms, 184 B]
#> + par_pre_fea_edg dispatched
#> ✔ par_pre_fea_edg completed [1ms, 244 B]
#> + par_pre_lib_sop_ecm dispatched
#> ✔ par_pre_lib_sop_ecm completed [0ms, 191 B]
#> + par_fil_ann dispatched
#> ✔ par_fil_ann completed [1ms, 347 B]
#> + par_pre_fea_tab dispatched
#> ✔ par_pre_fea_tab completed [0ms, 278 B]
#> + par_pre_lib_rt dispatched
#> ✔ par_pre_lib_rt completed [0ms, 329 B]
#> + par_ann_spe dispatched
#> ✔ par_ann_spe completed [0ms, 497 B]
#> + par_pre_ann_spe dispatched
#> ✔ par_pre_ann_spe completed [0ms, 334 B]
#> + par_pre_lib_spe dispatched
#> ✔ par_pre_lib_spe completed [1ms, 305 B]
#> + par_pre_ann_sir dispatched
#> ✔ par_pre_ann_sir completed [0ms, 405 B]
#> + par_ann_mas dispatched
#> ✔ par_ann_mas completed [1ms, 1.13 kB]
#> + par_wei_ann dispatched
#> ✔ par_wei_ann completed [1ms, 932 B]
#> + lib_sop_mer_str_pro dispatched
#> INFO [2025-07-29 12:28:18] Directory data/interim/libraries/sop/merged/structures created.
#> ✔ lib_sop_mer_str_pro completed [1.3s, 90.51 MB]
#> + lib_sop_lot_pre dispatched
#> TRACE [2025-07-29 12:28:19] Loading and preparing LOTUS
#> INFO [2025-07-29 12:28:30] ... path to export is data/interim/libraries/sop/lotus_prepared.tsv.gz
#> ✔ lib_sop_lot_pre completed [13.8s, 46.52 MB]
#> + lib_sop_hmd_pre dispatched
#> TRACE [2025-07-29 12:28:33] Unzipping HMDB
#> TRACE [2025-07-29 12:28:37] Loading HMDB
#> TRACE [2025-07-29 12:29:17] Formatting HMDB
#> TRACE [2025-07-29 12:29:17] Deleting unzipped file
#> INFO [2025-07-29 12:29:17] ... path to export is data/interim/libraries/sop/hmdb_prepared.tsv.gz
#> ✔ lib_sop_hmd_pre completed [44.7s, 8.06 MB]
#> + lib_sop_clo_pre dispatched
#> WARN [2025-07-29 12:29:19] Sorry, you do not have access to the closed resource,
#> returning an empty file instead
#> INFO [2025-07-29 12:29:19] Directory data/interim/params created.
#> INFO [2025-07-29 12:29:19] ... path to used parameters is data/interim/params
#> INFO [2025-07-29 12:29:19] ... path to export is data/interim/libraries/sop/closed_prepared.tsv.gz
#> ✔ lib_sop_clo_pre completed [18ms, 273 B]
#> + lib_sop_ecm_pre dispatched
#> TRACE [2025-07-29 12:29:19] Loading ECMDB resources
#> TRACE [2025-07-29 12:29:19] Formatting ECMDB
#> INFO [2025-07-29 12:29:19] ... path to used parameters is data/interim/params
#> INFO [2025-07-29 12:29:19] ... path to export is data/interim/libraries/sop/ecmdb_prepared.tsv.gz
#> ✔ lib_sop_ecm_pre completed [638ms, 177.47 kB]
#> + par_pre_fea_tab_fil_fea_raw dispatched
#> ✔ par_pre_fea_tab_fil_fea_raw completed [0ms, 451.55 kB]
#> + lib_rt dispatched
#> WARN [2025-07-29 12:29:20] No retention time library found, returning empty sop table.
#> WARN [2025-07-29 12:29:20] No retention time library found, returning empty retention time table.
#> INFO [2025-07-29 12:29:20] ... path to used parameters is data/interim/params
#> INFO [2025-07-29 12:29:20] Directory data/interim/libraries/rt created.
#> INFO [2025-07-29 12:29:20] ... path to export is data/interim/libraries/rt/prepared.tsv.gz
#> INFO [2025-07-29 12:29:20] ... path to export is data/interim/libraries/sop/rt_prepared.tsv.gz
#> ✔ lib_rt completed [53ms, 182 B]
#> + par_ann_spe_fil_spe_raw dispatched
#> ✔ par_ann_spe_fil_spe_raw completed [0ms, 7.77 MB]
#> + lib_spe_exp_int_pre dispatched
#> WARN [2025-07-29 12:29:20] Your input file does not exist, returning empty lib instead.
#> INFO [2025-07-29 12:29:21] ... path to export is data/interim/libraries/sop/internal_prepared.tsv.gz
#> INFO [2025-07-29 12:29:21] ... path to used parameters is data/interim/params
#> ✔ lib_spe_exp_int_pre completed [1.3s, 155 B]
#> + input_features dispatched
#> ✔ input_features completed [0ms, 451.55 kB]
#> + lib_rt_sop dispatched
#> ✔ lib_rt_sop completed [1ms, 96 B]
#> + lib_rt_rts dispatched
#> ✔ lib_rt_rts completed [0ms, 86 B]
#> + input_spectra dispatched
#> ✔ input_spectra completed [0ms, 7.77 MB]
#> + lib_spe_exp_int_pre_sop dispatched
#> ✔ lib_spe_exp_int_pre_sop completed [0ms, 106 B]
#> + lib_spe_exp_int_pre_pos dispatched
#> ✔ lib_spe_exp_int_pre_pos completed [0ms, 601 B]
#> + lib_spe_exp_int_pre_neg dispatched
#> ✔ lib_spe_exp_int_pre_neg completed [0ms, 601 B]
#> + fea_pre dispatched
#> TRACE [2025-07-29 12:29:24] Preparing features table
#> TRACE [2025-07-29 12:29:24] Formatting feature table
#> TRACE [2025-07-29 12:29:24] ... requires 'Peak area' or ':area' in columns (mzmine format)
#> TRACE [2025-07-29 12:29:24] ... or 'quant_' in columns (SLAW format)
#> TRACE [2025-07-29 12:29:24] ... or 'Peak height' in columns (SIRIUS format)
#> TRACE [2025-07-29 12:29:24] Filtering top intensities per feature
#> INFO [2025-07-29 12:29:24] ... path to used parameters is data/interim/params
#> INFO [2025-07-29 12:29:24] Directory data/interim/features created.
#> INFO [2025-07-29 12:29:24] ... path to export is data/interim/features/example_features.tsv.gz
#> ✔ fea_pre completed [151ms, 95.63 kB]
#> + fea_edg_spe dispatched
#> INFO [2025-07-29 12:29:26] Read 10000 spectra...
#> TRACE [2025-07-29 12:29:34] Harmonizing names
#> TRACE [2025-07-29 12:29:34] Filtering MS2 only
#> TRACE [2025-07-29 12:29:34] Combining spectra in case
#> TRACE [2025-07-29 12:29:36] Applying sanitization of the spectra
#> TRACE [2025-07-29 12:29:36] Filtering empty spectra
#> INFO [2025-07-29 12:29:36] Considering 3840 spectra
#> TRACE [2025-07-29 12:29:36] Performing spectral comparison
#> TRACE [2025-07-29 12:29:36] As we do not limit the precursors delta,
#> expect a (relatively) long processing time.
#> INFO [2025-07-29 12:29:36] =====================================================
#> INFO [2025-07-29 12:29:36] = Take yourself a break, you deserve it.            =
#> INFO [2025-07-29 12:29:36] =====================================================
#>  ■                                  0% |  ETA:  6m
#>  ■                                  1% |  ETA:  6m
#>  ■■                                 2% |  ETA:  6m
#>  ■■                                 3% |  ETA:  6m
#>  ■■                                 3% |  ETA:  6m
#>  ■■                                 4% |  ETA:  6m
#>  ■■■                                5% |  ETA:  6m
#>  ■■■                                6% |  ETA:  6m
#>  ■■■                                7% |  ETA:  6m
#>  ■■■                                8% |  ETA:  6m
#>  ■■■■                               9% |  ETA:  6m
#>  ■■■■                              10% |  ETA:  5m
#>  ■■■■                              10% |  ETA:  5m
#>  ■■■■                              11% |  ETA:  5m
#>  ■■■■■                             12% |  ETA:  5m
#>  ■■■■■                             13% |  ETA:  5m
#>  ■■■■■                             14% |  ETA:  5m
#>  ■■■■■                             15% |  ETA:  5m
#>  ■■■■■■                            16% |  ETA:  5m
#>  ■■■■■■                            17% |  ETA:  5m
#>  ■■■■■■                            18% |  ETA:  5m
#>  ■■■■■■■                           18% |  ETA:  5m
#>  ■■■■■■■                           19% |  ETA:  5m
#>  ■■■■■■■                           20% |  ETA:  5m
#>  ■■■■■■■                           21% |  ETA:  4m
#>  ■■■■■■■■                          23% |  ETA:  4m
#>  ■■■■■■■■                          24% |  ETA:  4m
#>  ■■■■■■■■                          25% |  ETA:  4m
#>  ■■■■■■■■■                         26% |  ETA:  4m
#>  ■■■■■■■■■                         27% |  ETA:  4m
#>  ■■■■■■■■■                         28% |  ETA:  4m
#>  ■■■■■■■■■■                        29% |  ETA:  4m
#>  ■■■■■■■■■■                        30% |  ETA:  4m
#>  ■■■■■■■■■■                        31% |  ETA:  4m
#>  ■■■■■■■■■■■                       32% |  ETA:  4m
#>  ■■■■■■■■■■■                       33% |  ETA:  4m
#>  ■■■■■■■■■■■                       34% |  ETA:  3m
#>  ■■■■■■■■■■■■                      36% |  ETA:  3m
#>  ■■■■■■■■■■■■                      37% |  ETA:  3m
#>  ■■■■■■■■■■■■                      38% |  ETA:  3m
#>  ■■■■■■■■■■■■■                     40% |  ETA:  3m
#>  ■■■■■■■■■■■■■                     41% |  ETA:  3m
#>  ■■■■■■■■■■■■■■                    42% |  ETA:  3m
#>  ■■■■■■■■■■■■■■                    44% |  ETA:  3m
#>  ■■■■■■■■■■■■■■■                   45% |  ETA:  3m
#>  ■■■■■■■■■■■■■■■                   47% |  ETA:  3m
#>  ■■■■■■■■■■■■■■■■                  48% |  ETA:  2m
#>  ■■■■■■■■■■■■■■■■                  50% |  ETA:  2m
#>  ■■■■■■■■■■■■■■■■                  52% |  ETA:  2m
#>  ■■■■■■■■■■■■■■■■■                 53% |  ETA:  2m
#>  ■■■■■■■■■■■■■■■■■■                55% |  ETA:  2m
#>  ■■■■■■■■■■■■■■■■■■                57% |  ETA:  2m
#>  ■■■■■■■■■■■■■■■■■■■               59% |  ETA:  2m
#>  ■■■■■■■■■■■■■■■■■■■               61% |  ETA:  2m
#>  ■■■■■■■■■■■■■■■■■■■■              63% |  ETA:  2m
#>  ■■■■■■■■■■■■■■■■■■■■■             65% |  ETA:  1m
#>  ■■■■■■■■■■■■■■■■■■■■■             68% |  ETA:  1m
#>  ■■■■■■■■■■■■■■■■■■■■■■            71% |  ETA:  1m
#>  ■■■■■■■■■■■■■■■■■■■■■■■           74% |  ETA:  1m
#>  ■■■■■■■■■■■■■■■■■■■■■■■■          77% |  ETA:  1m
#>  ■■■■■■■■■■■■■■■■■■■■■■■■■         81% |  ETA: 42s
#>  ■■■■■■■■■■■■■■■■■■■■■■■■■■■       86% |  ETA: 30s
#>  ■■■■■■■■■■■■■■■■■■■■■■■■■■■■■     93% |  ETA: 13s
#> TRACE [2025-07-29 12:32:44] Calculating features' entropy
#> TRACE [2025-07-29 12:32:44] Counting features' number of peaks
#> INFO [2025-07-29 12:32:44] ... path to used parameters is data/interim/params
#> INFO [2025-07-29 12:32:44] ... path to export is data/interim/features/example_edgesSpectra.tsv
#> ✔ fea_edg_spe completed [3m 20.1s, 533.82 kB]
#> + lib_sop_mer dispatched
#> TRACE [2025-07-29 12:32:45] Loading and concatenating prepared libraries
#> TRACE [2025-07-29 12:32:56] Splitting the concatenated library into smaller standardized pieces
#> TRACE [2025-07-29 12:33:04] Sanitizing structures
#> TRACE [2025-07-29 12:33:04] Processing SMILES
#> INFO [2025-07-29 12:33:14] Passing 61 SMILES to RDKit
#> [12:33:14] Explicit valence for atom # 1 N, 3, is greater than permitted
#> [12:33:14] ERROR: Could not sanitize molecule on line 1
#> [12:33:14] ERROR: Explicit valence for atom # 1 N, 3, is greater than permitted
#> [12:33:14] Explicit valence for atom # 0 P, 11, is greater than permitted
#> [12:33:14] ERROR: Could not sanitize molecule on line 2
#> [12:33:14] ERROR: Explicit valence for atom # 0 P, 11, is greater than permitted
#> [12:33:14] Explicit valence for atom # 21 N, 4, is greater than permitted
#> [12:33:14] ERROR: Could not sanitize molecule on line 3
#> [12:33:14] ERROR: Explicit valence for atom # 21 N, 4, is greater than permitted
#> [12:33:14] Explicit valence for atom # 1 Cl, 4, is greater than permitted
#> [12:33:14] ERROR: Could not sanitize molecule on line 4
#> [12:33:14] ERROR: Explicit valence for atom # 1 Cl, 4, is greater than permitted
#> [12:33:14] Explicit valence for atom # 6 C, 5, is greater than permitted
#> [12:33:14] ERROR: Could not sanitize molecule on line 5
#> [12:33:14] ERROR: Explicit valence for atom # 6 C, 5, is greater than permitted
#> [12:33:14] Explicit valence for atom # 18 S, 7, is greater than permitted
#> [12:33:14] ERROR: Could not sanitize molecule on line 6
#> [12:33:14] ERROR: Explicit valence for atom # 18 S, 7, is greater than permitted
#> [12:33:14] SMILES Parse Error: syntax error while parsing: OC1=CC=CC(=C1)C-1=C2\CCC(=N2)\C(=C2/N\C(\C=C2)=C(/C2=N/C(/C=C2)=C(\C2=CC=C\-1N2)C1=CC(O)=CC=C1)C1=CC(O)=CC=C1)\C1=CC(O)=CC=C1
#> [12:33:14] SMILES Parse Error: check for mistakes around position 76:
#> [12:33:14] C(/C=C2)=C(\C2=CC=C\-1N2)C1=CC(O)=CC=C1)C
#> [12:33:14] ~~~~~~~~~~~~~~~~~~~~^
#> [12:33:14] SMILES Parse Error: Failed parsing SMILES 'OC1=CC=CC(=C1)C-1=C2\CCC(=N2)\C(=C2/N\C(\C=C2)=C(/C2=N/C(/C=C2)=C(\C2=CC=C\-1N2)C1=CC(O)=CC=C1)C1=CC(O)=CC=C1)\C1=CC(O)=CC=C1' for input: 'OC1=CC=CC(=C1)C-1=C2\CCC(=N2)\C(=C2/N\C(\C=C2)=C(/C2=N/C(/C=C2)=C(\C2=CC=C\-1N2)C1=CC(O)=CC=C1)C1=CC(O)=CC=C1)\C1=CC(O)=CC=C1'
#> [12:33:14] ERROR: Smiles parse error on line 7
#> [12:33:14] ERROR: Cannot create molecule from : 'OC1=CC=CC(=C1)C-1=C2\CCC(=N2)\C(=C2/N\C(\C=C2)=C(/C2=N/C(/C=C2)=C(\C2=CC=C\-1N2)C1=CC(O)=CC=C1)C1=CC(O)=CC=C1)\C1=CC(O)=CC=C1'
#> INFO [2025-07-29 12:33:37] Led to 877903 referenced structure-organism pairs
#> INFO [2025-07-29 12:33:40] Corresponding to 393389 unique stereoisomers (excluding structures without stereochemistry)...
#> INFO [2025-07-29 12:33:40] ... and 1007790 unique structures without stereochemistry...
#> INFO [2025-07-29 12:33:40] ... or 1184311 unique constitutional isomers (ignoring stereochemistry)
#> INFO [2025-07-29 12:34:00] ... among 36800 unique organisms
#> TRACE [2025-07-29 12:34:00] Keeping keys
#> TRACE [2025-07-29 12:34:00] Keeping organisms
#> TRACE [2025-07-29 12:34:00] Completing organisms taxonomy
#> TRACE [2025-07-29 12:34:00] Testing if Open Tree of Life API is up
#> TRACE [2025-07-29 12:34:01] Success! Submitting request
#> TRACE [2025-07-29 12:34:04] Request finished!
#> TRACE [2025-07-29 12:34:04] Getting taxonomy
#> TRACE [2025-07-29 12:34:06] Taxonomy retrieved!
#> TRACE [2025-07-29 12:34:06] Got OTTaxonomy!
#> TRACE [2025-07-29 12:34:06] Keeping structures
#> INFO [2025-07-29 12:34:06] ... path to used parameters is data/interim/params
#> INFO [2025-07-29 12:34:06] ... path to export is data/interim/libraries/sop/merged/keys.tsv.gz
#> INFO [2025-07-29 12:34:07] Directory data/interim/libraries/sop/merged/organisms/taxonomies created.
#> INFO [2025-07-29 12:34:07] ... path to export is data/interim/libraries/sop/merged/organisms/taxonomies/ott.tsv.gz
#> INFO [2025-07-29 12:34:07] ... path to export is data/interim/libraries/sop/merged/structures/stereo.tsv.gz
#> INFO [2025-07-29 12:34:10] ... path to export is data/interim/libraries/sop/merged/structures/metadata.tsv.gz
#> INFO [2025-07-29 12:34:12] ... path to export is data/interim/libraries/sop/merged/structures/names.tsv.gz
#> INFO [2025-07-29 12:34:12] Directory data/interim/libraries/sop/merged/structures/taxonomies created.
#> INFO [2025-07-29 12:34:12] ... path to export is data/interim/libraries/sop/merged/structures/taxonomies/classyfire.tsv.gz
#> INFO [2025-07-29 12:34:13] ... path to export is data/interim/libraries/sop/merged/structures/taxonomies/npc.tsv.gz
#> ✔ lib_sop_mer completed [1m 28.1s, 250 B]
#> + ann_spe_pos dispatched
#> TRACE [2025-07-29 12:34:14] Loading spectra
#> INFO [2025-07-29 12:34:16] Read 10000 spectra...
#> TRACE [2025-07-29 12:34:23] Harmonizing names
#> TRACE [2025-07-29 12:34:23] Filtering MS2 only
#> TRACE [2025-07-29 12:34:23] Combining spectra in case
#> TRACE [2025-07-29 12:34:24] Applying sanitization of the spectra
#> TRACE [2025-07-29 12:34:24] Filtering empty spectra
#> INFO [2025-07-29 12:34:24] Considering 3840 spectra
#> TRACE [2025-07-29 12:34:24] Loading spectral libraries
#> TRACE [2025-07-29 12:34:42] Harmonizing spectrum id
#> TRACE [2025-07-29 12:34:42] Harmonizing spectrum id
#> TRACE [2025-07-29 12:34:48] Harmonizing spectrum id
#> TRACE [2025-07-29 12:34:50] Harmonizing spectrum id
#> TRACE [2025-07-29 12:34:53] Harmonizing spectrum id
#> INFO [2025-07-29 12:35:09] Annotating using following libraries
#> INFO [2025-07-29 12:35:11]          library spectra unique_connectivities
#>  ISDB - Wikidata  998198                998198
#>             gnps  354788                 22675
#>           merlin  208273                 26197
#>         massbank   66388                  5901
#> TRACE [2025-07-29 12:35:11] Reducing library size
#> TRACE [2025-07-29 12:35:12] Annotating
#>  ■                                  0% |  ETA:  6m
#>  ■                                  1% |  ETA:  6m
#>  ■■                                 2% |  ETA:  5m
#>  ■■                                 3% |  ETA:  5m
#>  ■■■                                5% |  ETA:  4m
#>  ■■■                                7% |  ETA:  4m
#>  ■■■■                               8% |  ETA:  3m
#>  ■■■■                               9% |  ETA:  4m
#>  ■■■■                              11% |  ETA:  3m
#>  ■■■■■                             13% |  ETA:  3m
#>  ■■■■■                             14% |  ETA:  3m
#>  ■■■■■■                            16% |  ETA:  3m
#>  ■■■■■■                            18% |  ETA:  3m
#>  ■■■■■■■                           19% |  ETA:  3m
#>  ■■■■■■■                           21% |  ETA:  3m
#>  ■■■■■■■■                          23% |  ETA:  3m
#>  ■■■■■■■■                          24% |  ETA:  3m
#>  ■■■■■■■■■                         26% |  ETA:  2m
#>  ■■■■■■■■■                         28% |  ETA:  2m
#>  ■■■■■■■■■■                        30% |  ETA:  2m
#>  ■■■■■■■■■■                        31% |  ETA:  2m
#>  ■■■■■■■■■■■                       33% |  ETA:  2m
#>  ■■■■■■■■■■■                       35% |  ETA:  2m
#>  ■■■■■■■■■■■■                      37% |  ETA:  2m
#>  ■■■■■■■■■■■■                      38% |  ETA:  2m
#>  ■■■■■■■■■■■■■                     40% |  ETA:  2m
#>  ■■■■■■■■■■■■■                     42% |  ETA:  2m
#>  ■■■■■■■■■■■■■■                    44% |  ETA:  2m
#>  ■■■■■■■■■■■■■■■                   45% |  ETA:  2m
#>  ■■■■■■■■■■■■■■■                   47% |  ETA:  2m
#>  ■■■■■■■■■■■■■■■■                  49% |  ETA:  2m
#>  ■■■■■■■■■■■■■■■■                  51% |  ETA:  2m
#>  ■■■■■■■■■■■■■■■■■                 52% |  ETA:  1m
#>  ■■■■■■■■■■■■■■■■■                 54% |  ETA:  1m
#>  ■■■■■■■■■■■■■■■■■■                56% |  ETA:  1m
#>  ■■■■■■■■■■■■■■■■■■■               59% |  ETA:  1m
#>  ■■■■■■■■■■■■■■■■■■■               61% |  ETA:  1m
#>  ■■■■■■■■■■■■■■■■■■■■              63% |  ETA:  1m
#>  ■■■■■■■■■■■■■■■■■■■■              64% |  ETA:  1m
#>  ■■■■■■■■■■■■■■■■■■■■■             66% |  ETA:  1m
#>  ■■■■■■■■■■■■■■■■■■■■■■            69% |  ETA:  1m
#>  ■■■■■■■■■■■■■■■■■■■■■■            71% |  ETA:  1m
#>  ■■■■■■■■■■■■■■■■■■■■■■■           73% |  ETA: 46s
#>  ■■■■■■■■■■■■■■■■■■■■■■■■          75% |  ETA: 43s
#>  ■■■■■■■■■■■■■■■■■■■■■■■■          77% |  ETA: 39s
#>  ■■■■■■■■■■■■■■■■■■■■■■■■■         79% |  ETA: 36s
#>  ■■■■■■■■■■■■■■■■■■■■■■■■■         81% |  ETA: 32s
#>  ■■■■■■■■■■■■■■■■■■■■■■■■■■        83% |  ETA: 29s
#>  ■■■■■■■■■■■■■■■■■■■■■■■■■■■       85% |  ETA: 25s
#>  ■■■■■■■■■■■■■■■■■■■■■■■■■■■       87% |  ETA: 23s
#>  ■■■■■■■■■■■■■■■■■■■■■■■■■■■■      89% |  ETA: 19s
#>  ■■■■■■■■■■■■■■■■■■■■■■■■■■■■      90% |  ETA: 17s
#>  ■■■■■■■■■■■■■■■■■■■■■■■■■■■■■     92% |  ETA: 13s
#>  ■■■■■■■■■■■■■■■■■■■■■■■■■■■■■     94% |  ETA: 10s
#>  ■■■■■■■■■■■■■■■■■■■■■■■■■■■■■■    96% |  ETA:  7s
#>  ■■■■■■■■■■■■■■■■■■■■■■■■■■■■■■    98% |  ETA:  3s
#>  ■■■■■■■■■■■■■■■■■■■■■■■■■■■■■■■  100% |  ETA:  0s
#> TRACE [2025-07-29 12:38:02] Trying to harmonize adducts definitions
#> TRACE [2025-07-29 12:38:03] Filtering results above threshold only
#> INFO [2025-07-29 12:38:04] 321348 Candidates were annotated on 3679 features, with at least 0 similarity score.
#> INFO [2025-07-29 12:38:04] ... path to used parameters is data/interim/params
#> INFO [2025-07-29 12:38:04] ... path to export is data/interim/annotations/example_spectralMatches_pos.tsv.gz
#> ✔ ann_spe_pos completed [3m 51.4s, 37.54 MB]
#> + ann_spe_neg dispatched
#> TRACE [2025-07-29 12:38:07] Loading spectra
#> INFO [2025-07-29 12:38:09] Read 10000 spectra...
#> TRACE [2025-07-29 12:38:15] Harmonizing names
#> TRACE [2025-07-29 12:38:15] Filtering MS2 only
#> TRACE [2025-07-29 12:38:15] Combining spectra in case
#> TRACE [2025-07-29 12:38:15] Applying sanitization of the spectra
#> TRACE [2025-07-29 12:38:15] Filtering empty spectra
#> INFO [2025-07-29 12:38:15] Considering 0 spectra
#> WARN [2025-07-29 12:38:15] No spectra matched the given polarity, returning an empty dataframe
#> INFO [2025-07-29 12:38:15] ... path to used parameters is data/interim/params
#> INFO [2025-07-29 12:38:15] ... path to export is data/interim/annotations/example_spectralMatches_neg.tsv.gz
#> ✔ ann_spe_neg completed [8.3s, 187 B]
#> + edg_spe dispatched
#> ✔ edg_spe completed [0ms, 533.82 kB]
#> + lib_mer_key dispatched
#> ✔ lib_mer_key completed [1ms, 18.06 MB]
#> + lib_mer_str_met dispatched
#> ✔ lib_mer_str_met completed [0ms, 36.02 MB]
#> + lib_mer_str_nam dispatched
#> ✔ lib_mer_str_nam completed [0ms, 11.28 MB]
#> + lib_mer_str_stereo dispatched
#> ✔ lib_mer_str_stereo completed [0ms, 43.58 MB]
#> + lib_mer_str_tax_cla dispatched
#> ✔ lib_mer_str_tax_cla completed [0ms, 2.51 MB]
#> + lib_mer_str_tax_npc dispatched
#> ✔ lib_mer_str_tax_npc completed [0ms, 2.44 MB]
#> + lib_mer_org_tax_ott dispatched
#> ✔ lib_mer_org_tax_ott completed [0ms, 939.13 kB]
#> + ann_ms1_pre dispatched
#> TRACE [2025-07-29 12:38:39] Filtering desired adducts and adding mz tolerance
#> INFO [2025-07-29 12:38:41] Already 2112 adducts previously detected
#> TRACE [2025-07-29 12:38:41] Trying to harmonize adducts definitions
#> TRACE [2025-07-29 12:38:41] Calculating rt tolerance
#> TRACE [2025-07-29 12:38:41] Joining within given rt tolerance
#> INFO [2025-07-29 12:38:41] Here are the top 10 observed m/z differences inside the RT windows:
#> INFO [2025-07-29 12:38:41]              bin   N
#>  (4.8501,5.0366] 352
#>  (21.822,22.009] 283
#>   (16.973,17.16] 208
#>  (17.906,18.092] 192
#>  (15.854,16.041] 172
#>    (39.914,40.1] 143
#>  (38.981,39.168] 137
#>  (34.878,35.065] 115
#>  (77.962,78.148] 114
#>  (1.8659,2.0524] 108
#> INFO [2025-07-29 12:38:41] These differences may help identify potential preprocessing issues
#> TRACE [2025-07-29 12:38:41] Forming adducts and clusters
#> TRACE [2025-07-29 12:38:41] Calculating delta mz for single charge adducts and clusters
#> TRACE [2025-07-29 12:38:41] Joining within given delta mz tolerance (neutral losses)
#> TRACE [2025-07-29 12:38:41] Joining within given delta mz tolerance (adducts)
#> TRACE [2025-07-29 12:38:41] Keeping initial and destination feature
#> TRACE [2025-07-29 12:38:41] Joining with initial results (adducts)
#> TRACE [2025-07-29 12:38:41] Joining with initial results (neutral losses)
#> TRACE [2025-07-29 12:38:43] Joining within given mz tol to exact mass library
#> TRACE [2025-07-29 12:38:43] Keeping unique exact masses and molecular formulas
#> TRACE [2025-07-29 12:38:44] Joining exact masses with single charge adducts
#> TRACE [2025-07-29 12:38:44] Getting back to M
#> TRACE [2025-07-29 12:38:44] Calculating multicharged and in source dimers
#> TRACE [2025-07-29 12:39:12] Joining within given rt tolerance
#> TRACE [2025-07-29 12:39:12] Joining within given mz tol and filtering possible adducts
#> TRACE [2025-07-29 12:39:12] Joining single adducts, in source dimers, and multicharged
#> TRACE [2025-07-29 12:39:12] Adding chemical classification
#> INFO [2025-07-29 12:39:13] MS1 annotation led to 48099 annotations on 4224 features
#> INFO [2025-07-29 12:39:13] ... path to used parameters is data/interim/params
#> INFO [2025-07-29 12:39:13] ... path to export is data/interim/features/example_edgesMasses.tsv
#> INFO [2025-07-29 12:39:13] ... path to export is data/interim/annotations/example_ms1Prepared.tsv.gz
#> ✔ ann_ms1_pre completed [55.2s, 158 B]
#> + ann_spe_exp_gnp_pre dispatched
#> WARN [2025-07-29 12:39:15] No GNPS annotations found, returning an empty file instead
#> INFO [2025-07-29 12:39:15] ... path to used parameters is data/interim/params
#> INFO [2025-07-29 12:39:15] ... path to export is data/interim/annotations/example_gnpsPrepared.tsv.gz
#> ✔ ann_spe_exp_gnp_pre completed [20ms, 237 B]
#> + ann_spe_pre dispatched
#> TRACE [2025-07-29 12:39:15] Loading and formatting spectral matches
#> TRACE [2025-07-29 12:39:15] Selecting annotations columns
#> TRACE [2025-07-29 12:39:15] Trying to look for already computed metadata
#> TRACE [2025-07-29 12:39:18] Stereo loaded
#> TRACE [2025-07-29 12:39:22] Metadata loaded
#> TRACE [2025-07-29 12:39:29] Names loaded
#> TRACE [2025-07-29 12:39:29] Classyfire done
#> TRACE [2025-07-29 12:39:30] NPClassifier done
#> TRACE [2025-07-29 12:39:30] Metadata done
#> TRACE [2025-07-29 12:39:30] Names done
#> INFO [2025-07-29 12:39:51] ... path to used parameters is data/interim/params
#> INFO [2025-07-29 12:39:51] ... path to export is data/interim/annotations/example_spectralMatchesPrepared.tsv.gz
#> ✔ ann_spe_pre completed [39s, 60.86 MB]
#> + ann_sir_pre dispatched
#> INFO [2025-07-29 12:39:55] Loading parameters for SIRIUS 6
#> TRACE [2025-07-29 12:39:55] Loading and formatting SIRIUS results...
#> TRACE [2025-07-29 12:39:55] ... CANOPUS loaded
#> TRACE [2025-07-29 12:39:55] ... formulas loaded
#> TRACE [2025-07-29 12:39:55] ... structures loaded
#> TRACE [2025-07-29 12:39:55] ... de novo loaded
#> TRACE [2025-07-29 12:39:55] ... CANOPUS prepared
#> TRACE [2025-07-29 12:39:55] ... formulas prepared
#> TRACE [2025-07-29 12:39:55] ... structures prepared
#> TRACE [2025-07-29 12:39:55] ... denovo prepared
#> TRACE [2025-07-29 12:39:55] Everything joined together
#> TRACE [2025-07-29 12:39:55] Selecting annotations columns
#> TRACE [2025-07-29 12:39:55] Trying to look for already computed metadata
#> TRACE [2025-07-29 12:39:58] Stereo loaded
#> TRACE [2025-07-29 12:40:02] Metadata loaded
#> TRACE [2025-07-29 12:40:08] Names loaded
#> TRACE [2025-07-29 12:40:08] Classyfire done
#> TRACE [2025-07-29 12:40:09] NPClassifier done
#> TRACE [2025-07-29 12:40:09] Metadata done
#> TRACE [2025-07-29 12:40:09] Names done
#> TRACE [2025-07-29 12:40:13] Splitting SIRIUS results
#> INFO [2025-07-29 12:40:13] ... path to used parameters is data/interim/params
#> INFO [2025-07-29 12:40:13] ... path to export is data/interim/annotations/example_canopusPrepared.tsv.gz
#> INFO [2025-07-29 12:40:13] ... path to export is data/interim/annotations/example_formulaPrepared.tsv.gz
#> INFO [2025-07-29 12:40:13] ... path to export is data/interim/annotations/example_siriusPrepared.tsv.gz
#> ✔ ann_sir_pre completed [17.5s, 165 B]
#> + tax_pre dispatched
#> TRACE [2025-07-29 12:40:14] Loading feature table
#> TRACE [2025-07-29 12:40:14] Loading metadata table
#> TRACE [2025-07-29 12:40:14] Preparing organisms names
#> TRACE [2025-07-29 12:40:14] Retrieving already computed Open Tree of Life Taxonomy
#> TRACE [2025-07-29 12:40:14] Submitting the rest to OTL
#> TRACE [2025-07-29 12:40:14] Testing if Open Tree of Life API is up
#> TRACE [2025-07-29 12:40:15] Success! Submitting request
#> TRACE [2025-07-29 12:40:15] Request finished!
#> INFO [2025-07-29 12:40:15] Retrying with blk
#> TRACE [2025-07-29 12:40:15] Getting taxonomy
#> TRACE [2025-07-29 12:40:15] Taxonomy retrieved!
#> TRACE [2025-07-29 12:40:15] Got OTTaxonomy!
#> TRACE [2025-07-29 12:40:15] Joining all results
#> TRACE [2025-07-29 12:40:15] Joining with metadata table
#> TRACE [2025-07-29 12:40:15] Joining with cleaned taxonomy table
#> INFO [2025-07-29 12:40:15] ... path to used parameters is data/interim/params
#> INFO [2025-07-29 12:40:15] Directory data/interim/taxa created.
#> INFO [2025-07-29 12:40:15] ... path to export is data/interim/taxa/example_taxed.tsv.gz
#> ✔ tax_pre completed [1.3s, 19.70 kB]
#> + ann_ms1_pre_edg dispatched
#> ✔ ann_ms1_pre_edg completed [1ms, 81.71 kB]
#> + ann_ms1_pre_ann dispatched
#> ✔ ann_ms1_pre_ann completed [1ms, 10.81 MB]
#> + ann_sir_pre_can dispatched
#> ✔ ann_sir_pre_can completed [0ms, 784 B]
#> + ann_sir_pre_for dispatched
#> ✔ ann_sir_pre_for completed [0ms, 487 B]
#> + ann_sir_pre_str dispatched
#> ✔ ann_sir_pre_str completed [0ms, 24.42 kB]
#> + fea_edg_pre dispatched
#> TRACE [2025-07-29 12:40:17] Loading edge table
#> TRACE [2025-07-29 12:40:17] Formatting edge table
#> INFO [2025-07-29 12:40:18] ... path to used parameters is data/interim/params
#> INFO [2025-07-29 12:40:18] ... path to export is data/interim/features/example_edges.tsv
#> ✔ fea_edg_pre completed [54ms, 758.28 kB]
#> + ann_fil dispatched
#> TRACE [2025-07-29 12:40:18] ... features
#> TRACE [2025-07-29 12:40:18] ... annotations
#> TRACE [2025-07-29 12:40:23] Removing MS1 annotations for which we have spectral hits
#> TRACE [2025-07-29 12:40:25] ... retention times
#> INFO [2025-07-29 12:40:26] Filtering annotations outside of Inf minutes tolerance
#> INFO [2025-07-29 12:40:30] 0 candidates were removed based on retention time.
#> INFO [2025-07-29 12:40:30] ... path to used parameters is data/interim/params
#> INFO [2025-07-29 12:40:30] ... path to export is data/interim/annotations/example_annotationsFiltered.tsv.gz
#> ✔ ann_fil completed [15.5s, 56.07 MB]
#> + fea_com dispatched
#> INFO [2025-07-29 12:40:34] ... path to used parameters is data/interim/params
#> INFO [2025-07-29 12:40:34] ... path to export is data/interim/features/example_components.tsv
#> ✔ fea_com completed [53ms, 51.39 kB]
#> + int_com dispatched
#> ✔ int_com completed [1ms, 51.39 kB]
#> + fea_com_pre dispatched
#> TRACE [2025-07-29 12:40:35] Loading files
#> TRACE [2025-07-29 12:40:35] Components table
#> INFO [2025-07-29 12:40:35] ... path to used parameters is data/interim/params
#> INFO [2025-07-29 12:40:35] ... path to export is data/interim/features/example_componentsPrepared.tsv
#> ✔ fea_com_pre completed [20ms, 51.38 kB]
#> + ann_pre dispatched
#> TRACE [2025-07-29 12:40:35] Loading files ...
#> TRACE [2025-07-29 12:40:35] ... components
#> TRACE [2025-07-29 12:40:35] ... edges
#> TRACE [2025-07-29 12:40:35] ... structure-organism pairs
#> TRACE [2025-07-29 12:40:46] ... canopus
#> TRACE [2025-07-29 12:40:46] ... formula
#> TRACE [2025-07-29 12:40:46] ... annotations
#> TRACE [2025-07-29 12:40:54] Initial annotations:
#> INFO [2025-07-29 12:40:55]  candidate_library      n
#>    ISDB - Wikidata 577751
#>           TIMA MS1  82324
#>               gnps  25161
#>             merlin  23150
#>           massbank   3591
#>             SIRIUS    479
#> TRACE [2025-07-29 12:40:55] Re-arranging annotations
#> TRACE [2025-07-29 12:41:02] Adding biological organism metadata
#> TRACE [2025-07-29 12:41:02] Performing taxonomically informed scoring
#> TRACE [2025-07-29 12:41:06] Calculating biological score at all levels ...
#> TRACE [2025-07-29 12:41:06] ... domain
#> TRACE [2025-07-29 12:41:06] ... kingdom
#> TRACE [2025-07-29 12:41:06] ... phylum
#> TRACE [2025-07-29 12:41:06] ... class
#> TRACE [2025-07-29 12:41:06] ... order
#> TRACE [2025-07-29 12:41:06] ... family
#> TRACE [2025-07-29 12:41:06] ... tribe
#> TRACE [2025-07-29 12:41:06] ... genus
#> TRACE [2025-07-29 12:41:06] ... species
#> TRACE [2025-07-29 12:41:06] ... varietas
#> TRACE [2025-07-29 12:41:06] Keeping best biological score
#> INFO [2025-07-29 12:41:09] Taxonomically informed metabolite annotation led to 
#> 41238 annotations reranked at the kingdom level, 
#> 40784 annotations reranked at the phylum level, 
#> 35057 annotations reranked at the class level, 
#> 9353 annotations reranked at the order level, 
#> 7515 annotations reranked at the family level, 
#> 1184 annotations reranked at the tribe level, 
#> 919 annotations reranked at the genus level, 
#> 402 annotations reranked at the species level, and 
#> 0 annotations reranked at the variety level. 
#> WITHOUT TAKING CONSISTENCY SCORE INTO ACCOUNT! (for later predictions)
#> TRACE [2025-07-29 12:41:09] Calculating chemical consistency of features with at least 2 neighbors
#> TRACE [2025-07-29 12:41:10] Among all edges
#> TRACE [2025-07-29 12:41:10] ... at the (classyfire) kingdom level
#> TRACE [2025-07-29 12:41:10] ... at the (NPC) pathway level
#> TRACE [2025-07-29 12:41:10] ... at the (classyfire) superclass level
#> TRACE [2025-07-29 12:41:10] ... at the (NPC) superclass level
#> TRACE [2025-07-29 12:41:11] ... at the (classyfire) class level
#> TRACE [2025-07-29 12:41:12] ... at the (NPC) class level
#> TRACE [2025-07-29 12:41:13] ... at the (classyfire) parent level
#> TRACE [2025-07-29 12:41:15] Splitting already computed predictions
#> TRACE [2025-07-29 12:41:18] Joining all except -1 together
#> TRACE [2025-07-29 12:41:24] Adding already computed predictions back
#> TRACE [2025-07-29 12:41:30] Calculating chemical score at all levels ...
#> TRACE [2025-07-29 12:41:30] ... (classyfire) kingdom
#> TRACE [2025-07-29 12:41:30] ... (NPC) pathway
#> TRACE [2025-07-29 12:41:30] ... (classyfire) superclass
#> TRACE [2025-07-29 12:41:30] ... (NPC) superclass
#> TRACE [2025-07-29 12:41:30] ... (classyfire) class
#> TRACE [2025-07-29 12:41:30] ... (NPC) class
#> TRACE [2025-07-29 12:41:30] ... (classyfire) parent
#> TRACE [2025-07-29 12:41:30] ... keeping best chemical score
#> INFO [2025-07-29 12:41:31] Chemically informed scoring led to 
#> 40464 annotations reranked at the (classyfire) kingdom level, 
#> 31334 annotations reranked at the (NPC) pathway level, 
#> 28047 annotations reranked at the (classyfire) superclass level, 
#> 23948 annotations reranked at the (NPC) superclass level, 
#> 28047 annotations reranked at the (classyfire) class level, 
#> 23948 annotations reranked at the (NPC) class level, and 
#> 23927 annotations reranked at the (classyfire) parent level. 
#> WITHOUT TAKING CONSISTENCY SCORE INTO ACCOUNT!
#> INFO [2025-07-29 12:41:31] Filtering top 1 candidates and keeping only MS1 candidates with minimum 0 biological score OR 0 chemical score 
#> TRACE [2025-07-29 12:41:40] Minimizing results
#> TRACE [2025-07-29 12:41:43] Keeping high confidence candidates only
#> INFO [2025-07-29 12:41:43] Removed 693032 low confidence candidates out of the 695221 total ones.
#> INFO [2025-07-29 12:41:43] 2189 high confidence candidates remaining.
#> TRACE [2025-07-29 12:41:43] Processing full results
#> TRACE [2025-07-29 12:41:43] Adding initial metadata (RT, etc.) and simplifying columns
#> TRACE [2025-07-29 12:41:48] Selecting columns to export
#> TRACE [2025-07-29 12:41:48] Adding consensus again to droped candidates
#> TRACE [2025-07-29 12:41:54] Processing filtered results
#> TRACE [2025-07-29 12:41:54] Adding initial metadata (RT, etc.) and simplifying columns
#> TRACE [2025-07-29 12:41:58] Selecting columns to export
#> TRACE [2025-07-29 12:41:58] Adding consensus again to droped candidates
#> INFO [2025-07-29 12:42:04] Directory data/processed/20250729_124204_example created.
#> INFO [2025-07-29 12:42:04] ... path to used parameters is data/processed/20250729_124204_example
#> INFO [2025-07-29 12:42:05] ... path to used parameters is data/processed/20250729_124204_example
#> INFO [2025-07-29 12:42:05] ... path to export is data/processed/20250729_124204_example/example_results_mini.tsv
#> INFO [2025-07-29 12:42:05] ... path to export is data/processed/20250729_124204_example/example_results_filtered.tsv
#> INFO [2025-07-29 12:42:05] ... path to export is data/processed/20250729_124204_example/example_results.tsv
#> ✔ ann_pre completed [1m 29.4s, 3.97 MB]
#> ✔ ended pipeline [16m 40.4s, 126 completed, 0 skipped]
#> There were 14 warnings (use warnings() to see them)
#> Processing complete. Total molecules processed: 54
#> [1] TRUE

The final exported file is formatted in order to be easily imported in Cytoscape to further explore your data!

We hope you enjoyed using TIMA and are pleased to hear from you!

For any remark or suggestion, please fill an issue or feel free to contact us directly.

Citation

BibTeX citation:
@online{rutz2025,
  author = {Rutz, Adriano},
  title = {3 {Performing} {Taxonomically} {Informed} {Metabolite}
    {Annotation}},
  date = {2025-07-29},
  url = {https://taxonomicallyinformedannotation.github.io/tima/vignettes/articles/III-processing.html},
  langid = {en}
}
For attribution, please cite this work as:
Rutz, Adriano. 2025. “3 Performing Taxonomically Informed Metabolite Annotation.” July 29, 2025. https://taxonomicallyinformedannotation.github.io/tima/vignettes/articles/III-processing.html.