/home/hltcoe/rwicks/.conda/envs/nicklid/lib/python3.10/site-packages/transformers/utils/hub.py:111: FutureWarning: Using `TRANSFORMERS_CACHE` is deprecated and will be removed in v5 of Transformers. Use `HF_HOME` instead.
  warnings.warn(
2025-10-02 00:18:29 | INFO | datasets | PyTorch version 2.6.0 available.
2025-10-02 00:18:36 | INFO | lidirl | Namespace(num_layers=1, bidirectional=False, architecture='lstm', config='config.yaml', data_path='/exp/rwicks/langid/build-data/huggingface/filtered+openlid-v2+ext', augmentations=['antspeak=1', 'ngrams=1', 'hashtag=1', 'short=1', 'spongebob=1', 'codeswitch=1', 'leetspeak=1', 'cyrillic=1', 'replaceurl=1', 'addurl=1', 'html=1', 'addemojis=1', 'replaceemojis=1', 'delete=1', 'add=1', 'swap=1', 'allcaps=1', 'email=1', 'whitespace=1', 'accenting=1', 'binarization=1'], augmentation_probability=0.0, temperature=3.3, input_type='bytes', batch_size=32, num_workers=1, outdir='/exp/rwicks/langid/exp/filtered+openlid-v2+ext/augless/flat/lstm/Hidden_LARGE_Embed_LARGE_NLayer_SMALL_LR_0.001/models', checkpoint=None, experiment_version=None, tb_dir='/exp/rwicks/langid/exp/filtered+openlid-v2+ext/augless/flat/lstm/Hidden_LARGE_Embed_LARGE_NLayer_SMALL_LR_0.001/tensorboard', save_top_k=10, save_checkpoint_last=True, log_interval=512, lr=0.001, cpu=False, min_epochs=0, max_epochs=-1, update_interval=16, lr_step_interval=100, validation_interval=8192, patience=inf, max_updates=250000.0, seed=141414, compile=False, scheduler='schedule_free', warmup_steps=0, gamma=0.0, alpha=-1.0, pred_type='multilabel', embed_dim=256, hidden_dim=256, dropout=0.1, montecarlo_layer=False, max_length=1024, layout='flat')
2025-10-02 00:25:48 | INFO | lidirl | Creating a train dataset with bytes inputs.
2025-10-02 00:25:48 | INFO | lidirl | Reading training files to calculate sampling probability...
2025-10-02 00:25:48 | INFO | lidirl | Label __label__ace_Arab changed from original probability 0.0024 to temperature adjusted probability 0.1308.
2025-10-02 00:25:48 | INFO | lidirl | Label __label__ace_Latn changed from original probability 0.0113 to temperature adjusted probability 0.2082. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__acm_Arab changed from original probability 0.0036 to temperature adjusted probability 0.1470. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__acq_Arab changed from original probability 0.0007 to temperature adjusted probability 0.0898. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__aeb_Arab changed from original probability 0.0131 to temperature adjusted probability 0.2181. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__afr_Latn changed from original probability 0.5857 to temperature adjusted probability 0.6898. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__als_Latn changed from original probability 0.1358 to temperature adjusted probability 0.4430. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__amh_Ethi changed from original probability 0.3179 to temperature adjusted probability 0.5731. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__apc_Arab changed from original probability 0.0451 to temperature adjusted probability 0.3172. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__arb_Arab changed from original probability 3.4596 to temperature adjusted probability 1.1815. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__ars_Arab changed from original probability 0.0106 to temperature adjusted probability 0.2045. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__ary_Arab changed from original probability 0.0234 to temperature adjusted probability 0.2601. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__arz_Arab changed from original probability 0.0318 to temperature adjusted probability 0.2853. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__asm_Beng changed from original probability 0.0766 to temperature adjusted probability 0.3724. 
2025-10-02 00:25:48 | INFO | lidirl | Label __label__ast_Latn changed from original probability 0.4507 to temperature adjusted probability 0.6371. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__awa_Deva changed from original probability 0.0044 to temperature adjusted probability 0.1570. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__ayr_Latn changed from original probability 0.0539 to temperature adjusted probability 0.3347. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__azb_Arab changed from original probability 0.1302 to temperature adjusted probability 0.4373. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__azj_Latn changed from original probability 0.5173 to temperature adjusted probability 0.6643. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__bak_Cyrl changed from original probability 0.1950 to temperature adjusted probability 0.4943. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__bam_Latn changed from original probability 0.0045 to temperature adjusted probability 0.1574. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__ban_Latn changed from original probability 0.0198 to temperature adjusted probability 0.2471. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__bel_Cyrl changed from original probability 0.3793 to temperature adjusted probability 0.6047. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__bem_Latn changed from original probability 0.1456 to temperature adjusted probability 0.4524. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__ben_Beng changed from original probability 0.4944 to temperature adjusted probability 0.6552. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__bho_Deva changed from original probability 0.0281 to temperature adjusted probability 0.2747. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__bjn_Arab changed from original probability 0.0024 to temperature adjusted probability 0.1303. 
2025-10-02 00:25:48 | INFO | lidirl | Label __label__bjn_Latn changed from original probability 0.0149 to temperature adjusted probability 0.2269. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__bod_Tibt changed from original probability 0.0175 to temperature adjusted probability 0.2381. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__bos_Latn changed from original probability 1.6792 to temperature adjusted probability 0.9491. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__bug_Latn changed from original probability 0.0043 to temperature adjusted probability 0.1552. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__bul_Cyrl changed from original probability 0.8136 to temperature adjusted probability 0.7620. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__cat_Latn changed from original probability 1.7819 to temperature adjusted probability 0.9663. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__ceb_Latn changed from original probability 1.7190 to temperature adjusted probability 0.9559. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__ces_Latn changed from original probability 1.4709 to temperature adjusted probability 0.9118. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__cjk_Latn changed from original probability 0.0136 to temperature adjusted probability 0.2207. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__ckb_Arab changed from original probability 0.0602 to temperature adjusted probability 0.3462. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__cmn_Hans changed from original probability 0.3900 to temperature adjusted probability 0.6098. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__cmn_Hant changed from original probability 0.7544 to temperature adjusted probability 0.7448. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__crh_Latn changed from original probability 0.0171 to temperature adjusted probability 0.2363. 
2025-10-02 00:25:48 | INFO | lidirl | Label __label__cym_Latn changed from original probability 0.2621 to temperature adjusted probability 0.5406. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__dan_Latn changed from original probability 1.5608 to temperature adjusted probability 0.9283. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__deu_Latn changed from original probability 2.1275 to temperature adjusted probability 1.0197. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__dik_Latn changed from original probability 0.0104 to temperature adjusted probability 0.2031. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__dyu_Latn changed from original probability 0.0064 to temperature adjusted probability 0.1754. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__dzo_Tibt changed from original probability 0.0040 to temperature adjusted probability 0.1522. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__ekk_Latn changed from original probability 1.7178 to temperature adjusted probability 0.9557. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__ell_Grek changed from original probability 1.8254 to temperature adjusted probability 0.9734. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__eng_Latn changed from original probability 3.7925 to temperature adjusted probability 1.2149. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__epo_Latn changed from original probability 0.6594 to temperature adjusted probability 0.7150. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__eus_Latn changed from original probability 0.8015 to temperature adjusted probability 0.7585. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__ewe_Latn changed from original probability 0.2220 to temperature adjusted probability 0.5141. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__fao_Latn changed from original probability 0.0275 to temperature adjusted probability 0.2730. 
2025-10-02 00:25:48 | INFO | lidirl | Label __label__fij_Latn changed from original probability 0.1369 to temperature adjusted probability 0.4441. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__fil_Latn changed from original probability 0.5170 to temperature adjusted probability 0.6642. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__fin_Latn changed from original probability 2.0156 to temperature adjusted probability 1.0031. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__fon_Latn changed from original probability 0.0111 to temperature adjusted probability 0.2075. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__fra_Latn changed from original probability 2.0779 to temperature adjusted probability 1.0124. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__fur_Latn changed from original probability 0.0253 to temperature adjusted probability 0.2662. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__fuv_Latn changed from original probability 0.0052 to temperature adjusted probability 0.1652. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__gaz_Latn changed from original probability 0.1271 to temperature adjusted probability 0.4341. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__gla_Latn changed from original probability 0.0332 to temperature adjusted probability 0.2891. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__gle_Latn changed from original probability 0.1343 to temperature adjusted probability 0.4414. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__glg_Latn changed from original probability 0.4653 to temperature adjusted probability 0.6433. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__gug_Latn changed from original probability 0.0308 to temperature adjusted probability 0.2826. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__guj_Gujr changed from original probability 0.3614 to temperature adjusted probability 0.5959. 
2025-10-02 00:25:48 | INFO | lidirl | Label __label__hat_Latn changed from original probability 0.1380 to temperature adjusted probability 0.4451. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__hau_Latn changed from original probability 0.1998 to temperature adjusted probability 0.4979. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__heb_Hebr changed from original probability 1.6002 to temperature adjusted probability 0.9353. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__hin_Deva changed from original probability 0.6324 to temperature adjusted probability 0.7060. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__hne_Deva changed from original probability 0.0179 to temperature adjusted probability 0.2396. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__hrv_Latn changed from original probability 2.6058 to temperature adjusted probability 1.0843. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__hun_Latn changed from original probability 2.3485 to temperature adjusted probability 1.0507. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__hye_Armn changed from original probability 0.6994 to temperature adjusted probability 0.7279. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__ibo_Latn changed from original probability 0.2425 to temperature adjusted probability 0.5280. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__ilo_Latn changed from original probability 0.3852 to temperature adjusted probability 0.6075. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__ind_Latn changed from original probability 2.0609 to temperature adjusted probability 1.0099. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__isl_Latn changed from original probability 0.0954 to temperature adjusted probability 0.3980. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__ita_Latn changed from original probability 2.0429 to temperature adjusted probability 1.0072. 
2025-10-02 00:25:48 | INFO | lidirl | Label __label__jav_Latn changed from original probability 0.0881 to temperature adjusted probability 0.3885. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__jpn_Jpan changed from original probability 2.2283 to temperature adjusted probability 1.0341. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__kab_Latn changed from original probability 0.0241 to temperature adjusted probability 0.2624. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__kac_Latn changed from original probability 0.0043 to temperature adjusted probability 0.1551. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__kam_Latn changed from original probability 0.0196 to temperature adjusted probability 0.2464. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__kan_Knda changed from original probability 0.2583 to temperature adjusted probability 0.5382. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__kas_Arab changed from original probability 0.0038 to temperature adjusted probability 0.1504. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__kas_Deva changed from original probability 0.0026 to temperature adjusted probability 0.1329. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__kat_Geor changed from original probability 0.3925 to temperature adjusted probability 0.6110. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__kaz_Cyrl changed from original probability 0.2699 to temperature adjusted probability 0.5454. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__kbp_Latn changed from original probability 0.0156 to temperature adjusted probability 0.2300. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__kea_Latn changed from original probability 0.0021 to temperature adjusted probability 0.1250. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__khk_Cyrl changed from original probability 0.0636 to temperature adjusted probability 0.3520. 
2025-10-02 00:25:48 | INFO | lidirl | Label __label__khm_Khmr changed from original probability 0.0592 to temperature adjusted probability 0.3443. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__kik_Latn changed from original probability 0.0366 to temperature adjusted probability 0.2977. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__kin_Latn changed from original probability 0.2456 to temperature adjusted probability 0.5301. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__kir_Cyrl changed from original probability 0.2151 to temperature adjusted probability 0.5092. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__kmb_Latn changed from original probability 0.0347 to temperature adjusted probability 0.2929. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__kmr_Latn changed from original probability 0.0058 to temperature adjusted probability 0.1701. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__knc_Arab changed from original probability 0.0024 to temperature adjusted probability 0.1306. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__knc_Latn changed from original probability 0.0024 to temperature adjusted probability 0.1301. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__kor_Hang changed from original probability 1.3296 to temperature adjusted probability 0.8843. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__ktu_Latn changed from original probability 0.0795 to temperature adjusted probability 0.3767. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__lao_Laoo changed from original probability 0.0138 to temperature adjusted probability 0.2217. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__lij_Latn changed from original probability 0.0194 to temperature adjusted probability 0.2457. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__lim_Latn changed from original probability 0.0496 to temperature adjusted probability 0.3265. 
2025-10-02 00:25:48 | INFO | lidirl | Label __label__lin_Latn changed from original probability 0.2083 to temperature adjusted probability 0.5043. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__lit_Latn changed from original probability 1.3313 to temperature adjusted probability 0.8846. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__lmo_Latn changed from original probability 0.0593 to temperature adjusted probability 0.3445. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__ltg_Latn changed from original probability 0.0064 to temperature adjusted probability 0.1752. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__ltz_Latn changed from original probability 0.1319 to temperature adjusted probability 0.4390. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__lua_Latn changed from original probability 0.1107 to temperature adjusted probability 0.4164. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__lug_Latn changed from original probability 0.1013 to temperature adjusted probability 0.4053. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__luo_Latn changed from original probability 0.0516 to temperature adjusted probability 0.3304. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__lus_Latn changed from original probability 0.0735 to temperature adjusted probability 0.3678. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__lvs_Latn changed from original probability 1.2744 to temperature adjusted probability 0.8730. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__mag_Deva changed from original probability 0.0024 to temperature adjusted probability 0.1306. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__mai_Deva changed from original probability 0.0184 to temperature adjusted probability 0.2418. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__mal_Mlym changed from original probability 0.2934 to temperature adjusted probability 0.5594. 
2025-10-02 00:25:48 | INFO | lidirl | Label __label__mar_Deva changed from original probability 0.4726 to temperature adjusted probability 0.6463. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__min_Latn changed from original probability 0.1134 to temperature adjusted probability 0.4194. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__mkd_Cyrl changed from original probability 0.4574 to temperature adjusted probability 0.6400. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__mlt_Latn changed from original probability 0.8541 to temperature adjusted probability 0.7733. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__mni_Beng changed from original probability 0.0180 to temperature adjusted probability 0.2400. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__mos_Latn changed from original probability 0.0732 to temperature adjusted probability 0.3673. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__mri_Latn changed from original probability 0.0212 to temperature adjusted probability 0.2522. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__mya_Mymr changed from original probability 0.3114 to temperature adjusted probability 0.5696. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__nld_Latn changed from original probability 2.9522 to temperature adjusted probability 1.1261. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__nno_Latn changed from original probability 0.2746 to temperature adjusted probability 0.5483. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__nob_Latn changed from original probability 1.6251 to temperature adjusted probability 0.9397. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__npi_Deva changed from original probability 0.1231 to temperature adjusted probability 0.4300. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__nso_Latn changed from original probability 0.2158 to temperature adjusted probability 0.5097. 
2025-10-02 00:25:48 | INFO | lidirl | Label __label__nus_Latn changed from original probability 0.0024 to temperature adjusted probability 0.1301. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__nya_Latn changed from original probability 0.3011 to temperature adjusted probability 0.5638. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__oci_Latn changed from original probability 0.2204 to temperature adjusted probability 0.5130. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__ory_Orya changed from original probability 0.0608 to temperature adjusted probability 0.3473. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__pag_Latn changed from original probability 0.1119 to temperature adjusted probability 0.4177. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__pan_Guru changed from original probability 0.2107 to temperature adjusted probability 0.5060. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__pap_Latn changed from original probability 0.1567 to temperature adjusted probability 0.4626. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__pbt_Arab changed from original probability 0.1603 to temperature adjusted probability 0.4658. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__pes_Arab changed from original probability 1.0801 to temperature adjusted probability 0.8303. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__plt_Latn changed from original probability 0.0180 to temperature adjusted probability 0.2400. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__pol_Latn changed from original probability 3.1004 to temperature adjusted probability 1.1429. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__por_Latn changed from original probability 3.2705 to temperature adjusted probability 1.1616. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__prs_Arab changed from original probability 0.0118 to temperature adjusted probability 0.2115. 
2025-10-02 00:25:48 | INFO | lidirl | Label __label__quy_Latn changed from original probability 0.0735 to temperature adjusted probability 0.3677. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__ron_Latn changed from original probability 0.9331 to temperature adjusted probability 0.7943. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__run_Latn changed from original probability 0.1749 to temperature adjusted probability 0.4782. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__rus_Cyrl changed from original probability 3.7658 to temperature adjusted probability 1.2123. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__sag_Latn changed from original probability 0.0966 to temperature adjusted probability 0.3995. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__san_Deva changed from original probability 0.0393 to temperature adjusted probability 0.3042. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__sat_Olck changed from original probability 0.0276 to temperature adjusted probability 0.2732. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__scn_Latn changed from original probability 0.0332 to temperature adjusted probability 0.2891. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__shn_Mymr changed from original probability 0.0171 to temperature adjusted probability 0.2364. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__sin_Sinh changed from original probability 0.2077 to temperature adjusted probability 0.5038. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__slk_Latn changed from original probability 1.5180 to temperature adjusted probability 0.9205. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__slv_Latn changed from original probability 1.5188 to temperature adjusted probability 0.9207. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__smo_Latn changed from original probability 0.1397 to temperature adjusted probability 0.4468. 
2025-10-02 00:25:48 | INFO | lidirl | Label __label__sna_Latn changed from original probability 0.3024 to temperature adjusted probability 0.5645. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__snd_Arab changed from original probability 0.0347 to temperature adjusted probability 0.2931. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__som_Latn changed from original probability 0.0841 to temperature adjusted probability 0.3831. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__sot_Latn changed from original probability 0.0008 to temperature adjusted probability 0.0918. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__spa_Latn changed from original probability 2.1143 to temperature adjusted probability 1.0177. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__srd_Latn changed from original probability 0.0300 to temperature adjusted probability 0.2802. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__srp_Cyrl changed from original probability 0.9000 to temperature adjusted probability 0.7857. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__ssw_Latn changed from original probability 0.0439 to temperature adjusted probability 0.3146. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__sun_Latn changed from original probability 0.0638 to temperature adjusted probability 0.3522. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__swe_Latn changed from original probability 2.7984 to temperature adjusted probability 1.1080. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__swh_Latn changed from original probability 0.1829 to temperature adjusted probability 0.4848. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__szl_Latn changed from original probability 0.0125 to temperature adjusted probability 0.2149. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__tam_Taml changed from original probability 0.4788 to temperature adjusted probability 0.6489. 
2025-10-02 00:25:48 | INFO | lidirl | Label __label__taq_Latn changed from original probability 0.0039 to temperature adjusted probability 0.1511. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__taq_Tfng changed from original probability 0.0024 to temperature adjusted probability 0.1303. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__tat_Cyrl changed from original probability 0.3135 to temperature adjusted probability 0.5708. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__tel_Telu changed from original probability 0.2947 to temperature adjusted probability 0.5602. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__tgk_Cyrl changed from original probability 0.1136 to temperature adjusted probability 0.4196. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__tha_Thai changed from original probability 0.6287 to temperature adjusted probability 0.7047. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__tir_Ethi changed from original probability 0.1802 to temperature adjusted probability 0.4825. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__tpi_Latn changed from original probability 0.1773 to temperature adjusted probability 0.4802. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__tsn_Latn changed from original probability 0.2997 to temperature adjusted probability 0.5630. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__tso_Latn changed from original probability 0.2878 to temperature adjusted probability 0.5561. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__tuk_Latn changed from original probability 0.0703 to temperature adjusted probability 0.3628. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__tum_Latn changed from original probability 0.1023 to temperature adjusted probability 0.4065. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__tur_Latn changed from original probability 1.0008 to temperature adjusted probability 0.8113. 
2025-10-02 00:25:48 | INFO | lidirl | Label __label__twi_Latn changed from original probability 0.2129 to temperature adjusted probability 0.5076. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__uig_Arab changed from original probability 0.0478 to temperature adjusted probability 0.3228. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__ukr_Cyrl changed from original probability 2.2594 to temperature adjusted probability 1.0384. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__umb_Latn changed from original probability 0.0831 to temperature adjusted probability 0.3816. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__urd_Arab changed from original probability 0.3844 to temperature adjusted probability 0.6071. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__uzn_Latn changed from original probability 0.9297 to temperature adjusted probability 0.7934. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__vec_Latn changed from original probability 0.0530 to temperature adjusted probability 0.3331. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__vie_Latn changed from original probability 1.5446 to temperature adjusted probability 0.9254. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__war_Latn changed from original probability 0.5861 to temperature adjusted probability 0.6899. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__wol_Latn changed from original probability 0.0135 to temperature adjusted probability 0.2202. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__xho_Latn changed from original probability 0.3516 to temperature adjusted probability 0.5909. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__ydd_Hebr changed from original probability 0.0249 to temperature adjusted probability 0.2649. 2025-10-02 00:25:48 | INFO | lidirl | Label __label__yor_Latn changed from original probability 0.2124 to temperature adjusted probability 0.5072. 
2025-10-02 00:25:48 | INFO | lidirl | Label __label__yue_Hant changed from original probability 0.0652 to temperature adjusted probability 0.3546.
2025-10-02 00:25:48 | INFO | lidirl | Label __label__zgh_Tfng changed from original probability 0.0036 to temperature adjusted probability 0.1474.
2025-10-02 00:25:48 | INFO | lidirl | Label __label__zsm_Latn changed from original probability 0.5400 to temperature adjusted probability 0.6730.
2025-10-02 00:25:48 | INFO | lidirl | Label __label__zul_Latn changed from original probability 0.3669 to temperature adjusted probability 0.5986.
2025-10-02 00:25:48 | INFO | lidirl | Set to 1 parallel workers.
2025-10-02 00:25:48 | INFO | lidirl | Creating a valid dataset with bytes inputs.
2025-10-02 00:25:48 | INFO | lidirl | Reading training files to calculate sampling probability...
2025-10-02 00:25:48 | INFO | lidirl | Set to 1 parallel workers.
2025-10-02 00:25:48 | INFO | lidirl | FlatModel(
  (encoder): LSTMBlock(
    (model): Sequential(
      (0): MinLSTMCell(
        (linear_f): Linear(in_features=256, out_features=256, bias=True)
        (linear_i): Linear(in_features=256, out_features=256, bias=True)
        (linear_h): Linear(in_features=256, out_features=256, bias=True)
      )
      (1): LayerNorm((256,), eps=1e-05, elementwise_affine=True)
      (2): GELU(approximate='none')
      (3): Dropout(p=0.1, inplace=False)
    )
  )
  (embed_layer): Embedding(258, 256)
  (proj): ProjectionLayer(
    (proj): Sequential(
      (0): Linear(in_features=256, out_features=200, bias=True)
    )
  )
)
INFO: GPU available: True (cuda), used: True
2025-10-02 00:26:00 | INFO | lightning.pytorch.utilities.rank_zero | GPU available: True (cuda), used: True
INFO: TPU available: False, using: 0 TPU cores
2025-10-02 00:26:00 | INFO | lightning.pytorch.utilities.rank_zero | TPU available: False, using: 0 TPU cores
INFO: HPU available: False, using: 0 HPUs
2025-10-02 00:26:00 | INFO | lightning.pytorch.utilities.rank_zero | HPU available: False, using: 0 HPUs
INFO: LOCAL_RANK: 0 - CUDA_VISIBLE_DEVICES: [0]
2025-10-02 00:26:33 | INFO | lightning.pytorch.accelerators.cuda | LOCAL_RANK: 0 - CUDA_VISIBLE_DEVICES: [0]
INFO:
  | Name        | Type            | Params | Mode
--------------------------------------------------------
0 | encoder     | LSTMBlock       | 197 K  | train
1 | embed_layer | Embedding       | 66.0 K | train
2 | proj        | ProjectionLayer | 51.4 K | train
--------------------------------------------------------
315 K     Trainable params
0         Non-trainable params
315 K     Total params
1.261     Total estimated model params size (MB)
13        Modules in train mode
0         Modules in eval mode
2025-10-02 00:26:33 | INFO | lightning.pytorch.callbacks.model_summary |
  | Name        | Type            | Params | Mode
--------------------------------------------------------
0 | encoder     | LSTMBlock       | 197 K  | train
1 | embed_layer | Embedding       | 66.0 K | train
2 | proj        | ProjectionLayer | 51.4 K | train
--------------------------------------------------------
315 K     Trainable params
0         Non-trainable params
315 K     Total params
1.261     Total estimated model params size (MB)
13        Modules in train mode
0         Modules in eval mode
2025-10-02 00:26:33 | INFO | lidirl | {'vocab_size': 258, 'label_size': 200, 'embed_dim': 256, 'max_length': 1024, 'multilabel': True, 'montecarlo_layer': False, 'vocab': {'0': 0, '1': 1, '2': 2, '3': 3, '4': 4, '5': 5, '6': 6, '7': 7, '8': 8, '9': 9, '10': 10, '11': 11, '12': 12, '13': 13, '14': 14, '15': 15, '16': 16, '17': 17, '18': 18, '19': 19, '20': 20, '21': 21, '22': 22, '23': 23, '24': 24, '25': 25, '26': 26, '27': 27, '28': 28, '29': 29, '30': 30, '31': 31, '32': 32, '33': 33, '34': 34, '35': 35, '36': 36, '37': 37, '38': 38, '39': 39, '40': 40, '41': 41, '42': 42, '43': 43, '44': 44, '45': 45, '46': 46, '47': 47, '48': 48, '49': 49, '50': 50, '51': 51, '52': 52, '53': 53, '54': 54, '55': 55, '56': 56, '57': 57, '58': 58, '59': 59, '60': 60, '61': 61, '62': 62, '63': 63, '64': 64, '65': 65, '66': 66, '67': 67, '68': 68, '69': 69, '70': 70, '71': 71, '72': 72, '73': 73, '74': 74, '75': 75, '76': 76, '77': 77, '78': 78, '79': 79, '80': 
80, '81': 81, '82': 82, '83': 83, '84': 84, '85': 85, '86': 86, '87': 87, '88': 88, '89': 89, '90': 90, '91': 91, '92': 92, '93': 93, '94': 94, '95': 95, '96': 96, '97': 97, '98': 98, '99': 99, '100': 100, '101': 101, '102': 102, '103': 103, '104': 104, '105': 105, '106': 106, '107': 107, '108': 108, '109': 109, '110': 110, '111': 111, '112': 112, '113': 113, '114': 114, '115': 115, '116': 116, '117': 117, '118': 118, '119': 119, '120': 120, '121': 121, '122': 122, '123': 123, '124': 124, '125': 125, '126': 126, '127': 127, '128': 128, '129': 129, '130': 130, '131': 131, '132': 132, '133': 133, '134': 134, '135': 135, '136': 136, '137': 137, '138': 138, '139': 139, '140': 140, '141': 141, '142': 142, '143': 143, '144': 144, '145': 145, '146': 146, '147': 147, '148': 148, '149': 149, '150': 150, '151': 151, '152': 152, '153': 153, '154': 154, '155': 155, '156': 156, '157': 157, '158': 158, '159': 159, '160': 160, '161': 161, '162': 162, '163': 163, '164': 164, '165': 165, '166': 166, '167': 167, '168': 168, '169': 169, '170': 170, '171': 171, '172': 172, '173': 173, '174': 174, '175': 175, '176': 176, '177': 177, '178': 178, '179': 179, '180': 180, '181': 181, '182': 182, '183': 183, '184': 184, '185': 185, '186': 186, '187': 187, '188': 188, '189': 189, '190': 190, '191': 191, '192': 192, '193': 193, '194': 194, '195': 195, '196': 196, '197': 197, '198': 198, '199': 199, '200': 200, '201': 201, '202': 202, '203': 203, '204': 204, '205': 205, '206': 206, '207': 207, '208': 208, '209': 209, '210': 210, '211': 211, '212': 212, '213': 213, '214': 214, '215': 215, '216': 216, '217': 217, '218': 218, '219': 219, '220': 220, '221': 221, '222': 222, '223': 223, '224': 224, '225': 225, '226': 226, '227': 227, '228': 228, '229': 229, '230': 230, '231': 231, '232': 232, '233': 233, '234': 234, '235': 235, '236': 236, '237': 237, '238': 238, '239': 239, '240': 240, '241': 241, '242': 242, '243': 243, '244': 244, '245': 245, '246': 246, '247': 247, '248': 248, '249': 249, 
'250': 250, '251': 251, '252': 252, '253': 253, '254': 254, '255': 255, '[PAD]': 256, '[UNK]': 257}, 'labels': {'__label__ace_Arab': 0, '__label__ace_Latn': 1, '__label__acm_Arab': 2, '__label__acq_Arab': 3, '__label__aeb_Arab': 4, '__label__afr_Latn': 5, '__label__als_Latn': 6, '__label__amh_Ethi': 7, '__label__apc_Arab': 8, '__label__arb_Arab': 9, '__label__ars_Arab': 10, '__label__ary_Arab': 11, '__label__arz_Arab': 12, '__label__asm_Beng': 13, '__label__ast_Latn': 14, '__label__awa_Deva': 15, '__label__ayr_Latn': 16, '__label__azb_Arab': 17, '__label__azj_Latn': 18, '__label__bak_Cyrl': 19, '__label__bam_Latn': 20, '__label__ban_Latn': 21, '__label__bel_Cyrl': 22, '__label__bem_Latn': 23, '__label__ben_Beng': 24, '__label__bho_Deva': 25, '__label__bjn_Arab': 26, '__label__bjn_Latn': 27, '__label__bod_Tibt': 28, '__label__bos_Latn': 29, '__label__bug_Latn': 30, '__label__bul_Cyrl': 31, '__label__cat_Latn': 32, '__label__ceb_Latn': 33, '__label__ces_Latn': 34, '__label__cjk_Latn': 35, '__label__ckb_Arab': 36, '__label__cmn_Hans': 37, '__label__cmn_Hant': 38, '__label__crh_Latn': 39, '__label__cym_Latn': 40, '__label__dan_Latn': 41, '__label__deu_Latn': 42, '__label__dik_Latn': 43, '__label__dyu_Latn': 44, '__label__dzo_Tibt': 45, '__label__ekk_Latn': 46, '__label__ell_Grek': 47, '__label__eng_Latn': 48, '__label__epo_Latn': 49, '__label__eus_Latn': 50, '__label__ewe_Latn': 51, '__label__fao_Latn': 52, '__label__fij_Latn': 53, '__label__fil_Latn': 54, '__label__fin_Latn': 55, '__label__fon_Latn': 56, '__label__fra_Latn': 57, '__label__fur_Latn': 58, '__label__fuv_Latn': 59, '__label__gaz_Latn': 60, '__label__gla_Latn': 61, '__label__gle_Latn': 62, '__label__glg_Latn': 63, '__label__gug_Latn': 64, '__label__guj_Gujr': 65, '__label__hat_Latn': 66, '__label__hau_Latn': 67, '__label__heb_Hebr': 68, '__label__hin_Deva': 69, '__label__hne_Deva': 70, '__label__hrv_Latn': 71, '__label__hun_Latn': 72, '__label__hye_Armn': 73, '__label__ibo_Latn': 74, '__label__ilo_Latn': 
75, '__label__ind_Latn': 76, '__label__isl_Latn': 77, '__label__ita_Latn': 78, '__label__jav_Latn': 79, '__label__jpn_Jpan': 80, '__label__kab_Latn': 81, '__label__kac_Latn': 82, '__label__kam_Latn': 83, '__label__kan_Knda': 84, '__label__kas_Arab': 85, '__label__kas_Deva': 86, '__label__kat_Geor': 87, '__label__kaz_Cyrl': 88, '__label__kbp_Latn': 89, '__label__kea_Latn': 90, '__label__khk_Cyrl': 91, '__label__khm_Khmr': 92, '__label__kik_Latn': 93, '__label__kin_Latn': 94, '__label__kir_Cyrl': 95, '__label__kmb_Latn': 96, '__label__kmr_Latn': 97, '__label__knc_Arab': 98, '__label__knc_Latn': 99, '__label__kor_Hang': 100, '__label__ktu_Latn': 101, '__label__lao_Laoo': 102, '__label__lij_Latn': 103, '__label__lim_Latn': 104, '__label__lin_Latn': 105, '__label__lit_Latn': 106, '__label__lmo_Latn': 107, '__label__ltg_Latn': 108, '__label__ltz_Latn': 109, '__label__lua_Latn': 110, '__label__lug_Latn': 111, '__label__luo_Latn': 112, '__label__lus_Latn': 113, '__label__lvs_Latn': 114, '__label__mag_Deva': 115, '__label__mai_Deva': 116, '__label__mal_Mlym': 117, '__label__mar_Deva': 118, '__label__min_Latn': 119, '__label__mkd_Cyrl': 120, '__label__mlt_Latn': 121, '__label__mni_Beng': 122, '__label__mos_Latn': 123, '__label__mri_Latn': 124, '__label__mya_Mymr': 125, '__label__nld_Latn': 126, '__label__nno_Latn': 127, '__label__nob_Latn': 128, '__label__npi_Deva': 129, '__label__nso_Latn': 130, '__label__nus_Latn': 131, '__label__nya_Latn': 132, '__label__oci_Latn': 133, '__label__ory_Orya': 134, '__label__pag_Latn': 135, '__label__pan_Guru': 136, '__label__pap_Latn': 137, '__label__pbt_Arab': 138, '__label__pes_Arab': 139, '__label__plt_Latn': 140, '__label__pol_Latn': 141, '__label__por_Latn': 142, '__label__prs_Arab': 143, '__label__quy_Latn': 144, '__label__ron_Latn': 145, '__label__run_Latn': 146, '__label__rus_Cyrl': 147, '__label__sag_Latn': 148, '__label__san_Deva': 149, '__label__sat_Olck': 150, '__label__scn_Latn': 151, '__label__shn_Mymr': 152, 
'__label__sin_Sinh': 153, '__label__slk_Latn': 154, '__label__slv_Latn': 155, '__label__smo_Latn': 156, '__label__sna_Latn': 157, '__label__snd_Arab': 158, '__label__som_Latn': 159, '__label__sot_Latn': 160, '__label__spa_Latn': 161, '__label__srd_Latn': 162, '__label__srp_Cyrl': 163, '__label__ssw_Latn': 164, '__label__sun_Latn': 165, '__label__swe_Latn': 166, '__label__swh_Latn': 167, '__label__szl_Latn': 168, '__label__tam_Taml': 169, '__label__taq_Latn': 170, '__label__taq_Tfng': 171, '__label__tat_Cyrl': 172, '__label__tel_Telu': 173, '__label__tgk_Cyrl': 174, '__label__tha_Thai': 175, '__label__tir_Ethi': 176, '__label__tpi_Latn': 177, '__label__tsn_Latn': 178, '__label__tso_Latn': 179, '__label__tuk_Latn': 180, '__label__tum_Latn': 181, '__label__tur_Latn': 182, '__label__twi_Latn': 183, '__label__uig_Arab': 184, '__label__ukr_Cyrl': 185, '__label__umb_Latn': 186, '__label__urd_Arab': 187, '__label__uzn_Latn': 188, '__label__vec_Latn': 189, '__label__vie_Latn': 190, '__label__war_Latn': 191, '__label__wol_Latn': 192, '__label__xho_Latn': 193, '__label__ydd_Hebr': 194, '__label__yor_Latn': 195, '__label__yue_Hant': 196, '__label__zgh_Tfng': 197, '__label__zsm_Latn': 198, '__label__zul_Latn': 199}, 'num_layers': 1, 'bidirectional': False, 'architecture': 'lstm', 'config': 'config.yaml', 'data_path': '/exp/rwicks/langid/build-data/huggingface/filtered+openlid-v2+ext', 'augmentations': ['antspeak=1', 'ngrams=1', 'hashtag=1', 'short=1', 'spongebob=1', 'codeswitch=1', 'leetspeak=1', 'cyrillic=1', 'replaceurl=1', 'addurl=1', 'html=1', 'addemojis=1', 'replaceemojis=1', 'delete=1', 'add=1', 'swap=1', 'allcaps=1', 'email=1', 'whitespace=1', 'accenting=1', 'binarization=1'], 'augmentation_probability': 0.0, 'temperature': 3.3, 'input_type': 'bytes', 'batch_size': 32, 'num_workers': 1, 'outdir': '/exp/rwicks/langid/exp/filtered+openlid-v2+ext/augless/flat/lstm/Hidden_LARGE_Embed_LARGE_NLayer_SMALL_LR_0.001/models', 'checkpoint': None, 'experiment_version': None, 
'tb_dir': '/exp/rwicks/langid/exp/filtered+openlid-v2+ext/augless/flat/lstm/Hidden_LARGE_Embed_LARGE_NLayer_SMALL_LR_0.001/tensorboard', 'save_top_k': 10, 'save_checkpoint_last': True, 'log_interval': 512, 'lr': 0.001, 'cpu': False, 'min_epochs': 0, 'max_epochs': -1, 'update_interval': 16, 'lr_step_interval': 100, 'validation_interval': 8192, 'patience': inf, 'max_updates': 250000.0, 'seed': 141414, 'compile': False, 'scheduler': 'schedule_free', 'warmup_steps': 0, 'gamma': 0.0, 'alpha': -1.0, 'pred_type': 'multilabel', 'hidden_dim': 256, 'dropout': 0.1, 'layout': 'flat'} /home/hltcoe/rwicks/.conda/envs/nicklid/lib/python3.10/site-packages/lightning/pytorch/trainer/connectors/data_connector.py:425: The 'val_dataloader' does not have many workers which may be a bottleneck. Consider increasing the value of the `num_workers` argument` to `num_workers=39` in the `DataLoader` to improve performance. /home/hltcoe/rwicks/.conda/envs/nicklid/lib/python3.10/site-packages/lightning/pytorch/utilities/data.py:123: Your `IterableDataset` has `__len__` defined. In combination with multi-process data loading (when num_workers > 1), `__len__` could be inaccurate if each worker is not configured independently to avoid having duplicate data. /home/hltcoe/rwicks/.conda/envs/nicklid/lib/python3.10/site-packages/lightning/pytorch/trainer/connectors/data_connector.py:425: The 'train_dataloader' does not have many workers which may be a bottleneck. Consider increasing the value of the `num_workers` argument` to `num_workers=39` in the `DataLoader` to improve performance. 
2025-10-02 01:41:46 | INFO | lidirl | {"batch_idx_step": 8191.0, "num_updates_step": 512.0, "train_loss_step": 0.01941743493080139, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 02:51:10 | INFO | lidirl | {"batch_idx_step": 16383.0, "num_updates_step": 1024.0, "train_loss_step": 0.009770467877388, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 04:04:10 | INFO | lidirl | {"batch_idx_step": 24575.0, "num_updates_step": 1536.0, "train_loss_step": 0.006942494306713343, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 05:02:01 | INFO | lidirl | {"batch_idx_step": 32767.0, "num_updates_step": 2048.0, "train_loss_step": 0.005953986197710037, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 06:06:26 | INFO | lidirl | {"batch_idx_step": 40959.0, "num_updates_step": 2560.0, "train_loss_step": 0.005498938262462616, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 07:30:33 | INFO | lidirl | {"batch_idx_step": 49151.0, "num_updates_step": 3072.0, "train_loss_step": 0.0051994710229337215, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 09:15:26 | INFO | lidirl | {"batch_idx_step": 57343.0, "num_updates_step": 3584.0, "train_loss_step": 0.004996065050363541, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 11:18:31 | INFO | lidirl | {"batch_idx_step": 65535.0, "num_updates_step": 4096.0, "train_loss_step": 0.004827307537198067, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 12:45:30 | INFO | lidirl | {"batch_idx_step": 73727.0, "num_updates_step": 4608.0, "train_loss_step": 0.004767696373164654, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 13:56:52 | INFO | lidirl | {"batch_idx_step": 81919.0, 
"num_updates_step": 5120.0, "train_loss_step": 0.004642277490347624, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 15:31:30 | INFO | lidirl | {"batch_idx_step": 90111.0, "num_updates_step": 5632.0, "train_loss_step": 0.004584399051964283, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 16:15:17 | INFO | lidirl | {"batch_idx_step": 98303.0, "num_updates_step": 6144.0, "train_loss_step": 0.0045361751690506935, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 16:24:52 | INFO | lidirl | {"batch_idx_step": 106495.0, "num_updates_step": 6656.0, "train_loss_step": 0.004464586265385151, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 16:29:28 | INFO | lidirl | {"batch_idx_step": 114687.0, "num_updates_step": 7168.0, "train_loss_step": 0.004420810844749212, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 16:32:46 | INFO | lidirl | {"batch_idx_step": 122879.0, "num_updates_step": 7680.0, "train_loss_step": 0.004420328885316849, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 16:35:59 | INFO | lidirl | {"batch_idx_step": 131071.0, "num_updates_step": 8192.0, "train_loss_step": 0.004353038035333157, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 16:38:29 | INFO | lidirl | {"val_loss_total": 332106.34375, "val_examples_total": 219340.0, "val_f1_total": 0.7614367604255676, "val_loss_dev/floresplus": 332106.34375, "val_examples_dev/floresplus": 219340.0, "val_f1_dev/floresplus": 0.7614367604255676, "epoch": 0} 2025-10-02 16:41:29 | INFO | lidirl | {"batch_idx_step": 139263.0, "num_updates_step": 8704.0, "train_loss_step": 0.0043507227674126625, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 16:44:20 | INFO | lidirl | {"batch_idx_step": 147455.0, 
"num_updates_step": 9216.0, "train_loss_step": 0.004307538270950317, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 16:47:17 | INFO | lidirl | {"batch_idx_step": 155647.0, "num_updates_step": 9728.0, "train_loss_step": 0.00427871011197567, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 16:50:05 | INFO | lidirl | {"batch_idx_step": 163839.0, "num_updates_step": 10240.0, "train_loss_step": 0.004268428776413202, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 16:52:48 | INFO | lidirl | {"batch_idx_step": 172031.0, "num_updates_step": 10752.0, "train_loss_step": 0.004239294677972794, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 16:55:39 | INFO | lidirl | {"batch_idx_step": 180223.0, "num_updates_step": 11264.0, "train_loss_step": 0.004250483121722937, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 16:58:24 | INFO | lidirl | {"batch_idx_step": 188415.0, "num_updates_step": 11776.0, "train_loss_step": 0.004200596362352371, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 17:01:07 | INFO | lidirl | {"batch_idx_step": 196607.0, "num_updates_step": 12288.0, "train_loss_step": 0.004188165999948978, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 17:03:54 | INFO | lidirl | {"batch_idx_step": 204799.0, "num_updates_step": 12800.0, "train_loss_step": 0.004152348265051842, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 17:06:38 | INFO | lidirl | {"batch_idx_step": 212991.0, "num_updates_step": 13312.0, "train_loss_step": 0.0041763633489608765, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 17:09:20 | INFO | lidirl | {"batch_idx_step": 221183.0, "num_updates_step": 13824.0, "train_loss_step": 
0.0041552213951945305, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 17:12:03 | INFO | lidirl | {"batch_idx_step": 229375.0, "num_updates_step": 14336.0, "train_loss_step": 0.004163771402090788, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 17:14:46 | INFO | lidirl | {"batch_idx_step": 237567.0, "num_updates_step": 14848.0, "train_loss_step": 0.00413028709590435, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 17:17:29 | INFO | lidirl | {"batch_idx_step": 245759.0, "num_updates_step": 15360.0, "train_loss_step": 0.004140691831707954, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 17:20:12 | INFO | lidirl | {"batch_idx_step": 253951.0, "num_updates_step": 15872.0, "train_loss_step": 0.004111051559448242, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 17:22:55 | INFO | lidirl | {"batch_idx_step": 262143.0, "num_updates_step": 16384.0, "train_loss_step": 0.004105329979211092, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 17:25:15 | INFO | lidirl | {"val_loss_total": 329784.71875, "val_examples_total": 219340.0, "val_f1_total": 0.7709320783615112, "val_loss_dev/floresplus": 329784.71875, "val_examples_dev/floresplus": 219340.0, "val_f1_dev/floresplus": 0.7709320783615112, "epoch": 0} 2025-10-02 17:27:56 | INFO | lidirl | {"batch_idx_step": 270335.0, "num_updates_step": 16896.0, "train_loss_step": 0.004058386664837599, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 17:30:38 | INFO | lidirl | {"batch_idx_step": 278527.0, "num_updates_step": 17408.0, "train_loss_step": 0.004102254752069712, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 17:33:19 | INFO | lidirl | {"batch_idx_step": 286719.0, "num_updates_step": 17920.0, 
"train_loss_step": 0.00408757571130991, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 17:36:02 | INFO | lidirl | {"batch_idx_step": 294911.0, "num_updates_step": 18432.0, "train_loss_step": 0.004073678515851498, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 17:38:43 | INFO | lidirl | {"batch_idx_step": 303103.0, "num_updates_step": 18944.0, "train_loss_step": 0.004056871403008699, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 17:41:25 | INFO | lidirl | {"batch_idx_step": 311295.0, "num_updates_step": 19456.0, "train_loss_step": 0.004072197712957859, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 17:44:07 | INFO | lidirl | {"batch_idx_step": 319487.0, "num_updates_step": 19968.0, "train_loss_step": 0.004039428662508726, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 17:46:49 | INFO | lidirl | {"batch_idx_step": 327679.0, "num_updates_step": 20480.0, "train_loss_step": 0.00402473658323288, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 17:49:29 | INFO | lidirl | {"batch_idx_step": 335871.0, "num_updates_step": 20992.0, "train_loss_step": 0.0040382337756454945, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 17:52:12 | INFO | lidirl | {"batch_idx_step": 344063.0, "num_updates_step": 21504.0, "train_loss_step": 0.004014882724732161, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 17:54:57 | INFO | lidirl | {"batch_idx_step": 352255.0, "num_updates_step": 22016.0, "train_loss_step": 0.004038569517433643, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 17:57:38 | INFO | lidirl | {"batch_idx_step": 360447.0, "num_updates_step": 22528.0, "train_loss_step": 0.004002938512712717, 
"train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 18:00:23 | INFO | lidirl | {"batch_idx_step": 368639.0, "num_updates_step": 23040.0, "train_loss_step": 0.004009871277958155, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 18:03:05 | INFO | lidirl | {"batch_idx_step": 376831.0, "num_updates_step": 23552.0, "train_loss_step": 0.004001563414931297, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 18:05:49 | INFO | lidirl | {"batch_idx_step": 385023.0, "num_updates_step": 24064.0, "train_loss_step": 0.004006559029221535, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 18:08:31 | INFO | lidirl | {"batch_idx_step": 393215.0, "num_updates_step": 24576.0, "train_loss_step": 0.004041020292788744, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 18:10:55 | INFO | lidirl | {"val_loss_total": 331421.4375, "val_examples_total": 219340.0, "val_f1_total": 0.773709237575531, "val_loss_dev/floresplus": 331421.4375, "val_examples_dev/floresplus": 219340.0, "val_f1_dev/floresplus": 0.773709237575531, "epoch": 0} 2025-10-02 18:13:37 | INFO | lidirl | {"batch_idx_step": 401407.0, "num_updates_step": 25088.0, "train_loss_step": 0.0039925421588122845, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 18:16:19 | INFO | lidirl | {"batch_idx_step": 409599.0, "num_updates_step": 25600.0, "train_loss_step": 0.003983519971370697, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 18:19:02 | INFO | lidirl | {"batch_idx_step": 417791.0, "num_updates_step": 26112.0, "train_loss_step": 0.003973681014031172, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 18:21:43 | INFO | lidirl | {"batch_idx_step": 425983.0, "num_updates_step": 26624.0, "train_loss_step": 
0.00396771403029561, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 18:24:27 | INFO | lidirl | {"batch_idx_step": 434175.0, "num_updates_step": 27136.0, "train_loss_step": 0.0039902604185044765, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 18:27:07 | INFO | lidirl | {"batch_idx_step": 442367.0, "num_updates_step": 27648.0, "train_loss_step": 0.00398779334500432, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 18:29:51 | INFO | lidirl | {"batch_idx_step": 450559.0, "num_updates_step": 28160.0, "train_loss_step": 0.003956412896513939, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 18:32:32 | INFO | lidirl | {"batch_idx_step": 458751.0, "num_updates_step": 28672.0, "train_loss_step": 0.00396580807864666, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 18:35:16 | INFO | lidirl | {"batch_idx_step": 466943.0, "num_updates_step": 29184.0, "train_loss_step": 0.003940432332456112, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 18:37:57 | INFO | lidirl | {"batch_idx_step": 475135.0, "num_updates_step": 29696.0, "train_loss_step": 0.003968800883740187, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 18:40:39 | INFO | lidirl | {"batch_idx_step": 483327.0, "num_updates_step": 30208.0, "train_loss_step": 0.00396288838237524, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 18:43:20 | INFO | lidirl | {"batch_idx_step": 491519.0, "num_updates_step": 30720.0, "train_loss_step": 0.003954924643039703, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 18:46:01 | INFO | lidirl | {"batch_idx_step": 499711.0, "num_updates_step": 31232.0, "train_loss_step": 0.003940288908779621, "train_examples_step": 262144.0, 
"lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 18:48:42 | INFO | lidirl | {"batch_idx_step": 507903.0, "num_updates_step": 31744.0, "train_loss_step": 0.003934894688427448, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 18:51:23 | INFO | lidirl | {"batch_idx_step": 516095.0, "num_updates_step": 32256.0, "train_loss_step": 0.003936111461371183, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 18:54:05 | INFO | lidirl | {"batch_idx_step": 524287.0, "num_updates_step": 32768.0, "train_loss_step": 0.003915409557521343, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 18:56:23 | INFO | lidirl | {"val_loss_total": 328740.40625, "val_examples_total": 219340.0, "val_f1_total": 0.7770471572875977, "val_loss_dev/floresplus": 328740.40625, "val_examples_dev/floresplus": 219340.0, "val_f1_dev/floresplus": 0.7770471572875977, "epoch": 0} 2025-10-02 18:59:06 | INFO | lidirl | {"batch_idx_step": 532479.0, "num_updates_step": 33280.0, "train_loss_step": 0.003943779971450567, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 19:01:47 | INFO | lidirl | {"batch_idx_step": 540671.0, "num_updates_step": 33792.0, "train_loss_step": 0.00391749432310462, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 19:04:28 | INFO | lidirl | {"batch_idx_step": 548863.0, "num_updates_step": 34304.0, "train_loss_step": 0.003936567343771458, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 19:07:09 | INFO | lidirl | {"batch_idx_step": 557055.0, "num_updates_step": 34816.0, "train_loss_step": 0.003936244640499353, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 19:09:50 | INFO | lidirl | {"batch_idx_step": 565247.0, "num_updates_step": 35328.0, "train_loss_step": 0.00392134441062808, "train_examples_step": 
262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 19:12:31 | INFO | lidirl | {"batch_idx_step": 573439.0, "num_updates_step": 35840.0, "train_loss_step": 0.003953898791223764, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 19:15:13 | INFO | lidirl | {"batch_idx_step": 581631.0, "num_updates_step": 36352.0, "train_loss_step": 0.003930938430130482, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 19:17:54 | INFO | lidirl | {"batch_idx_step": 589823.0, "num_updates_step": 36864.0, "train_loss_step": 0.003914704546332359, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 19:20:35 | INFO | lidirl | {"batch_idx_step": 598015.0, "num_updates_step": 37376.0, "train_loss_step": 0.003932399675250053, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 19:26:25 | INFO | lidirl | {"batch_idx_step": 606207.0, "num_updates_step": 37888.0, "train_loss_step": 0.0039315964095294476, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 19:33:56 | INFO | lidirl | {"batch_idx_step": 614399.0, "num_updates_step": 38400.0, "train_loss_step": 0.003926742356270552, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 19:40:00 | INFO | lidirl | {"batch_idx_step": 622591.0, "num_updates_step": 38912.0, "train_loss_step": 0.003900042036548257, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 19:44:28 | INFO | lidirl | {"batch_idx_step": 630783.0, "num_updates_step": 39424.0, "train_loss_step": 0.0038917241618037224, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 19:48:25 | INFO | lidirl | {"batch_idx_step": 638975.0, "num_updates_step": 39936.0, "train_loss_step": 0.003910171799361706, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 
0} 2025-10-02 19:52:01 | INFO | lidirl | {"batch_idx_step": 647167.0, "num_updates_step": 40448.0, "train_loss_step": 0.003892576089128852, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 19:55:22 | INFO | lidirl | {"batch_idx_step": 655359.0, "num_updates_step": 40960.0, "train_loss_step": 0.003907828126102686, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 19:57:40 | INFO | lidirl | {"val_loss_total": 330858.0625, "val_examples_total": 219340.0, "val_f1_total": 0.7784448862075806, "val_loss_dev/floresplus": 330858.0625, "val_examples_dev/floresplus": 219340.0, "val_f1_dev/floresplus": 0.7784448862075806, "epoch": 0} 2025-10-02 20:00:56 | INFO | lidirl | {"batch_idx_step": 663551.0, "num_updates_step": 41472.0, "train_loss_step": 0.003899885108694434, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 20:04:02 | INFO | lidirl | {"batch_idx_step": 671743.0, "num_updates_step": 41984.0, "train_loss_step": 0.0039176540449261665, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 20:07:07 | INFO | lidirl | {"batch_idx_step": 679935.0, "num_updates_step": 42496.0, "train_loss_step": 0.0039141555316746235, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 20:10:09 | INFO | lidirl | {"batch_idx_step": 688127.0, "num_updates_step": 43008.0, "train_loss_step": 0.0039047712925821543, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 20:13:08 | INFO | lidirl | {"batch_idx_step": 696319.0, "num_updates_step": 43520.0, "train_loss_step": 0.003922988194972277, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 20:16:07 | INFO | lidirl | {"batch_idx_step": 704511.0, "num_updates_step": 44032.0, "train_loss_step": 0.003882376244291663, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, 
"epoch": 0} 2025-10-02 20:19:03 | INFO | lidirl | {"batch_idx_step": 712703.0, "num_updates_step": 44544.0, "train_loss_step": 0.003898849245160818, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 20:21:56 | INFO | lidirl | {"batch_idx_step": 720895.0, "num_updates_step": 45056.0, "train_loss_step": 0.0038985491264611483, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 20:24:51 | INFO | lidirl | {"batch_idx_step": 729087.0, "num_updates_step": 45568.0, "train_loss_step": 0.0038689912762492895, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 20:27:41 | INFO | lidirl | {"batch_idx_step": 737279.0, "num_updates_step": 46080.0, "train_loss_step": 0.003892247099429369, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 20:30:35 | INFO | lidirl | {"batch_idx_step": 745471.0, "num_updates_step": 46592.0, "train_loss_step": 0.0038843252696096897, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 20:33:23 | INFO | lidirl | {"batch_idx_step": 753663.0, "num_updates_step": 47104.0, "train_loss_step": 0.003895190078765154, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 20:36:14 | INFO | lidirl | {"batch_idx_step": 761855.0, "num_updates_step": 47616.0, "train_loss_step": 0.003864327445626259, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 20:39:01 | INFO | lidirl | {"batch_idx_step": 770047.0, "num_updates_step": 48128.0, "train_loss_step": 0.0038764802739024162, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 20:41:48 | INFO | lidirl | {"batch_idx_step": 778239.0, "num_updates_step": 48640.0, "train_loss_step": 0.003882929217070341, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 20:44:38 | INFO | lidirl | 
{"batch_idx_step": 786431.0, "num_updates_step": 49152.0, "train_loss_step": 0.0038843024522066116, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 20:46:57 | INFO | lidirl | {"val_loss_total": 331694.75, "val_examples_total": 219340.0, "val_f1_total": 0.7794760465621948, "val_loss_dev/floresplus": 331694.75, "val_examples_dev/floresplus": 219340.0, "val_f1_dev/floresplus": 0.7794760465621948, "epoch": 0} 2025-10-02 20:49:45 | INFO | lidirl | {"batch_idx_step": 794623.0, "num_updates_step": 49664.0, "train_loss_step": 0.0038817960303276777, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 20:52:31 | INFO | lidirl | {"batch_idx_step": 802815.0, "num_updates_step": 50176.0, "train_loss_step": 0.0038653856609016657, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 20:55:19 | INFO | lidirl | {"batch_idx_step": 811007.0, "num_updates_step": 50688.0, "train_loss_step": 0.003866136772558093, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 20:58:04 | INFO | lidirl | {"batch_idx_step": 819199.0, "num_updates_step": 51200.0, "train_loss_step": 0.0038952019531279802, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 21:00:51 | INFO | lidirl | {"batch_idx_step": 827391.0, "num_updates_step": 51712.0, "train_loss_step": 0.0038553273770958185, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 21:03:36 | INFO | lidirl | {"batch_idx_step": 835583.0, "num_updates_step": 52224.0, "train_loss_step": 0.0038592747878283262, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 21:06:22 | INFO | lidirl | {"batch_idx_step": 843775.0, "num_updates_step": 52736.0, "train_loss_step": 0.003863926511257887, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 21:09:06 | INFO | 
lidirl | {"batch_idx_step": 851967.0, "num_updates_step": 53248.0, "train_loss_step": 0.0038575860671699047, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 21:11:51 | INFO | lidirl | {"batch_idx_step": 860159.0, "num_updates_step": 53760.0, "train_loss_step": 0.003878008807078004, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 21:14:34 | INFO | lidirl | {"batch_idx_step": 868351.0, "num_updates_step": 54272.0, "train_loss_step": 0.003864018013700843, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 21:17:19 | INFO | lidirl | {"batch_idx_step": 876543.0, "num_updates_step": 54784.0, "train_loss_step": 0.003872720990329981, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 21:20:04 | INFO | lidirl | {"batch_idx_step": 884735.0, "num_updates_step": 55296.0, "train_loss_step": 0.0038548647426068783, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 21:22:47 | INFO | lidirl | {"batch_idx_step": 892927.0, "num_updates_step": 55808.0, "train_loss_step": 0.0038593800272792578, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 21:25:31 | INFO | lidirl | {"batch_idx_step": 901119.0, "num_updates_step": 56320.0, "train_loss_step": 0.003853890812024474, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 21:28:15 | INFO | lidirl | {"batch_idx_step": 909311.0, "num_updates_step": 56832.0, "train_loss_step": 0.003850474487990141, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 21:30:57 | INFO | lidirl | {"batch_idx_step": 917503.0, "num_updates_step": 57344.0, "train_loss_step": 0.0038710765074938536, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 21:33:35 | INFO | lidirl | {"val_loss_total": 330348.75, 
"val_examples_total": 219340.0, "val_f1_total": 0.7804418206214905, "val_loss_dev/floresplus": 330348.75, "val_examples_dev/floresplus": 219340.0, "val_f1_dev/floresplus": 0.7804418206214905, "epoch": 0} 2025-10-02 21:36:18 | INFO | lidirl | {"batch_idx_step": 925695.0, "num_updates_step": 57856.0, "train_loss_step": 0.00384996528737247, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 21:39:02 | INFO | lidirl | {"batch_idx_step": 933887.0, "num_updates_step": 58368.0, "train_loss_step": 0.0038491226732730865, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 21:41:45 | INFO | lidirl | {"batch_idx_step": 942079.0, "num_updates_step": 58880.0, "train_loss_step": 0.003854387905448675, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 21:44:27 | INFO | lidirl | {"batch_idx_step": 950271.0, "num_updates_step": 59392.0, "train_loss_step": 0.003833531169220805, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 21:47:10 | INFO | lidirl | {"batch_idx_step": 958463.0, "num_updates_step": 59904.0, "train_loss_step": 0.003827014472335577, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 21:49:52 | INFO | lidirl | {"batch_idx_step": 966655.0, "num_updates_step": 60416.0, "train_loss_step": 0.0038227983750402927, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 21:52:35 | INFO | lidirl | {"batch_idx_step": 974847.0, "num_updates_step": 60928.0, "train_loss_step": 0.0038258228451013565, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 21:55:17 | INFO | lidirl | {"batch_idx_step": 983039.0, "num_updates_step": 61440.0, "train_loss_step": 0.0038260039873421192, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 21:58:01 | INFO | lidirl | {"batch_idx_step": 
991231.0, "num_updates_step": 61952.0, "train_loss_step": 0.003854670561850071, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 22:00:42 | INFO | lidirl | {"batch_idx_step": 999423.0, "num_updates_step": 62464.0, "train_loss_step": 0.0038460679352283478, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 22:03:25 | INFO | lidirl | {"batch_idx_step": 1007615.0, "num_updates_step": 62976.0, "train_loss_step": 0.003809545887634158, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 22:06:07 | INFO | lidirl | {"batch_idx_step": 1015807.0, "num_updates_step": 63488.0, "train_loss_step": 0.0038186954334378242, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 22:08:50 | INFO | lidirl | {"batch_idx_step": 1023999.0, "num_updates_step": 64000.0, "train_loss_step": 0.0038094185292720795, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 22:11:32 | INFO | lidirl | {"batch_idx_step": 1032191.0, "num_updates_step": 64512.0, "train_loss_step": 0.0038284717593342066, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 22:14:14 | INFO | lidirl | {"batch_idx_step": 1040383.0, "num_updates_step": 65024.0, "train_loss_step": 0.0038367165252566338, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 22:16:56 | INFO | lidirl | {"batch_idx_step": 1048575.0, "num_updates_step": 65536.0, "train_loss_step": 0.003823033068329096, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 22:19:14 | INFO | lidirl | {"val_loss_total": 330842.90625, "val_examples_total": 219340.0, "val_f1_total": 0.7809673547744751, "val_loss_dev/floresplus": 330842.90625, "val_examples_dev/floresplus": 219340.0, "val_f1_dev/floresplus": 0.7809673547744751, "epoch": 0} 2025-10-02 22:21:57 | INFO | lidirl | 
{"batch_idx_step": 1056767.0, "num_updates_step": 66048.0, "train_loss_step": 0.003816890763118863, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 22:24:39 | INFO | lidirl | {"batch_idx_step": 1064959.0, "num_updates_step": 66560.0, "train_loss_step": 0.003820710349828005, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 22:27:21 | INFO | lidirl | {"batch_idx_step": 1073151.0, "num_updates_step": 67072.0, "train_loss_step": 0.0038420986384153366, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 22:30:03 | INFO | lidirl | {"batch_idx_step": 1081343.0, "num_updates_step": 67584.0, "train_loss_step": 0.003806302323937416, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 22:32:45 | INFO | lidirl | {"batch_idx_step": 1089535.0, "num_updates_step": 68096.0, "train_loss_step": 0.0038387638051062822, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 22:35:28 | INFO | lidirl | {"batch_idx_step": 1097727.0, "num_updates_step": 68608.0, "train_loss_step": 0.003823555540293455, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 22:38:10 | INFO | lidirl | {"batch_idx_step": 1105919.0, "num_updates_step": 69120.0, "train_loss_step": 0.0038146681617945433, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 22:41:02 | INFO | lidirl | {"batch_idx_step": 1114111.0, "num_updates_step": 69632.0, "train_loss_step": 0.003829699708148837, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 22:44:02 | INFO | lidirl | {"batch_idx_step": 1122303.0, "num_updates_step": 70144.0, "train_loss_step": 0.0038318634033203125, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 22:47:03 | INFO | lidirl | {"batch_idx_step": 1130495.0, 
"num_updates_step": 70656.0, "train_loss_step": 0.003809950780123472, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 22:49:57 | INFO | lidirl | {"batch_idx_step": 1138687.0, "num_updates_step": 71168.0, "train_loss_step": 0.00379787664860487, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 22:52:51 | INFO | lidirl | {"batch_idx_step": 1146879.0, "num_updates_step": 71680.0, "train_loss_step": 0.003823194419965148, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 22:55:40 | INFO | lidirl | {"batch_idx_step": 1155071.0, "num_updates_step": 72192.0, "train_loss_step": 0.003844862338155508, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 22:58:29 | INFO | lidirl | {"batch_idx_step": 1163263.0, "num_updates_step": 72704.0, "train_loss_step": 0.0038012005388736725, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 23:01:16 | INFO | lidirl | {"batch_idx_step": 1171455.0, "num_updates_step": 73216.0, "train_loss_step": 0.003792304079979658, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 23:04:04 | INFO | lidirl | {"batch_idx_step": 1179647.0, "num_updates_step": 73728.0, "train_loss_step": 0.003809321904554963, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 23:06:21 | INFO | lidirl | {"val_loss_total": 329987.1875, "val_examples_total": 219340.0, "val_f1_total": 0.7816880941390991, "val_loss_dev/floresplus": 329987.1875, "val_examples_dev/floresplus": 219340.0, "val_f1_dev/floresplus": 0.7816880941390991, "epoch": 0} 2025-10-02 23:09:08 | INFO | lidirl | {"batch_idx_step": 1187839.0, "num_updates_step": 74240.0, "train_loss_step": 0.0038226875476539135, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 23:11:51 | INFO | lidirl | 
{"batch_idx_step": 1196031.0, "num_updates_step": 74752.0, "train_loss_step": 0.0038063840474933386, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 23:14:37 | INFO | lidirl | {"batch_idx_step": 1204223.0, "num_updates_step": 75264.0, "train_loss_step": 0.003815682837739587, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 23:17:20 | INFO | lidirl | {"batch_idx_step": 1212415.0, "num_updates_step": 75776.0, "train_loss_step": 0.00379800982773304, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 23:20:05 | INFO | lidirl | {"batch_idx_step": 1220607.0, "num_updates_step": 76288.0, "train_loss_step": 0.003796621924266219, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 23:22:48 | INFO | lidirl | {"batch_idx_step": 1228799.0, "num_updates_step": 76800.0, "train_loss_step": 0.003810303518548608, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 23:25:31 | INFO | lidirl | {"batch_idx_step": 1236991.0, "num_updates_step": 77312.0, "train_loss_step": 0.0038029716815799475, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 23:28:17 | INFO | lidirl | {"batch_idx_step": 1245183.0, "num_updates_step": 77824.0, "train_loss_step": 0.0038324114866554737, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 23:31:00 | INFO | lidirl | {"batch_idx_step": 1253375.0, "num_updates_step": 78336.0, "train_loss_step": 0.0038238554261624813, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 23:33:45 | INFO | lidirl | {"batch_idx_step": 1261567.0, "num_updates_step": 78848.0, "train_loss_step": 0.0037886197678744793, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 23:36:27 | INFO | lidirl | {"batch_idx_step": 1269759.0, 
"num_updates_step": 79360.0, "train_loss_step": 0.003808210138231516, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 23:39:09 | INFO | lidirl | {"batch_idx_step": 1277951.0, "num_updates_step": 79872.0, "train_loss_step": 0.003813448827713728, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 23:41:52 | INFO | lidirl | {"batch_idx_step": 1286143.0, "num_updates_step": 80384.0, "train_loss_step": 0.0038050333969295025, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 23:44:36 | INFO | lidirl | {"batch_idx_step": 1294335.0, "num_updates_step": 80896.0, "train_loss_step": 0.003793597687035799, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 23:47:18 | INFO | lidirl | {"batch_idx_step": 1302527.0, "num_updates_step": 81408.0, "train_loss_step": 0.003790620481595397, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 23:50:01 | INFO | lidirl | {"batch_idx_step": 1310719.0, "num_updates_step": 81920.0, "train_loss_step": 0.003762042848393321, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 23:52:21 | INFO | lidirl | {"val_loss_total": 329773.71875, "val_examples_total": 219340.0, "val_f1_total": 0.7821275591850281, "val_loss_dev/floresplus": 329773.71875, "val_examples_dev/floresplus": 219340.0, "val_f1_dev/floresplus": 0.7821275591850281, "epoch": 0} 2025-10-02 23:55:03 | INFO | lidirl | {"batch_idx_step": 1318911.0, "num_updates_step": 82432.0, "train_loss_step": 0.0038113389164209366, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-02 23:57:44 | INFO | lidirl | {"batch_idx_step": 1327103.0, "num_updates_step": 82944.0, "train_loss_step": 0.003787762252613902, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 00:00:27 | INFO | lidirl | 
{"batch_idx_step": 1335295.0, "num_updates_step": 83456.0, "train_loss_step": 0.0037894907873123884, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 00:03:08 | INFO | lidirl | {"batch_idx_step": 1343487.0, "num_updates_step": 83968.0, "train_loss_step": 0.003806293709203601, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 00:05:50 | INFO | lidirl | {"batch_idx_step": 1351679.0, "num_updates_step": 84480.0, "train_loss_step": 0.0037939464673399925, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 00:08:32 | INFO | lidirl | {"batch_idx_step": 1359871.0, "num_updates_step": 84992.0, "train_loss_step": 0.0038005097303539515, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 00:11:14 | INFO | lidirl | {"batch_idx_step": 1368063.0, "num_updates_step": 85504.0, "train_loss_step": 0.003814101917669177, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 00:13:57 | INFO | lidirl | {"batch_idx_step": 1376255.0, "num_updates_step": 86016.0, "train_loss_step": 0.0037768061738461256, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 00:18:32 | INFO | lidirl | {"batch_idx_step": 1384447.0, "num_updates_step": 86528.0, "train_loss_step": 0.0037935900036245584, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 00:25:52 | INFO | lidirl | {"batch_idx_step": 1392639.0, "num_updates_step": 87040.0, "train_loss_step": 0.003791320836171508, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 00:36:52 | INFO | lidirl | {"batch_idx_step": 1400831.0, "num_updates_step": 87552.0, "train_loss_step": 0.003789072623476386, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 00:45:55 | INFO | lidirl | {"batch_idx_step": 1409023.0, 
"num_updates_step": 88064.0, "train_loss_step": 0.0037844846956431866, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 00:54:38 | INFO | lidirl | {"batch_idx_step": 1417215.0, "num_updates_step": 88576.0, "train_loss_step": 0.0038224642630666494, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 01:00:54 | INFO | lidirl | {"batch_idx_step": 1425407.0, "num_updates_step": 89088.0, "train_loss_step": 0.0038239206187427044, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 01:06:55 | INFO | lidirl | {"batch_idx_step": 1433599.0, "num_updates_step": 89600.0, "train_loss_step": 0.0037788990885019302, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 01:14:25 | INFO | lidirl | {"batch_idx_step": 1441791.0, "num_updates_step": 90112.0, "train_loss_step": 0.0037976964376866817, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 01:16:39 | INFO | lidirl | {"val_loss_total": 329850.25, "val_examples_total": 219340.0, "val_f1_total": 0.7828937768936157, "val_loss_dev/floresplus": 329850.25, "val_examples_dev/floresplus": 219340.0, "val_f1_dev/floresplus": 0.7828937768936157, "epoch": 0} 2025-10-03 01:25:30 | INFO | lidirl | {"batch_idx_step": 1449983.0, "num_updates_step": 90624.0, "train_loss_step": 0.0037546248640865088, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 01:32:18 | INFO | lidirl | {"batch_idx_step": 1458175.0, "num_updates_step": 91136.0, "train_loss_step": 0.0037891422398388386, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 01:38:21 | INFO | lidirl | {"batch_idx_step": 1466367.0, "num_updates_step": 91648.0, "train_loss_step": 0.0037767011672258377, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 01:45:03 | INFO | lidirl | 
{"batch_idx_step": 1474559.0, "num_updates_step": 92160.0, "train_loss_step": 0.003775248071178794, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 01:51:14 | INFO | lidirl | {"batch_idx_step": 1482751.0, "num_updates_step": 92672.0, "train_loss_step": 0.0038091428577899933, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 01:56:49 | INFO | lidirl | {"batch_idx_step": 1490943.0, "num_updates_step": 93184.0, "train_loss_step": 0.003791087307035923, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 02:02:01 | INFO | lidirl | {"batch_idx_step": 1499135.0, "num_updates_step": 93696.0, "train_loss_step": 0.0038051048759371042, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 02:07:41 | INFO | lidirl | {"batch_idx_step": 1507327.0, "num_updates_step": 94208.0, "train_loss_step": 0.003771039191633463, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 02:13:37 | INFO | lidirl | {"batch_idx_step": 1515519.0, "num_updates_step": 94720.0, "train_loss_step": 0.003763975342735648, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 02:18:59 | INFO | lidirl | {"batch_idx_step": 1523711.0, "num_updates_step": 95232.0, "train_loss_step": 0.0037604202516376972, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 02:23:34 | INFO | lidirl | {"batch_idx_step": 1531903.0, "num_updates_step": 95744.0, "train_loss_step": 0.003821220248937607, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 02:29:09 | INFO | lidirl | {"batch_idx_step": 1540095.0, "num_updates_step": 96256.0, "train_loss_step": 0.0037753356155008078, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 02:34:00 | INFO | lidirl | {"batch_idx_step": 1548287.0, 
"num_updates_step": 96768.0, "train_loss_step": 0.003805435262620449, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 02:39:11 | INFO | lidirl | {"batch_idx_step": 1556479.0, "num_updates_step": 97280.0, "train_loss_step": 0.003749431576579809, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 02:44:54 | INFO | lidirl | {"batch_idx_step": 1564671.0, "num_updates_step": 97792.0, "train_loss_step": 0.003767519723623991, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 02:50:44 | INFO | lidirl | {"batch_idx_step": 1572863.0, "num_updates_step": 98304.0, "train_loss_step": 0.0037655597552657127, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 02:53:01 | INFO | lidirl | {"val_loss_total": 328947.6875, "val_examples_total": 219340.0, "val_f1_total": 0.783616840839386, "val_loss_dev/floresplus": 328947.6875, "val_examples_dev/floresplus": 219340.0, "val_f1_dev/floresplus": 0.783616840839386, "epoch": 0} 2025-10-03 02:58:01 | INFO | lidirl | {"batch_idx_step": 1581055.0, "num_updates_step": 98816.0, "train_loss_step": 0.0037529414985328913, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 03:02:34 | INFO | lidirl | {"batch_idx_step": 1589247.0, "num_updates_step": 99328.0, "train_loss_step": 0.003754756646230817, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 03:06:40 | INFO | lidirl | {"batch_idx_step": 1597439.0, "num_updates_step": 99840.0, "train_loss_step": 0.0037794869858771563, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 03:11:05 | INFO | lidirl | {"batch_idx_step": 1605631.0, "num_updates_step": 100352.0, "train_loss_step": 0.0037616880144923925, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 03:15:10 | INFO | lidirl | 
{"batch_idx_step": 1613823.0, "num_updates_step": 100864.0, "train_loss_step": 0.003762617940083146, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 03:20:14 | INFO | lidirl | {"batch_idx_step": 1622015.0, "num_updates_step": 101376.0, "train_loss_step": 0.0037624770775437355, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 03:26:50 | INFO | lidirl | {"batch_idx_step": 1630207.0, "num_updates_step": 101888.0, "train_loss_step": 0.0037990293931216, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 03:33:03 | INFO | lidirl | {"batch_idx_step": 1638399.0, "num_updates_step": 102400.0, "train_loss_step": 0.0037790413480252028, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 03:37:53 | INFO | lidirl | {"batch_idx_step": 1646591.0, "num_updates_step": 102912.0, "train_loss_step": 0.003761685686185956, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 03:42:12 | INFO | lidirl | {"batch_idx_step": 1654783.0, "num_updates_step": 103424.0, "train_loss_step": 0.0037549161352217197, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 03:45:55 | INFO | lidirl | {"batch_idx_step": 1662975.0, "num_updates_step": 103936.0, "train_loss_step": 0.003808322362601757, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 03:49:35 | INFO | lidirl | {"batch_idx_step": 1671167.0, "num_updates_step": 104448.0, "train_loss_step": 0.003788028610870242, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 03:53:21 | INFO | lidirl | {"batch_idx_step": 1679359.0, "num_updates_step": 104960.0, "train_loss_step": 0.0037797163240611553, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 03:56:52 | INFO | lidirl | {"batch_idx_step": 1687551.0, 
"num_updates_step": 105472.0, "train_loss_step": 0.0037796818651258945, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 04:00:20 | INFO | lidirl | {"batch_idx_step": 1695743.0, "num_updates_step": 105984.0, "train_loss_step": 0.003769477130845189, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 04:03:59 | INFO | lidirl | {"batch_idx_step": 1703935.0, "num_updates_step": 106496.0, "train_loss_step": 0.0037694894708693027, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 04:06:18 | INFO | lidirl | {"val_loss_total": 328682.21875, "val_examples_total": 219340.0, "val_f1_total": 0.7840186357498169, "val_loss_dev/floresplus": 328682.21875, "val_examples_dev/floresplus": 219340.0, "val_f1_dev/floresplus": 0.7840186357498169, "epoch": 0} 2025-10-03 04:09:49 | INFO | lidirl | {"batch_idx_step": 1712127.0, "num_updates_step": 107008.0, "train_loss_step": 0.0037570868153125048, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 04:13:32 | INFO | lidirl | {"batch_idx_step": 1720319.0, "num_updates_step": 107520.0, "train_loss_step": 0.0037926400545984507, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 04:17:25 | INFO | lidirl | {"batch_idx_step": 1728511.0, "num_updates_step": 108032.0, "train_loss_step": 0.003748785238713026, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 04:21:43 | INFO | lidirl | {"batch_idx_step": 1736703.0, "num_updates_step": 108544.0, "train_loss_step": 0.0037672787439078093, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 04:26:02 | INFO | lidirl | {"batch_idx_step": 1744895.0, "num_updates_step": 109056.0, "train_loss_step": 0.003767135553061962, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 04:30:54 | INFO | lidirl | 
{"batch_idx_step": 1753087.0, "num_updates_step": 109568.0, "train_loss_step": 0.0037465770728886127, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 04:36:23 | INFO | lidirl | {"batch_idx_step": 1761279.0, "num_updates_step": 110080.0, "train_loss_step": 0.003750620409846306, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 04:42:09 | INFO | lidirl | {"batch_idx_step": 1769471.0, "num_updates_step": 110592.0, "train_loss_step": 0.0037827924825251102, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 04:46:47 | INFO | lidirl | {"batch_idx_step": 1777663.0, "num_updates_step": 111104.0, "train_loss_step": 0.003779829014092684, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 04:51:02 | INFO | lidirl | {"batch_idx_step": 1785855.0, "num_updates_step": 111616.0, "train_loss_step": 0.0037489228416234255, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 04:55:13 | INFO | lidirl | {"batch_idx_step": 1794047.0, "num_updates_step": 112128.0, "train_loss_step": 0.00380152789875865, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 04:59:36 | INFO | lidirl | {"batch_idx_step": 1802239.0, "num_updates_step": 112640.0, "train_loss_step": 0.003738656872883439, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 05:03:42 | INFO | lidirl | {"batch_idx_step": 1810431.0, "num_updates_step": 113152.0, "train_loss_step": 0.0037735854275524616, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 05:07:37 | INFO | lidirl | {"batch_idx_step": 1818623.0, "num_updates_step": 113664.0, "train_loss_step": 0.003769523464143276, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 05:11:36 | INFO | lidirl | {"batch_idx_step": 1826815.0, 
"num_updates_step": 114176.0, "train_loss_step": 0.0037407181225717068, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 05:15:07 | INFO | lidirl | {"batch_idx_step": 1835007.0, "num_updates_step": 114688.0, "train_loss_step": 0.003740608459338546, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 05:17:34 | INFO | lidirl | {"val_loss_total": 328799.8125, "val_examples_total": 219340.0, "val_f1_total": 0.7842815518379211, "val_loss_dev/floresplus": 328799.8125, "val_examples_dev/floresplus": 219340.0, "val_f1_dev/floresplus": 0.7842815518379211, "epoch": 0} 2025-10-03 05:30:46 | INFO | lidirl | {"batch_idx_step": 1843199.0, "num_updates_step": 115200.0, "train_loss_step": 0.003767777467146516, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 05:54:57 | INFO | lidirl | {"batch_idx_step": 1851391.0, "num_updates_step": 115712.0, "train_loss_step": 0.0037653835024684668, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 06:14:02 | INFO | lidirl | {"batch_idx_step": 1859583.0, "num_updates_step": 116224.0, "train_loss_step": 0.003746452508494258, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 06:32:53 | INFO | lidirl | {"batch_idx_step": 1867775.0, "num_updates_step": 116736.0, "train_loss_step": 0.0037657981738448143, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 06:48:55 | INFO | lidirl | {"batch_idx_step": 1875967.0, "num_updates_step": 117248.0, "train_loss_step": 0.0037363304290920496, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 07:05:01 | INFO | lidirl | {"batch_idx_step": 1884159.0, "num_updates_step": 117760.0, "train_loss_step": 0.003752517979592085, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 07:24:15 | INFO | lidirl | 
{"batch_idx_step": 1892351.0, "num_updates_step": 118272.0, "train_loss_step": 0.003755110315978527, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 07:46:36 | INFO | lidirl | {"batch_idx_step": 1900543.0, "num_updates_step": 118784.0, "train_loss_step": 0.003770570270717144, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 08:07:35 | INFO | lidirl | {"batch_idx_step": 1908735.0, "num_updates_step": 119296.0, "train_loss_step": 0.003758470294997096, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 08:28:50 | INFO | lidirl | {"batch_idx_step": 1916927.0, "num_updates_step": 119808.0, "train_loss_step": 0.003730116179212928, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 08:51:41 | INFO | lidirl | {"batch_idx_step": 1925119.0, "num_updates_step": 120320.0, "train_loss_step": 0.0037349972408264875, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 09:11:08 | INFO | lidirl | {"batch_idx_step": 1933311.0, "num_updates_step": 120832.0, "train_loss_step": 0.0037293732166290283, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 09:31:33 | INFO | lidirl | {"batch_idx_step": 1941503.0, "num_updates_step": 121344.0, "train_loss_step": 0.0037316284142434597, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 09:59:42 | INFO | lidirl | {"batch_idx_step": 1949695.0, "num_updates_step": 121856.0, "train_loss_step": 0.00375405908562243, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 10:27:32 | INFO | lidirl | {"batch_idx_step": 1957887.0, "num_updates_step": 122368.0, "train_loss_step": 0.003733017249032855, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 11:05:48 | INFO | lidirl | {"batch_idx_step": 1966079.0, 
"num_updates_step": 122880.0, "train_loss_step": 0.0037581438664346933, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 11:08:02 | INFO | lidirl | {"val_loss_total": 328668.90625, "val_examples_total": 219340.0, "val_f1_total": 0.7845580577850342, "val_loss_dev/floresplus": 328668.90625, "val_examples_dev/floresplus": 219340.0, "val_f1_dev/floresplus": 0.7845580577850342, "epoch": 0} 2025-10-03 11:38:35 | INFO | lidirl | {"batch_idx_step": 1974271.0, "num_updates_step": 123392.0, "train_loss_step": 0.0037645986303687096, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 11:45:57 | INFO | lidirl | {"batch_idx_step": 1982463.0, "num_updates_step": 123904.0, "train_loss_step": 0.003765707602724433, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 11:50:05 | INFO | lidirl | {"batch_idx_step": 1990655.0, "num_updates_step": 124416.0, "train_loss_step": 0.0037357117980718613, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 11:53:31 | INFO | lidirl | {"batch_idx_step": 1998847.0, "num_updates_step": 124928.0, "train_loss_step": 0.003756972262635827, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 11:56:35 | INFO | lidirl | {"batch_idx_step": 2007039.0, "num_updates_step": 125440.0, "train_loss_step": 0.003736974438652396, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 11:59:29 | INFO | lidirl | {"batch_idx_step": 2015231.0, "num_updates_step": 125952.0, "train_loss_step": 0.0037292037159204483, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 12:02:20 | INFO | lidirl | {"batch_idx_step": 2023423.0, "num_updates_step": 126464.0, "train_loss_step": 0.0037724075373262167, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 12:05:09 | INFO | lidirl | 
{"batch_idx_step": 2031615.0, "num_updates_step": 126976.0, "train_loss_step": 0.003740719286724925, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 12:07:59 | INFO | lidirl | {"batch_idx_step": 2039807.0, "num_updates_step": 127488.0, "train_loss_step": 0.003750072792172432, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 12:10:45 | INFO | lidirl | {"batch_idx_step": 2047999.0, "num_updates_step": 128000.0, "train_loss_step": 0.003725440474227071, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 12:13:34 | INFO | lidirl | {"batch_idx_step": 2056191.0, "num_updates_step": 128512.0, "train_loss_step": 0.003720210399478674, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 12:16:19 | INFO | lidirl | {"batch_idx_step": 2064383.0, "num_updates_step": 129024.0, "train_loss_step": 0.0037547433748841286, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 12:19:03 | INFO | lidirl | {"batch_idx_step": 2072575.0, "num_updates_step": 129536.0, "train_loss_step": 0.003759344108402729, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 12:21:46 | INFO | lidirl | {"batch_idx_step": 2080767.0, "num_updates_step": 130048.0, "train_loss_step": 0.0037558863405138254, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 12:24:29 | INFO | lidirl | {"batch_idx_step": 2088959.0, "num_updates_step": 130560.0, "train_loss_step": 0.003745598252862692, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 12:27:13 | INFO | lidirl | {"batch_idx_step": 2097151.0, "num_updates_step": 131072.0, "train_loss_step": 0.0037321546114981174, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 12:29:27 | INFO | lidirl | {"val_loss_total": 329073.5625, 
"val_examples_total": 219340.0, "val_f1_total": 0.7845032215118408, "val_loss_dev/floresplus": 329073.5625, "val_examples_dev/floresplus": 219340.0, "val_f1_dev/floresplus": 0.7845032215118408, "epoch": 0} 2025-10-03 12:32:10 | INFO | lidirl | {"batch_idx_step": 2105343.0, "num_updates_step": 131584.0, "train_loss_step": 0.0037332416977733374, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 12:34:52 | INFO | lidirl | {"batch_idx_step": 2113535.0, "num_updates_step": 132096.0, "train_loss_step": 0.003757418366149068, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 12:37:36 | INFO | lidirl | {"batch_idx_step": 2121727.0, "num_updates_step": 132608.0, "train_loss_step": 0.0037573545705527067, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 12:40:21 | INFO | lidirl | {"batch_idx_step": 2129919.0, "num_updates_step": 133120.0, "train_loss_step": 0.003713746787980199, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 12:43:05 | INFO | lidirl | {"batch_idx_step": 2138111.0, "num_updates_step": 133632.0, "train_loss_step": 0.003736605867743492, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 12:45:49 | INFO | lidirl | {"batch_idx_step": 2146303.0, "num_updates_step": 134144.0, "train_loss_step": 0.0037338663823902607, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 12:48:31 | INFO | lidirl | {"batch_idx_step": 2154495.0, "num_updates_step": 134656.0, "train_loss_step": 0.0037199785001575947, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 12:51:14 | INFO | lidirl | {"batch_idx_step": 2162687.0, "num_updates_step": 135168.0, "train_loss_step": 0.003724888665601611, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 12:53:57 | INFO | lidirl | 
{"batch_idx_step": 2170879.0, "num_updates_step": 135680.0, "train_loss_step": 0.0037287105806171894, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 12:56:39 | INFO | lidirl | {"batch_idx_step": 2179071.0, "num_updates_step": 136192.0, "train_loss_step": 0.003767957678064704, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 12:59:19 | INFO | lidirl | {"batch_idx_step": 2187263.0, "num_updates_step": 136704.0, "train_loss_step": 0.003754135686904192, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 13:02:01 | INFO | lidirl | {"batch_idx_step": 2195455.0, "num_updates_step": 137216.0, "train_loss_step": 0.0037325574085116386, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 13:04:44 | INFO | lidirl | {"batch_idx_step": 2203647.0, "num_updates_step": 137728.0, "train_loss_step": 0.0037433623801916838, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 13:07:25 | INFO | lidirl | {"batch_idx_step": 2211839.0, "num_updates_step": 138240.0, "train_loss_step": 0.0037147460971027613, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 13:10:09 | INFO | lidirl | {"batch_idx_step": 2220031.0, "num_updates_step": 138752.0, "train_loss_step": 0.003765373956412077, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 13:12:51 | INFO | lidirl | {"batch_idx_step": 2228223.0, "num_updates_step": 139264.0, "train_loss_step": 0.003714290913194418, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 13:15:08 | INFO | lidirl | {"val_loss_total": 328700.5, "val_examples_total": 219340.0, "val_f1_total": 0.7846893668174744, "val_loss_dev/floresplus": 328700.5, "val_examples_dev/floresplus": 219340.0, "val_f1_dev/floresplus": 0.7846893668174744, "epoch": 0} 2025-10-03 13:17:49 
| INFO | lidirl | {"batch_idx_step": 2236415.0, "num_updates_step": 139776.0, "train_loss_step": 0.003737095510587096, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 13:20:33 | INFO | lidirl | {"batch_idx_step": 2244607.0, "num_updates_step": 140288.0, "train_loss_step": 0.003758302191272378, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 13:23:15 | INFO | lidirl | {"batch_idx_step": 2252799.0, "num_updates_step": 140800.0, "train_loss_step": 0.003720228560268879, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 13:25:57 | INFO | lidirl | {"batch_idx_step": 2260991.0, "num_updates_step": 141312.0, "train_loss_step": 0.003695592051371932, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 13:28:38 | INFO | lidirl | {"batch_idx_step": 2269183.0, "num_updates_step": 141824.0, "train_loss_step": 0.0037321927957236767, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 13:31:20 | INFO | lidirl | {"batch_idx_step": 2277375.0, "num_updates_step": 142336.0, "train_loss_step": 0.0037562160287052393, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 13:34:02 | INFO | lidirl | {"batch_idx_step": 2285567.0, "num_updates_step": 142848.0, "train_loss_step": 0.0037188229616731405, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 13:36:43 | INFO | lidirl | {"batch_idx_step": 2293759.0, "num_updates_step": 143360.0, "train_loss_step": 0.003709148382768035, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 13:39:25 | INFO | lidirl | {"batch_idx_step": 2301951.0, "num_updates_step": 143872.0, "train_loss_step": 0.003739644307643175, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 13:42:06 | INFO | lidirl | 
{"batch_idx_step": 2310143.0, "num_updates_step": 144384.0, "train_loss_step": 0.0037241403479129076, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 13:44:47 | INFO | lidirl | {"batch_idx_step": 2318335.0, "num_updates_step": 144896.0, "train_loss_step": 0.0037219389341771603, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 13:47:28 | INFO | lidirl | {"batch_idx_step": 2326527.0, "num_updates_step": 145408.0, "train_loss_step": 0.0037336600944399834, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 13:50:08 | INFO | lidirl | {"batch_idx_step": 2334719.0, "num_updates_step": 145920.0, "train_loss_step": 0.003737621009349823, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 13:52:51 | INFO | lidirl | {"batch_idx_step": 2342911.0, "num_updates_step": 146432.0, "train_loss_step": 0.003729174379259348, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 13:55:31 | INFO | lidirl | {"batch_idx_step": 2351103.0, "num_updates_step": 146944.0, "train_loss_step": 0.003718435764312744, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 13:58:12 | INFO | lidirl | {"batch_idx_step": 2359295.0, "num_updates_step": 147456.0, "train_loss_step": 0.003720894455909729, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 14:00:36 | INFO | lidirl | {"val_loss_total": 328716.53125, "val_examples_total": 219340.0, "val_f1_total": 0.7848162055015564, "val_loss_dev/floresplus": 328716.53125, "val_examples_dev/floresplus": 219340.0, "val_f1_dev/floresplus": 0.7848162055015564, "epoch": 0} 2025-10-03 14:03:18 | INFO | lidirl | {"batch_idx_step": 2367487.0, "num_updates_step": 147968.0, "train_loss_step": 0.003710022661834955, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 
14:05:59 | INFO | lidirl | {"batch_idx_step": 2375679.0, "num_updates_step": 148480.0, "train_loss_step": 0.0037519836332648993, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 14:08:39 | INFO | lidirl | {"batch_idx_step": 2383871.0, "num_updates_step": 148992.0, "train_loss_step": 0.003741105552762747, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 14:11:16 | INFO | lidirl | {"batch_idx_step": 2392063.0, "num_updates_step": 149504.0, "train_loss_step": 0.0037374242674559355, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 14:13:54 | INFO | lidirl | {"batch_idx_step": 2400255.0, "num_updates_step": 150016.0, "train_loss_step": 0.003746636211872101, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 14:16:32 | INFO | lidirl | {"batch_idx_step": 2408447.0, "num_updates_step": 150528.0, "train_loss_step": 0.003739546285942197, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 14:19:10 | INFO | lidirl | {"batch_idx_step": 2416639.0, "num_updates_step": 151040.0, "train_loss_step": 0.003709095297381282, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 14:21:48 | INFO | lidirl | {"batch_idx_step": 2424831.0, "num_updates_step": 151552.0, "train_loss_step": 0.003729583928361535, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 14:24:26 | INFO | lidirl | {"batch_idx_step": 2433023.0, "num_updates_step": 152064.0, "train_loss_step": 0.003697526641190052, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 14:27:04 | INFO | lidirl | {"batch_idx_step": 2441215.0, "num_updates_step": 152576.0, "train_loss_step": 0.0037434143014252186, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 14:29:42 | INFO | lidirl | 
{"batch_idx_step": 2449407.0, "num_updates_step": 153088.0, "train_loss_step": 0.003731453325599432, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 14:32:20 | INFO | lidirl | {"batch_idx_step": 2457599.0, "num_updates_step": 153600.0, "train_loss_step": 0.00372928730212152, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 14:34:58 | INFO | lidirl | {"batch_idx_step": 2465791.0, "num_updates_step": 154112.0, "train_loss_step": 0.003710314864292741, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 14:37:35 | INFO | lidirl | {"batch_idx_step": 2473983.0, "num_updates_step": 154624.0, "train_loss_step": 0.0037051013205200434, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 14:40:13 | INFO | lidirl | {"batch_idx_step": 2482175.0, "num_updates_step": 155136.0, "train_loss_step": 0.0037191726732999086, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 14:42:50 | INFO | lidirl | {"batch_idx_step": 2490367.0, "num_updates_step": 155648.0, "train_loss_step": 0.003690361278131604, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 14:44:58 | INFO | lidirl | {"val_loss_total": 328604.28125, "val_examples_total": 219340.0, "val_f1_total": 0.7849242091178894, "val_loss_dev/floresplus": 328604.28125, "val_examples_dev/floresplus": 219340.0, "val_f1_dev/floresplus": 0.7849242091178894, "epoch": 0} 2025-10-03 14:47:37 | INFO | lidirl | {"batch_idx_step": 2498559.0, "num_updates_step": 156160.0, "train_loss_step": 0.003709074342623353, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 14:50:14 | INFO | lidirl | {"batch_idx_step": 2506751.0, "num_updates_step": 156672.0, "train_loss_step": 0.003699568100273609, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 
14:52:52 | INFO | lidirl | {"batch_idx_step": 2514943.0, "num_updates_step": 157184.0, "train_loss_step": 0.0037341518327593803, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 14:55:29 | INFO | lidirl | {"batch_idx_step": 2523135.0, "num_updates_step": 157696.0, "train_loss_step": 0.0037767719477415085, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 14:58:08 | INFO | lidirl | {"batch_idx_step": 2531327.0, "num_updates_step": 158208.0, "train_loss_step": 0.0037493628915399313, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 15:00:47 | INFO | lidirl | {"batch_idx_step": 2539519.0, "num_updates_step": 158720.0, "train_loss_step": 0.0037461889442056417, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 15:03:25 | INFO | lidirl | {"batch_idx_step": 2547711.0, "num_updates_step": 159232.0, "train_loss_step": 0.003717203624546528, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 15:06:03 | INFO | lidirl | {"batch_idx_step": 2555903.0, "num_updates_step": 159744.0, "train_loss_step": 0.0037157211918383837, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 15:08:41 | INFO | lidirl | {"batch_idx_step": 2564095.0, "num_updates_step": 160256.0, "train_loss_step": 0.00372280809096992, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 15:11:20 | INFO | lidirl | {"batch_idx_step": 2572287.0, "num_updates_step": 160768.0, "train_loss_step": 0.003726536873728037, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 15:14:00 | INFO | lidirl | {"batch_idx_step": 2580479.0, "num_updates_step": 161280.0, "train_loss_step": 0.0037083830684423447, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 15:16:41 | INFO | lidirl | 
{"batch_idx_step": 2588671.0, "num_updates_step": 161792.0, "train_loss_step": 0.0037192886229604483, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 15:19:23 | INFO | lidirl | {"batch_idx_step": 2596863.0, "num_updates_step": 162304.0, "train_loss_step": 0.0037191861774772406, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 15:22:04 | INFO | lidirl | {"batch_idx_step": 2605055.0, "num_updates_step": 162816.0, "train_loss_step": 0.0037087115924805403, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 15:24:45 | INFO | lidirl | {"batch_idx_step": 2613247.0, "num_updates_step": 163328.0, "train_loss_step": 0.003710339078679681, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 15:27:29 | INFO | lidirl | {"batch_idx_step": 2621439.0, "num_updates_step": 163840.0, "train_loss_step": 0.0037059893365949392, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 15:29:46 | INFO | lidirl | {"val_loss_total": 327662.875, "val_examples_total": 219340.0, "val_f1_total": 0.7851393818855286, "val_loss_dev/floresplus": 327662.875, "val_examples_dev/floresplus": 219340.0, "val_f1_dev/floresplus": 0.7851393818855286, "epoch": 0} 2025-10-03 15:41:18 | INFO | lidirl | {"batch_idx_step": 2629631.0, "num_updates_step": 164352.0, "train_loss_step": 0.003738781902939081, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 15:54:21 | INFO | lidirl | {"batch_idx_step": 2637823.0, "num_updates_step": 164864.0, "train_loss_step": 0.003721984103322029, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 16:04:37 | INFO | lidirl | {"batch_idx_step": 2646015.0, "num_updates_step": 165376.0, "train_loss_step": 0.0036986381746828556, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 
16:16:15 | INFO | lidirl | {"batch_idx_step": 2654207.0, "num_updates_step": 165888.0, "train_loss_step": 0.0037263925187289715, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 16:24:50 | INFO | lidirl | {"batch_idx_step": 2662399.0, "num_updates_step": 166400.0, "train_loss_step": 0.0037371625658124685, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 16:32:27 | INFO | lidirl | {"batch_idx_step": 2670591.0, "num_updates_step": 166912.0, "train_loss_step": 0.003732958808541298, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 16:39:03 | INFO | lidirl | {"batch_idx_step": 2678783.0, "num_updates_step": 167424.0, "train_loss_step": 0.0037260630633682013, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 16:44:22 | INFO | lidirl | {"batch_idx_step": 2686975.0, "num_updates_step": 167936.0, "train_loss_step": 0.003700452158227563, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 16:49:15 | INFO | lidirl | {"batch_idx_step": 2695167.0, "num_updates_step": 168448.0, "train_loss_step": 0.0037091155536472797, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 16:53:32 | INFO | lidirl | {"batch_idx_step": 2703359.0, "num_updates_step": 168960.0, "train_loss_step": 0.0037387129850685596, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 16:57:40 | INFO | lidirl | {"batch_idx_step": 2711551.0, "num_updates_step": 169472.0, "train_loss_step": 0.0037119395565241575, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 17:01:26 | INFO | lidirl | {"batch_idx_step": 2719743.0, "num_updates_step": 169984.0, "train_loss_step": 0.0037051383405923843, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 17:05:02 | INFO | lidirl | 
{"batch_idx_step": 2727935.0, "num_updates_step": 170496.0, "train_loss_step": 0.0037034570705145597, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 17:08:41 | INFO | lidirl | {"batch_idx_step": 2736127.0, "num_updates_step": 171008.0, "train_loss_step": 0.0037253061309456825, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 17:12:06 | INFO | lidirl | {"batch_idx_step": 2744319.0, "num_updates_step": 171520.0, "train_loss_step": 0.003703585360199213, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 17:15:29 | INFO | lidirl | {"batch_idx_step": 2752511.0, "num_updates_step": 172032.0, "train_loss_step": 0.003727482631802559, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 17:17:37 | INFO | lidirl | {"val_loss_total": 326909.40625, "val_examples_total": 219340.0, "val_f1_total": 0.7854782342910767, "val_loss_dev/floresplus": 326909.40625, "val_examples_dev/floresplus": 219340.0, "val_f1_dev/floresplus": 0.7854782342910767, "epoch": 0} 2025-10-03 17:20:58 | INFO | lidirl | {"batch_idx_step": 2760703.0, "num_updates_step": 172544.0, "train_loss_step": 0.003708949312567711, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 17:24:11 | INFO | lidirl | {"batch_idx_step": 2768895.0, "num_updates_step": 173056.0, "train_loss_step": 0.003716395702213049, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 17:27:22 | INFO | lidirl | {"batch_idx_step": 2777087.0, "num_updates_step": 173568.0, "train_loss_step": 0.0037424995098263025, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 17:30:26 | INFO | lidirl | {"batch_idx_step": 2785279.0, "num_updates_step": 174080.0, "train_loss_step": 0.003723814385011792, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 
17:33:34 | INFO | lidirl | {"batch_idx_step": 2793471.0, "num_updates_step": 174592.0, "train_loss_step": 0.003696637926623225, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 17:36:38 | INFO | lidirl | {"batch_idx_step": 2801663.0, "num_updates_step": 175104.0, "train_loss_step": 0.003688954282552004, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 17:39:41 | INFO | lidirl | {"batch_idx_step": 2809855.0, "num_updates_step": 175616.0, "train_loss_step": 0.003687127958983183, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 17:42:41 | INFO | lidirl | {"batch_idx_step": 2818047.0, "num_updates_step": 176128.0, "train_loss_step": 0.0037032333202660084, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 17:45:41 | INFO | lidirl | {"batch_idx_step": 2826239.0, "num_updates_step": 176640.0, "train_loss_step": 0.0036871435586363077, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 17:48:40 | INFO | lidirl | {"batch_idx_step": 2834431.0, "num_updates_step": 177152.0, "train_loss_step": 0.0037218171637505293, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 17:51:37 | INFO | lidirl | {"batch_idx_step": 2842623.0, "num_updates_step": 177664.0, "train_loss_step": 0.003715572878718376, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 17:54:36 | INFO | lidirl | {"batch_idx_step": 2850815.0, "num_updates_step": 178176.0, "train_loss_step": 0.003705920884385705, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 17:57:25 | INFO | lidirl | {"batch_idx_step": 2859007.0, "num_updates_step": 178688.0, "train_loss_step": 0.003690620418637991, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 18:00:24 | INFO | lidirl | 
{"batch_idx_step": 2867199.0, "num_updates_step": 179200.0, "train_loss_step": 0.003707395400851965, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 18:03:14 | INFO | lidirl | {"batch_idx_step": 2875391.0, "num_updates_step": 179712.0, "train_loss_step": 0.003699658904224634, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 18:06:07 | INFO | lidirl | {"batch_idx_step": 2883583.0, "num_updates_step": 180224.0, "train_loss_step": 0.0037200532387942076, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 18:08:17 | INFO | lidirl | {"val_loss_total": 326598.3125, "val_examples_total": 219340.0, "val_f1_total": 0.7857795357704163, "val_loss_dev/floresplus": 326598.3125, "val_examples_dev/floresplus": 219340.0, "val_f1_dev/floresplus": 0.7857795357704163, "epoch": 0} 2025-10-03 18:11:08 | INFO | lidirl | {"batch_idx_step": 2891775.0, "num_updates_step": 180736.0, "train_loss_step": 0.0037004691548645496, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 18:14:00 | INFO | lidirl | {"batch_idx_step": 2899967.0, "num_updates_step": 181248.0, "train_loss_step": 0.0036897233221679926, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 18:16:52 | INFO | lidirl | {"batch_idx_step": 2908159.0, "num_updates_step": 181760.0, "train_loss_step": 0.0037112729623913765, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 18:19:45 | INFO | lidirl | {"batch_idx_step": 2916351.0, "num_updates_step": 182272.0, "train_loss_step": 0.0037357239052653313, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 18:22:39 | INFO | lidirl | {"batch_idx_step": 2924543.0, "num_updates_step": 182784.0, "train_loss_step": 0.003714478574693203, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 
18:25:30 | INFO | lidirl | {"batch_idx_step": 2932735.0, "num_updates_step": 183296.0, "train_loss_step": 0.0036766119301319122, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 18:28:23 | INFO | lidirl | {"batch_idx_step": 2940927.0, "num_updates_step": 183808.0, "train_loss_step": 0.0037321888376027346, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 18:31:11 | INFO | lidirl | {"batch_idx_step": 2949119.0, "num_updates_step": 184320.0, "train_loss_step": 0.0037210939917713404, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 18:34:02 | INFO | lidirl | {"batch_idx_step": 2957311.0, "num_updates_step": 184832.0, "train_loss_step": 0.0037017054855823517, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 18:36:51 | INFO | lidirl | {"batch_idx_step": 2965503.0, "num_updates_step": 185344.0, "train_loss_step": 0.003704201430082321, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 18:39:40 | INFO | lidirl | {"batch_idx_step": 2973695.0, "num_updates_step": 185856.0, "train_loss_step": 0.0037044119089841843, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 18:42:32 | INFO | lidirl | {"batch_idx_step": 2981887.0, "num_updates_step": 186368.0, "train_loss_step": 0.003695150138810277, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 18:45:20 | INFO | lidirl | {"batch_idx_step": 2990079.0, "num_updates_step": 186880.0, "train_loss_step": 0.003692356636747718, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 18:48:11 | INFO | lidirl | {"batch_idx_step": 2998271.0, "num_updates_step": 187392.0, "train_loss_step": 0.003676564432680607, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 18:50:57 | INFO | lidirl | 
{"batch_idx_step": 3006463.0, "num_updates_step": 187904.0, "train_loss_step": 0.003711214754730463, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 18:53:50 | INFO | lidirl | {"batch_idx_step": 3014655.0, "num_updates_step": 188416.0, "train_loss_step": 0.0037066342774778605, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 18:56:02 | INFO | lidirl | {"val_loss_total": 326497.46875, "val_examples_total": 219340.0, "val_f1_total": 0.7859606742858887, "val_loss_dev/floresplus": 326497.46875, "val_examples_dev/floresplus": 219340.0, "val_f1_dev/floresplus": 0.7859606742858887, "epoch": 0} 2025-10-03 18:58:51 | INFO | lidirl | {"batch_idx_step": 3022847.0, "num_updates_step": 188928.0, "train_loss_step": 0.0037076284643262625, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 19:01:38 | INFO | lidirl | {"batch_idx_step": 3031039.0, "num_updates_step": 189440.0, "train_loss_step": 0.0036761495284736156, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 19:04:26 | INFO | lidirl | {"batch_idx_step": 3039231.0, "num_updates_step": 189952.0, "train_loss_step": 0.003703604219481349, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 19:07:15 | INFO | lidirl | {"batch_idx_step": 3047423.0, "num_updates_step": 190464.0, "train_loss_step": 0.00369464885443449, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 19:10:00 | INFO | lidirl | {"batch_idx_step": 3055615.0, "num_updates_step": 190976.0, "train_loss_step": 0.0037051173858344555, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 19:12:48 | INFO | lidirl | {"batch_idx_step": 3063807.0, "num_updates_step": 191488.0, "train_loss_step": 0.0037197289057075977, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 
19:15:33 | INFO | lidirl | {"batch_idx_step": 3071999.0, "num_updates_step": 192000.0, "train_loss_step": 0.003689774079248309, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 19:18:20 | INFO | lidirl | {"batch_idx_step": 3080191.0, "num_updates_step": 192512.0, "train_loss_step": 0.003712254576385021, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 19:21:04 | INFO | lidirl | {"batch_idx_step": 3088383.0, "num_updates_step": 193024.0, "train_loss_step": 0.0037138001061975956, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 19:23:50 | INFO | lidirl | {"batch_idx_step": 3096575.0, "num_updates_step": 193536.0, "train_loss_step": 0.0037064743228256702, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 19:26:33 | INFO | lidirl | {"batch_idx_step": 3104767.0, "num_updates_step": 194048.0, "train_loss_step": 0.0037009126972407103, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 19:29:19 | INFO | lidirl | {"batch_idx_step": 3112959.0, "num_updates_step": 194560.0, "train_loss_step": 0.0036761872470378876, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 19:32:02 | INFO | lidirl | {"batch_idx_step": 3121151.0, "num_updates_step": 195072.0, "train_loss_step": 0.003690836252644658, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 19:34:47 | INFO | lidirl | {"batch_idx_step": 3129343.0, "num_updates_step": 195584.0, "train_loss_step": 0.003706124145537615, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 19:37:31 | INFO | lidirl | {"batch_idx_step": 3137535.0, "num_updates_step": 196096.0, "train_loss_step": 0.0036863493733108044, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 19:40:16 | INFO | lidirl | 
{"batch_idx_step": 3145727.0, "num_updates_step": 196608.0, "train_loss_step": 0.0036946870386600494, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 19:42:34 | INFO | lidirl | {"val_loss_total": 326180.34375, "val_examples_total": 219340.0, "val_f1_total": 0.7861031293869019, "val_loss_dev/floresplus": 326180.34375, "val_examples_dev/floresplus": 219340.0, "val_f1_dev/floresplus": 0.7861031293869019, "epoch": 0} 2025-10-03 19:45:18 | INFO | lidirl | {"batch_idx_step": 3153919.0, "num_updates_step": 197120.0, "train_loss_step": 0.0037169933784753084, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 19:48:03 | INFO | lidirl | {"batch_idx_step": 3162111.0, "num_updates_step": 197632.0, "train_loss_step": 0.0037151521537452936, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 19:50:47 | INFO | lidirl | {"batch_idx_step": 3170303.0, "num_updates_step": 198144.0, "train_loss_step": 0.0036942167207598686, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 19:53:33 | INFO | lidirl | {"batch_idx_step": 3178495.0, "num_updates_step": 198656.0, "train_loss_step": 0.0037046605721116066, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 19:56:16 | INFO | lidirl | {"batch_idx_step": 3186687.0, "num_updates_step": 199168.0, "train_loss_step": 0.0036852105986326933, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 19:59:01 | INFO | lidirl | {"batch_idx_step": 3194879.0, "num_updates_step": 199680.0, "train_loss_step": 0.0037136287428438663, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 20:01:46 | INFO | lidirl | {"batch_idx_step": 3203071.0, "num_updates_step": 200192.0, "train_loss_step": 0.003698950167745352, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 
2025-10-03 20:04:29 | INFO | lidirl | {"batch_idx_step": 3211263.0, "num_updates_step": 200704.0, "train_loss_step": 0.0036847044248133898, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 20:07:12 | INFO | lidirl | {"batch_idx_step": 3219455.0, "num_updates_step": 201216.0, "train_loss_step": 0.0037017485592514277, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 20:09:58 | INFO | lidirl | {"batch_idx_step": 3227647.0, "num_updates_step": 201728.0, "train_loss_step": 0.003672330640256405, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 20:12:40 | INFO | lidirl | {"batch_idx_step": 3235839.0, "num_updates_step": 202240.0, "train_loss_step": 0.0037014735862612724, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 20:15:26 | INFO | lidirl | {"batch_idx_step": 3244031.0, "num_updates_step": 202752.0, "train_loss_step": 0.0036669138353317976, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 20:18:08 | INFO | lidirl | {"batch_idx_step": 3252223.0, "num_updates_step": 203264.0, "train_loss_step": 0.0036589966621249914, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 20:20:52 | INFO | lidirl | {"batch_idx_step": 3260415.0, "num_updates_step": 203776.0, "train_loss_step": 0.0036897414829581976, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 20:27:00 | INFO | lidirl | {"batch_idx_step": 3268607.0, "num_updates_step": 204288.0, "train_loss_step": 0.003693646052852273, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 20:36:13 | INFO | lidirl | {"batch_idx_step": 3276799.0, "num_updates_step": 204800.0, "train_loss_step": 0.003710854798555374, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 20:38:25 | INFO | 
lidirl | {"val_loss_total": 325550.9375, "val_examples_total": 219340.0, "val_f1_total": 0.7862730622291565, "val_loss_dev/floresplus": 325550.9375, "val_examples_dev/floresplus": 219340.0, "val_f1_dev/floresplus": 0.7862730622291565, "epoch": 0} 2025-10-03 20:46:31 | INFO | lidirl | {"batch_idx_step": 3284991.0, "num_updates_step": 205312.0, "train_loss_step": 0.003716022241860628, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 20:54:33 | INFO | lidirl | {"batch_idx_step": 3293183.0, "num_updates_step": 205824.0, "train_loss_step": 0.0037178518250584602, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 21:01:38 | INFO | lidirl | {"batch_idx_step": 3301375.0, "num_updates_step": 206336.0, "train_loss_step": 0.003698169020935893, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 21:11:20 | INFO | lidirl | {"batch_idx_step": 3309567.0, "num_updates_step": 206848.0, "train_loss_step": 0.003679533489048481, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 21:20:04 | INFO | lidirl | {"batch_idx_step": 3317759.0, "num_updates_step": 207360.0, "train_loss_step": 0.0036927133332937956, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 21:28:07 | INFO | lidirl | {"batch_idx_step": 3325951.0, "num_updates_step": 207872.0, "train_loss_step": 0.003697403706610203, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 21:35:14 | INFO | lidirl | {"batch_idx_step": 3334143.0, "num_updates_step": 208384.0, "train_loss_step": 0.0036783383693546057, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 21:41:46 | INFO | lidirl | {"batch_idx_step": 3342335.0, "num_updates_step": 208896.0, "train_loss_step": 0.0036905882880091667, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 
2025-10-03 21:47:50 | INFO | lidirl | {"batch_idx_step": 3350527.0, "num_updates_step": 209408.0, "train_loss_step": 0.00369916670024395, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 21:53:45 | INFO | lidirl | {"batch_idx_step": 3358719.0, "num_updates_step": 209920.0, "train_loss_step": 0.0036685853265225887, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 21:59:33 | INFO | lidirl | {"batch_idx_step": 3366911.0, "num_updates_step": 210432.0, "train_loss_step": 0.003691598307341337, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 22:09:23 | INFO | lidirl | {"batch_idx_step": 3375103.0, "num_updates_step": 210944.0, "train_loss_step": 0.003714395919814706, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 22:20:01 | INFO | lidirl | {"batch_idx_step": 3383295.0, "num_updates_step": 211456.0, "train_loss_step": 0.0036500280257314444, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 22:27:51 | INFO | lidirl | {"batch_idx_step": 3391487.0, "num_updates_step": 211968.0, "train_loss_step": 0.0037056938745081425, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 22:33:57 | INFO | lidirl | {"batch_idx_step": 3399679.0, "num_updates_step": 212480.0, "train_loss_step": 0.0036677152384072542, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 22:39:28 | INFO | lidirl | {"batch_idx_step": 3407871.0, "num_updates_step": 212992.0, "train_loss_step": 0.0037109018303453922, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 22:41:46 | INFO | lidirl | {"val_loss_total": 325209.59375, "val_examples_total": 219340.0, "val_f1_total": 0.7864770889282227, "val_loss_dev/floresplus": 325209.59375, "val_examples_dev/floresplus": 219340.0, "val_f1_dev/floresplus": 
0.7864770889282227, "epoch": 0} 2025-10-03 22:46:49 | INFO | lidirl | {"batch_idx_step": 3416063.0, "num_updates_step": 213504.0, "train_loss_step": 0.0036912010982632637, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 22:51:26 | INFO | lidirl | {"batch_idx_step": 3424255.0, "num_updates_step": 214016.0, "train_loss_step": 0.0036938050761818886, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 22:56:33 | INFO | lidirl | {"batch_idx_step": 3432447.0, "num_updates_step": 214528.0, "train_loss_step": 0.003719484666362405, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 23:01:31 | INFO | lidirl | {"batch_idx_step": 3440639.0, "num_updates_step": 215040.0, "train_loss_step": 0.0036947557236999273, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 23:06:29 | INFO | lidirl | {"batch_idx_step": 3448831.0, "num_updates_step": 215552.0, "train_loss_step": 0.003694439772516489, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 23:11:00 | INFO | lidirl | {"batch_idx_step": 3457023.0, "num_updates_step": 216064.0, "train_loss_step": 0.003675695974379778, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 23:15:00 | INFO | lidirl | {"batch_idx_step": 3465215.0, "num_updates_step": 216576.0, "train_loss_step": 0.0036698190961033106, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 23:19:26 | INFO | lidirl | {"batch_idx_step": 3473407.0, "num_updates_step": 217088.0, "train_loss_step": 0.003680574707686901, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 23:24:07 | INFO | lidirl | {"batch_idx_step": 3481599.0, "num_updates_step": 217600.0, "train_loss_step": 0.00371348625048995, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 
2025-10-03 23:28:17 | INFO | lidirl | {"batch_idx_step": 3489791.0, "num_updates_step": 218112.0, "train_loss_step": 0.003674504579976201, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 23:32:28 | INFO | lidirl | {"batch_idx_step": 3497983.0, "num_updates_step": 218624.0, "train_loss_step": 0.0037091306876391172, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 23:36:29 | INFO | lidirl | {"batch_idx_step": 3506175.0, "num_updates_step": 219136.0, "train_loss_step": 0.003691368969157338, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 23:40:37 | INFO | lidirl | {"batch_idx_step": 3514367.0, "num_updates_step": 219648.0, "train_loss_step": 0.003694879589602351, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 23:44:56 | INFO | lidirl | {"batch_idx_step": 3522559.0, "num_updates_step": 220160.0, "train_loss_step": 0.0036632937844842672, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 23:48:55 | INFO | lidirl | {"batch_idx_step": 3530751.0, "num_updates_step": 220672.0, "train_loss_step": 0.0036716414615511894, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 23:52:53 | INFO | lidirl | {"batch_idx_step": 3538943.0, "num_updates_step": 221184.0, "train_loss_step": 0.003681425703689456, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-03 23:55:10 | INFO | lidirl | {"val_loss_total": 324799.3125, "val_examples_total": 219340.0, "val_f1_total": 0.7868512868881226, "val_loss_dev/floresplus": 324799.3125, "val_examples_dev/floresplus": 219340.0, "val_f1_dev/floresplus": 0.7868512868881226, "epoch": 0} 2025-10-03 23:59:34 | INFO | lidirl | {"batch_idx_step": 3547135.0, "num_updates_step": 221696.0, "train_loss_step": 0.003674866398796439, "train_examples_step": 262144.0, "lr_step": 
0.0010000000474974513, "epoch": 0} 2025-10-04 00:03:31 | INFO | lidirl | {"batch_idx_step": 3555327.0, "num_updates_step": 222208.0, "train_loss_step": 0.003671012818813324, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-04 00:08:08 | INFO | lidirl | {"batch_idx_step": 3563519.0, "num_updates_step": 222720.0, "train_loss_step": 0.0036762808449566364, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-04 00:12:31 | INFO | lidirl | {"batch_idx_step": 3571711.0, "num_updates_step": 223232.0, "train_loss_step": 0.0036719145718961954, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-04 00:16:53 | INFO | lidirl | {"batch_idx_step": 3579903.0, "num_updates_step": 223744.0, "train_loss_step": 0.0036696484312415123, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-04 00:20:38 | INFO | lidirl | {"batch_idx_step": 3588095.0, "num_updates_step": 224256.0, "train_loss_step": 0.003674944629892707, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-04 00:24:22 | INFO | lidirl | {"batch_idx_step": 3596287.0, "num_updates_step": 224768.0, "train_loss_step": 0.003688832512125373, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-04 00:27:57 | INFO | lidirl | {"batch_idx_step": 3604479.0, "num_updates_step": 225280.0, "train_loss_step": 0.00366632710210979, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-04 00:32:05 | INFO | lidirl | {"batch_idx_step": 3612671.0, "num_updates_step": 225792.0, "train_loss_step": 0.003669280558824539, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-04 00:36:02 | INFO | lidirl | {"batch_idx_step": 3620863.0, "num_updates_step": 226304.0, "train_loss_step": 0.003657641587778926, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 
2025-10-04 00:39:49 | INFO | lidirl | {"batch_idx_step": 3629055.0, "num_updates_step": 226816.0, "train_loss_step": 0.0036693615838885307, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-04 00:43:50 | INFO | lidirl | {"batch_idx_step": 3637247.0, "num_updates_step": 227328.0, "train_loss_step": 0.003678195411339402, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-04 00:47:41 | INFO | lidirl | {"batch_idx_step": 3645439.0, "num_updates_step": 227840.0, "train_loss_step": 0.0036827761214226484, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-04 00:51:22 | INFO | lidirl | {"batch_idx_step": 3653631.0, "num_updates_step": 228352.0, "train_loss_step": 0.003686046926304698, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-04 00:54:52 | INFO | lidirl | {"batch_idx_step": 3661823.0, "num_updates_step": 228864.0, "train_loss_step": 0.0036588681396096945, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-04 00:58:21 | INFO | lidirl | {"batch_idx_step": 3670015.0, "num_updates_step": 229376.0, "train_loss_step": 0.003695250255987048, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-04 01:00:39 | INFO | lidirl | {"val_loss_total": 324723.09375, "val_examples_total": 219340.0, "val_f1_total": 0.7870413064956665, "val_loss_dev/floresplus": 324723.09375, "val_examples_dev/floresplus": 219340.0, "val_f1_dev/floresplus": 0.7870413064956665, "epoch": 0} 2025-10-04 01:04:09 | INFO | lidirl | {"batch_idx_step": 3678207.0, "num_updates_step": 229888.0, "train_loss_step": 0.0036798978690057993, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-04 01:08:09 | INFO | lidirl | {"batch_idx_step": 3686399.0, "num_updates_step": 230400.0, "train_loss_step": 0.0036936430260539055, "train_examples_step": 262144.0, "lr_step": 
0.0010000000474974513, "epoch": 0} 2025-10-04 01:12:00 | INFO | lidirl | {"batch_idx_step": 3694591.0, "num_updates_step": 230912.0, "train_loss_step": 0.0036715478636324406, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-04 01:15:50 | INFO | lidirl | {"batch_idx_step": 3702783.0, "num_updates_step": 231424.0, "train_loss_step": 0.003691622754558921, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-04 01:19:20 | INFO | lidirl | {"batch_idx_step": 3710975.0, "num_updates_step": 231936.0, "train_loss_step": 0.00366916973143816, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-04 01:22:56 | INFO | lidirl | {"batch_idx_step": 3719167.0, "num_updates_step": 232448.0, "train_loss_step": 0.0036774305626749992, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-04 01:43:23 | INFO | lidirl | {"batch_idx_step": 3727359.0, "num_updates_step": 232960.0, "train_loss_step": 0.0036734757013618946, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-04 02:04:01 | INFO | lidirl | {"batch_idx_step": 3735551.0, "num_updates_step": 233472.0, "train_loss_step": 0.0037102613132447004, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-04 02:31:31 | INFO | lidirl | {"batch_idx_step": 3743743.0, "num_updates_step": 233984.0, "train_loss_step": 0.003680021967738867, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-04 02:56:40 | INFO | lidirl | {"batch_idx_step": 3751935.0, "num_updates_step": 234496.0, "train_loss_step": 0.0037098408211022615, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-04 03:16:58 | INFO | lidirl | {"batch_idx_step": 3760127.0, "num_updates_step": 235008.0, "train_loss_step": 0.003684757510200143, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 
0} 2025-10-04 03:32:45 | INFO | lidirl | {"batch_idx_step": 3768319.0, "num_updates_step": 235520.0, "train_loss_step": 0.0036915952805429697, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-04 03:47:22 | INFO | lidirl | {"batch_idx_step": 3776511.0, "num_updates_step": 236032.0, "train_loss_step": 0.0036802971735596657, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-04 04:05:19 | INFO | lidirl | {"batch_idx_step": 3784703.0, "num_updates_step": 236544.0, "train_loss_step": 0.003653506049886346, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-04 04:21:39 | INFO | lidirl | {"batch_idx_step": 3792895.0, "num_updates_step": 237056.0, "train_loss_step": 0.003680330701172352, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-04 04:41:45 | INFO | lidirl | {"batch_idx_step": 3801087.0, "num_updates_step": 237568.0, "train_loss_step": 0.003656987566500902, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-04 04:43:57 | INFO | lidirl | {"val_loss_total": 324829.21875, "val_examples_total": 219340.0, "val_f1_total": 0.7871948480606079, "val_loss_dev/floresplus": 324829.21875, "val_examples_dev/floresplus": 219340.0, "val_f1_dev/floresplus": 0.7871948480606079, "epoch": 0} 2025-10-04 04:59:22 | INFO | lidirl | {"batch_idx_step": 3809279.0, "num_updates_step": 238080.0, "train_loss_step": 0.0036750915460288525, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-04 05:15:03 | INFO | lidirl | {"batch_idx_step": 3817471.0, "num_updates_step": 238592.0, "train_loss_step": 0.003666353179141879, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-04 05:35:11 | INFO | lidirl | {"batch_idx_step": 3825663.0, "num_updates_step": 239104.0, "train_loss_step": 0.0036929575726389885, "train_examples_step": 262144.0, "lr_step": 
0.0010000000474974513, "epoch": 0} 2025-10-04 05:53:25 | INFO | lidirl | {"batch_idx_step": 3833855.0, "num_updates_step": 239616.0, "train_loss_step": 0.0036594930570572615, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-04 06:09:41 | INFO | lidirl | {"batch_idx_step": 3842047.0, "num_updates_step": 240128.0, "train_loss_step": 0.0036922229919582605, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-04 06:31:32 | INFO | lidirl | {"batch_idx_step": 3850239.0, "num_updates_step": 240640.0, "train_loss_step": 0.003653663443401456, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-04 07:14:31 | INFO | lidirl | {"batch_idx_step": 3858431.0, "num_updates_step": 241152.0, "train_loss_step": 0.003682768903672695, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-04 08:00:01 | INFO | lidirl | {"batch_idx_step": 3866623.0, "num_updates_step": 241664.0, "train_loss_step": 0.003664731979370117, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-04 08:48:09 | INFO | lidirl | {"batch_idx_step": 3874815.0, "num_updates_step": 242176.0, "train_loss_step": 0.003664138726890087, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-04 09:36:09 | INFO | lidirl | {"batch_idx_step": 3883007.0, "num_updates_step": 242688.0, "train_loss_step": 0.0036598765291273594, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-04 10:23:03 | INFO | lidirl | {"batch_idx_step": 3891199.0, "num_updates_step": 243200.0, "train_loss_step": 0.0036774268373847008, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-04 11:16:12 | INFO | lidirl | {"batch_idx_step": 3899391.0, "num_updates_step": 243712.0, "train_loss_step": 0.003692409722134471, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 
0} 2025-10-04 12:10:08 | INFO | lidirl | {"batch_idx_step": 3907583.0, "num_updates_step": 244224.0, "train_loss_step": 0.0036614704877138138, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-04 13:25:25 | INFO | lidirl | {"batch_idx_step": 3915775.0, "num_updates_step": 244736.0, "train_loss_step": 0.0036788901779800653, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-04 14:27:40 | INFO | lidirl | {"batch_idx_step": 3923967.0, "num_updates_step": 245248.0, "train_loss_step": 0.00369038013741374, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-04 14:52:01 | INFO | lidirl | {"batch_idx_step": 3932159.0, "num_updates_step": 245760.0, "train_loss_step": 0.0036893144715577364, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-04 14:54:10 | INFO | lidirl | {"val_loss_total": 324500.9375, "val_examples_total": 219340.0, "val_f1_total": 0.7874125838279724, "val_loss_dev/floresplus": 324500.9375, "val_examples_dev/floresplus": 219340.0, "val_f1_dev/floresplus": 0.7874125838279724, "epoch": 0} 2025-10-04 15:05:30 | INFO | lidirl | {"batch_idx_step": 3940351.0, "num_updates_step": 246272.0, "train_loss_step": 0.0036498168483376503, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-04 15:12:20 | INFO | lidirl | {"batch_idx_step": 3948543.0, "num_updates_step": 246784.0, "train_loss_step": 0.003660190850496292, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-04 15:17:11 | INFO | lidirl | {"batch_idx_step": 3956735.0, "num_updates_step": 247296.0, "train_loss_step": 0.003709452925249934, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-04 15:21:10 | INFO | lidirl | {"batch_idx_step": 3964927.0, "num_updates_step": 247808.0, "train_loss_step": 0.0036988803185522556, "train_examples_step": 262144.0, "lr_step": 
0.0010000000474974513, "epoch": 0} 2025-10-04 15:24:47 | INFO | lidirl | {"batch_idx_step": 3973119.0, "num_updates_step": 248320.0, "train_loss_step": 0.0036748996935784817, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-04 15:28:04 | INFO | lidirl | {"batch_idx_step": 3981311.0, "num_updates_step": 248832.0, "train_loss_step": 0.003679150016978383, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-04 15:31:12 | INFO | lidirl | {"batch_idx_step": 3989503.0, "num_updates_step": 249344.0, "train_loss_step": 0.0036763842217624187, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-04 15:34:12 | INFO | lidirl | {"batch_idx_step": 3997695.0, "num_updates_step": 249856.0, "train_loss_step": 0.0036667983513325453, "train_examples_step": 262144.0, "lr_step": 0.0010000000474974513, "epoch": 0} 2025-10-04 15:35:02 | INFO | lidirl | {"batch_idx_epoch": 2002943.875, "num_updates_epoch": 125184.0, "train_loss_epoch": 0.0038626862224191427, "train_examples_epoch": 262144.0, "lr_epoch": 0.0009999943431466818, "epoch": 0} INFO: `Trainer.fit` stopped: `max_steps=250000.0` reached. 2025-10-04 15:35:02 | INFO | lightning.pytorch.utilities.rank_zero | `Trainer.fit` stopped: `max_steps=250000.0` reached.