| # Supported Languages | |
| WLDetect supports **148 languages** trained on OpenLID-v2 and evaluated on FLORES+. | |
| ## Performance Summary | |
| - **Average Accuracy**: 92.92% | |
| - **Macro Precision**: 0.9399 | |
| - **Macro Recall**: 0.9294 | |
| - **Macro F1**: 0.9274 | |
| - **Languages ≥ 95% accuracy**: 103/148 | |
| - **Languages ≥ 90% accuracy**: 113/148 | |
| ## Language List | |
| Languages sorted by FLORES accuracy (highest to lowest). | |
| | Language Code | Accuracy | Precision | F1 | | |
| |--------------|----------|-----------|-----| | |
| | asm_Beng | 100.00% | 1.0000 | 1.0000 | | |
| | ben_Beng | 100.00% | 0.9930 | 0.9965 | | |
| | cmn_Hant | 100.00% | 0.9379 | 0.9680 | | |
| | dzo_Tibt | 100.00% | 1.0000 | 1.0000 | | |
| | ell_Grek | 100.00% | 0.9970 | 0.9985 | | |
| | guj_Gujr | 100.00% | 1.0000 | 1.0000 | | |
| | heb_Hebr | 100.00% | 1.0000 | 1.0000 | | |
| | hun_Latn | 100.00% | 0.9477 | 0.9732 | | |
| | hye_Armn | 100.00% | 1.0000 | 1.0000 | | |
| | jpn_Jpan | 100.00% | 0.9990 | 0.9995 | | |
| | kan_Knda | 100.00% | 1.0000 | 1.0000 | | |
| | kat_Geor | 100.00% | 1.0000 | 1.0000 | | |
| | khm_Khmr | 100.00% | 1.0000 | 1.0000 | | |
| | kor_Hang | 100.00% | 1.0000 | 1.0000 | | |
| | lao_Laoo | 100.00% | 1.0000 | 1.0000 | | |
| | mal_Mlym | 100.00% | 1.0000 | 1.0000 | | |
| | mya_Mymr | 100.00% | 1.0000 | 1.0000 | | |
| | ory_Orya | 100.00% | 1.0000 | 1.0000 | | |
| | pan_Guru | 100.00% | 1.0000 | 1.0000 | | |
| | pes_Arab | 100.00% | 0.8692 | 0.9300 | | |
| | sat_Olck | 100.00% | 1.0000 | 1.0000 | | |
| | shn_Mymr | 100.00% | 1.0000 | 1.0000 | | |
| | sin_Sinh | 100.00% | 1.0000 | 1.0000 | | |
| | snd_Arab | 100.00% | 0.9970 | 0.9985 | | |
| | tam_Taml | 100.00% | 1.0000 | 1.0000 | | |
| | taq_Tfng | 100.00% | 1.0000 | 1.0000 | | |
| | tel_Telu | 100.00% | 1.0000 | 1.0000 | | |
| | tha_Thai | 100.00% | 1.0000 | 1.0000 | | |
| | uig_Arab | 100.00% | 0.9990 | 0.9995 | | |
| | ukr_Cyrl | 100.00% | 0.9842 | 0.9920 | | |
| | urd_Arab | 100.00% | 0.9130 | 0.9545 | | |
| | vie_Latn | 100.00% | 0.9891 | 0.9945 | | |
| | ckb_Arab | 99.90% | 1.0000 | 0.9995 | | |
| | hin_Deva | 99.90% | 0.5605 | 0.7181 | | |
| | kir_Cyrl | 99.90% | 0.9891 | 0.9940 | | |
| | lit_Latn | 99.90% | 0.9755 | 0.9871 | | |
| | lvs_Latn | 99.90% | 0.8078 | 0.8933 | | |
| | npi_Deva | 99.90% | 0.9970 | 0.9980 | | |
| | rus_Cyrl | 99.90% | 0.9930 | 0.9960 | | |
| | amh_Ethi | 99.80% | 0.9531 | 0.9750 | | |
| | arb_Arab | 99.80% | 0.4802 | 0.6484 | | |
| | mar_Deva | 99.80% | 0.9891 | 0.9935 | | |
| | ron_Latn | 99.80% | 0.9698 | 0.9837 | | |
| | tuk_Latn | 99.80% | 0.9822 | 0.9900 | | |
| | tur_Latn | 99.80% | 0.9679 | 0.9827 | | |
| | eng_Latn | 99.70% | 0.8955 | 0.9435 | | |
| | kik_Latn | 99.70% | 0.9832 | 0.9900 | | |
| | pbt_Arab | 99.70% | 1.0000 | 0.9985 | | |
| | pol_Latn | 99.70% | 0.9395 | 0.9674 | | |
| | als_Latn | 99.60% | 0.9641 | 0.9798 | | |
| | bjn_Arab | 99.60% | 0.9940 | 0.9950 | | |
| | deu_Latn | 99.60% | 0.9697 | 0.9827 | | |
| | khk_Cyrl | 99.60% | 0.9990 | 0.9975 | | |
| | mlt_Latn | 99.60% | 0.9890 | 0.9925 | | |
| | por_Latn | 99.60% | 0.9077 | 0.9498 | | |
| | azj_Latn | 99.50% | 0.7619 | 0.8630 | | |
| | bul_Cyrl | 99.50% | 0.9940 | 0.9945 | | |
| | fra_Latn | 99.50% | 0.9026 | 0.9466 | | |
| | tat_Cyrl | 99.40% | 0.8528 | 0.9180 | | |
| | tgk_Cyrl | 99.40% | 1.0000 | 0.9970 | | |
| | ekk_Latn | 99.30% | 0.9252 | 0.9579 | | |
| | mni_Beng | 99.30% | 1.0000 | 0.9965 | | |
| | fin_Latn | 99.20% | 0.9556 | 0.9734 | | |
| | kaz_Cyrl | 99.20% | 0.9940 | 0.9930 | | |
| | uzn_Latn | 99.20% | 0.8942 | 0.9406 | | |
| | ilo_Latn | 99.00% | 0.7992 | 0.8844 | | |
| | nld_Latn | 99.00% | 0.7711 | 0.8669 | | |
| | slk_Latn | 99.00% | 0.9164 | 0.9518 | | |
| | epo_Latn | 98.90% | 0.9880 | 0.9885 | | |
| | bel_Cyrl | 98.80% | 1.0000 | 0.9939 | | |
| | cym_Latn | 98.80% | 0.9970 | 0.9924 | | |
| | mkd_Cyrl | 98.80% | 0.9572 | 0.9724 | | |
| | tpi_Latn | 98.80% | 0.9919 | 0.9899 | | |
| | hau_Latn | 98.70% | 0.9619 | 0.9743 | | |
| | ita_Latn | 98.70% | 0.8586 | 0.9183 | | |
| | nus_Latn | 98.70% | 1.0000 | 0.9934 | | |
| | eus_Latn | 98.50% | 0.9590 | 0.9718 | | |
| | ewe_Latn | 98.50% | 0.9534 | 0.9689 | | |
| | ces_Latn | 97.99% | 0.9939 | 0.9869 | | |
| | gaz_Latn | 97.89% | 0.9683 | 0.9736 | | |
| | swe_Latn | 97.89% | 0.9597 | 0.9692 | | |
| | bak_Cyrl | 97.79% | 1.0000 | 0.9888 | | |
| | spa_Latn | 97.69% | 0.9137 | 0.9443 | | |
| | ceb_Latn | 97.59% | 0.8935 | 0.9329 | | |
| | cmn_Hans | 97.49% | 1.0000 | 0.9873 | | |
| | slv_Latn | 97.29% | 0.9327 | 0.9524 | | |
| | tsn_Latn | 97.19% | 0.9133 | 0.9417 | | |
| | afr_Latn | 96.89% | 0.9244 | 0.9461 | | |
| | som_Latn | 96.79% | 0.9718 | 0.9698 | | |
| | fij_Latn | 96.69% | 0.9377 | 0.9521 | | |
| | hat_Latn | 96.59% | 0.9008 | 0.9322 | | |
| | gle_Latn | 96.39% | 0.9049 | 0.9335 | | |
| | fil_Latn | 96.29% | 0.9152 | 0.9384 | | |
| | ind_Latn | 96.29% | 0.5000 | 0.6582 | | |
| | lin_Latn | 95.89% | 0.9775 | 0.9681 | | |
| | srp_Cyrl | 95.89% | 0.9927 | 0.9755 | | |
| | yue_Hant | 95.79% | 1.0000 | 0.9785 | | |
| | twi_Latn | 95.74% | 0.9770 | 0.9671 | | |
| | ibo_Latn | 95.59% | 0.9958 | 0.9754 | | |
| | nya_Latn | 95.59% | 0.7975 | 0.8695 | | |
| | sna_Latn | 95.39% | 0.9342 | 0.9439 | | |
| | tso_Latn | 95.29% | 0.8482 | 0.8975 | | |
| | tir_Ethi | 95.09% | 0.9979 | 0.9738 | | |
| | hrv_Latn | 94.88% | 0.9643 | 0.9565 | | |
| | swh_Latn | 94.18% | 0.9418 | 0.9418 | | |
| | war_Latn | 93.58% | 0.9648 | 0.9501 | | |
| | kab_Latn | 93.48% | 0.9759 | 0.9549 | | |
| | bem_Latn | 92.78% | 0.9095 | 0.9186 | | |
| | run_Latn | 92.38% | 0.8583 | 0.8899 | | |
| | kmr_Latn | 91.57% | 0.9796 | 0.9466 | | |
| | yor_Latn | 91.27% | 0.9681 | 0.9396 | | |
| | nob_Latn | 91.22% | 0.9182 | 0.9152 | | |
| | kas_Arab | 90.17% | 0.9967 | 0.9468 | | |
| | pag_Latn | 89.87% | 0.9614 | 0.9290 | | |
| | pap_Latn | 89.77% | 0.9179 | 0.9077 | | |
| | gug_Latn | 89.67% | 0.8756 | 0.8860 | | |
| | oci_Latn | 88.52% | 0.9231 | 0.9037 | | |
| | lua_Latn | 88.47% | 0.8991 | 0.8918 | | |
| | gla_Latn | 88.16% | 0.9681 | 0.9228 | | |
| | lus_Latn | 87.96% | 0.9300 | 0.9041 | | |
| | quy_Latn | 87.26% | 0.9285 | 0.8997 | | |
| | dan_Latn | 87.16% | 0.8076 | 0.8384 | | |
| | ktu_Latn | 87.06% | 0.9538 | 0.9103 | | |
| | fao_Latn | 85.96% | 0.8248 | 0.8418 | | |
| | mos_Latn | 85.96% | 0.9695 | 0.9112 | | |
| | fur_Latn | 85.36% | 0.9092 | 0.8805 | | |
| | san_Deva | 84.85% | 1.0000 | 0.9181 | | |
| | smo_Latn | 84.05% | 0.9405 | 0.8877 | | |
| | cat_Latn | 83.45% | 0.9014 | 0.8667 | | |
| | isl_Latn | 81.44% | 0.8817 | 0.8467 | | |
| | lug_Latn | 81.34% | 0.9632 | 0.8820 | | |
| | tum_Latn | 80.54% | 0.9710 | 0.8805 | | |
| | zul_Latn | 80.34% | 0.7629 | 0.7826 | | |
| | vec_Latn | 78.44% | 0.9861 | 0.8737 | | |
| | xho_Latn | 78.44% | 0.7045 | 0.7423 | | |
| | jav_Latn | 77.03% | 0.8321 | 0.8000 | | |
| | ayr_Latn | 76.43% | 0.9361 | 0.8415 | | |
| | plt_Latn | 75.93% | 0.9895 | 0.8593 | | |
| | sag_Latn | 72.42% | 0.9014 | 0.8031 | | |
| | mri_Latn | 71.01% | 0.9944 | 0.8286 | | |
| | ban_Latn | 63.59% | 0.7955 | 0.7068 | | |
| | lim_Latn | 63.09% | 0.9844 | 0.7689 | | |
| | sun_Latn | 55.07% | 0.9515 | 0.6976 | | |
| | knc_Latn | 53.56% | 1.0000 | 0.6976 | | |
| | zsm_Latn | 51.25% | 0.8559 | 0.6412 | | |
| | knc_Arab | 45.84% | 1.0000 | 0.6286 | | |
| | bho_Deva | 35.71% | 0.9972 | 0.5258 | | |
| | arz_Arab | 27.88% | 0.8968 | 0.4254 | | |
| ## Notes | |
| - **Language Codes**: ISO 639-3 language code + ISO 15924 script code | |
| - Format: `{lang}_{Script}` (e.g., `eng_Latn` for English in Latin script) | |
| - **FLORES Evaluation**: FLORES+ dev set (1012 sentences per language) | |
| - **Removed Languages**: Languages with high confusion or insufficient training data: | |
| - `crh_Latn` (Crimean Tatar) | |
| - `ltz_Latn` (Luxembourgish) | |