Supported Languages
WLDetect supports 148 languages trained on OpenLID-v2 and evaluated on FLORES+.
Performance Summary
- Average Accuracy: 92.92%
- Macro Precision: 0.9399
- Macro Recall: 0.9294
- Macro F1: 0.9274
- Languages ≥ 95% accuracy: 103/148
- Languages ≥ 90% accuracy: 113/148
Language List
Languages sorted by FLORES accuracy (highest to lowest).
| Language Code | Accuracy | Precision | F1 |
|---|---|---|---|
| asm_Beng | 100.00% | 1.0000 | 1.0000 |
| ben_Beng | 100.00% | 0.9930 | 0.9965 |
| cmn_Hant | 100.00% | 0.9379 | 0.9680 |
| dzo_Tibt | 100.00% | 1.0000 | 1.0000 |
| ell_Grek | 100.00% | 0.9970 | 0.9985 |
| guj_Gujr | 100.00% | 1.0000 | 1.0000 |
| heb_Hebr | 100.00% | 1.0000 | 1.0000 |
| hun_Latn | 100.00% | 0.9477 | 0.9732 |
| hye_Armn | 100.00% | 1.0000 | 1.0000 |
| jpn_Jpan | 100.00% | 0.9990 | 0.9995 |
| kan_Knda | 100.00% | 1.0000 | 1.0000 |
| kat_Geor | 100.00% | 1.0000 | 1.0000 |
| khm_Khmr | 100.00% | 1.0000 | 1.0000 |
| kor_Hang | 100.00% | 1.0000 | 1.0000 |
| lao_Laoo | 100.00% | 1.0000 | 1.0000 |
| mal_Mlym | 100.00% | 1.0000 | 1.0000 |
| mya_Mymr | 100.00% | 1.0000 | 1.0000 |
| ory_Orya | 100.00% | 1.0000 | 1.0000 |
| pan_Guru | 100.00% | 1.0000 | 1.0000 |
| pes_Arab | 100.00% | 0.8692 | 0.9300 |
| sat_Olck | 100.00% | 1.0000 | 1.0000 |
| shn_Mymr | 100.00% | 1.0000 | 1.0000 |
| sin_Sinh | 100.00% | 1.0000 | 1.0000 |
| snd_Arab | 100.00% | 0.9970 | 0.9985 |
| tam_Taml | 100.00% | 1.0000 | 1.0000 |
| taq_Tfng | 100.00% | 1.0000 | 1.0000 |
| tel_Telu | 100.00% | 1.0000 | 1.0000 |
| tha_Thai | 100.00% | 1.0000 | 1.0000 |
| uig_Arab | 100.00% | 0.9990 | 0.9995 |
| ukr_Cyrl | 100.00% | 0.9842 | 0.9920 |
| urd_Arab | 100.00% | 0.9130 | 0.9545 |
| vie_Latn | 100.00% | 0.9891 | 0.9945 |
| ckb_Arab | 99.90% | 1.0000 | 0.9995 |
| hin_Deva | 99.90% | 0.5605 | 0.7181 |
| kir_Cyrl | 99.90% | 0.9891 | 0.9940 |
| lit_Latn | 99.90% | 0.9755 | 0.9871 |
| lvs_Latn | 99.90% | 0.8078 | 0.8933 |
| npi_Deva | 99.90% | 0.9970 | 0.9980 |
| rus_Cyrl | 99.90% | 0.9930 | 0.9960 |
| amh_Ethi | 99.80% | 0.9531 | 0.9750 |
| arb_Arab | 99.80% | 0.4802 | 0.6484 |
| mar_Deva | 99.80% | 0.9891 | 0.9935 |
| ron_Latn | 99.80% | 0.9698 | 0.9837 |
| tuk_Latn | 99.80% | 0.9822 | 0.9900 |
| tur_Latn | 99.80% | 0.9679 | 0.9827 |
| eng_Latn | 99.70% | 0.8955 | 0.9435 |
| kik_Latn | 99.70% | 0.9832 | 0.9900 |
| pbt_Arab | 99.70% | 1.0000 | 0.9985 |
| pol_Latn | 99.70% | 0.9395 | 0.9674 |
| als_Latn | 99.60% | 0.9641 | 0.9798 |
| bjn_Arab | 99.60% | 0.9940 | 0.9950 |
| deu_Latn | 99.60% | 0.9697 | 0.9827 |
| khk_Cyrl | 99.60% | 0.9990 | 0.9975 |
| mlt_Latn | 99.60% | 0.9890 | 0.9925 |
| por_Latn | 99.60% | 0.9077 | 0.9498 |
| azj_Latn | 99.50% | 0.7619 | 0.8630 |
| bul_Cyrl | 99.50% | 0.9940 | 0.9945 |
| fra_Latn | 99.50% | 0.9026 | 0.9466 |
| tat_Cyrl | 99.40% | 0.8528 | 0.9180 |
| tgk_Cyrl | 99.40% | 1.0000 | 0.9970 |
| ekk_Latn | 99.30% | 0.9252 | 0.9579 |
| mni_Beng | 99.30% | 1.0000 | 0.9965 |
| fin_Latn | 99.20% | 0.9556 | 0.9734 |
| kaz_Cyrl | 99.20% | 0.9940 | 0.9930 |
| uzn_Latn | 99.20% | 0.8942 | 0.9406 |
| ilo_Latn | 99.00% | 0.7992 | 0.8844 |
| nld_Latn | 99.00% | 0.7711 | 0.8669 |
| slk_Latn | 99.00% | 0.9164 | 0.9518 |
| epo_Latn | 98.90% | 0.9880 | 0.9885 |
| bel_Cyrl | 98.80% | 1.0000 | 0.9939 |
| cym_Latn | 98.80% | 0.9970 | 0.9924 |
| mkd_Cyrl | 98.80% | 0.9572 | 0.9724 |
| tpi_Latn | 98.80% | 0.9919 | 0.9899 |
| hau_Latn | 98.70% | 0.9619 | 0.9743 |
| ita_Latn | 98.70% | 0.8586 | 0.9183 |
| nus_Latn | 98.70% | 1.0000 | 0.9934 |
| eus_Latn | 98.50% | 0.9590 | 0.9718 |
| ewe_Latn | 98.50% | 0.9534 | 0.9689 |
| ces_Latn | 97.99% | 0.9939 | 0.9869 |
| gaz_Latn | 97.89% | 0.9683 | 0.9736 |
| swe_Latn | 97.89% | 0.9597 | 0.9692 |
| bak_Cyrl | 97.79% | 1.0000 | 0.9888 |
| spa_Latn | 97.69% | 0.9137 | 0.9443 |
| ceb_Latn | 97.59% | 0.8935 | 0.9329 |
| cmn_Hans | 97.49% | 1.0000 | 0.9873 |
| slv_Latn | 97.29% | 0.9327 | 0.9524 |
| tsn_Latn | 97.19% | 0.9133 | 0.9417 |
| afr_Latn | 96.89% | 0.9244 | 0.9461 |
| som_Latn | 96.79% | 0.9718 | 0.9698 |
| fij_Latn | 96.69% | 0.9377 | 0.9521 |
| hat_Latn | 96.59% | 0.9008 | 0.9322 |
| gle_Latn | 96.39% | 0.9049 | 0.9335 |
| fil_Latn | 96.29% | 0.9152 | 0.9384 |
| ind_Latn | 96.29% | 0.5000 | 0.6582 |
| lin_Latn | 95.89% | 0.9775 | 0.9681 |
| srp_Cyrl | 95.89% | 0.9927 | 0.9755 |
| yue_Hant | 95.79% | 1.0000 | 0.9785 |
| twi_Latn | 95.74% | 0.9770 | 0.9671 |
| ibo_Latn | 95.59% | 0.9958 | 0.9754 |
| nya_Latn | 95.59% | 0.7975 | 0.8695 |
| sna_Latn | 95.39% | 0.9342 | 0.9439 |
| tso_Latn | 95.29% | 0.8482 | 0.8975 |
| tir_Ethi | 95.09% | 0.9979 | 0.9738 |
| hrv_Latn | 94.88% | 0.9643 | 0.9565 |
| swh_Latn | 94.18% | 0.9418 | 0.9418 |
| war_Latn | 93.58% | 0.9648 | 0.9501 |
| kab_Latn | 93.48% | 0.9759 | 0.9549 |
| bem_Latn | 92.78% | 0.9095 | 0.9186 |
| run_Latn | 92.38% | 0.8583 | 0.8899 |
| kmr_Latn | 91.57% | 0.9796 | 0.9466 |
| yor_Latn | 91.27% | 0.9681 | 0.9396 |
| nob_Latn | 91.22% | 0.9182 | 0.9152 |
| kas_Arab | 90.17% | 0.9967 | 0.9468 |
| pag_Latn | 89.87% | 0.9614 | 0.9290 |
| pap_Latn | 89.77% | 0.9179 | 0.9077 |
| gug_Latn | 89.67% | 0.8756 | 0.8860 |
| oci_Latn | 88.52% | 0.9231 | 0.9037 |
| lua_Latn | 88.47% | 0.8991 | 0.8918 |
| gla_Latn | 88.16% | 0.9681 | 0.9228 |
| lus_Latn | 87.96% | 0.9300 | 0.9041 |
| quy_Latn | 87.26% | 0.9285 | 0.8997 |
| dan_Latn | 87.16% | 0.8076 | 0.8384 |
| ktu_Latn | 87.06% | 0.9538 | 0.9103 |
| fao_Latn | 85.96% | 0.8248 | 0.8418 |
| mos_Latn | 85.96% | 0.9695 | 0.9112 |
| fur_Latn | 85.36% | 0.9092 | 0.8805 |
| san_Deva | 84.85% | 1.0000 | 0.9181 |
| smo_Latn | 84.05% | 0.9405 | 0.8877 |
| cat_Latn | 83.45% | 0.9014 | 0.8667 |
| isl_Latn | 81.44% | 0.8817 | 0.8467 |
| lug_Latn | 81.34% | 0.9632 | 0.8820 |
| tum_Latn | 80.54% | 0.9710 | 0.8805 |
| zul_Latn | 80.34% | 0.7629 | 0.7826 |
| vec_Latn | 78.44% | 0.9861 | 0.8737 |
| xho_Latn | 78.44% | 0.7045 | 0.7423 |
| jav_Latn | 77.03% | 0.8321 | 0.8000 |
| ayr_Latn | 76.43% | 0.9361 | 0.8415 |
| plt_Latn | 75.93% | 0.9895 | 0.8593 |
| sag_Latn | 72.42% | 0.9014 | 0.8031 |
| mri_Latn | 71.01% | 0.9944 | 0.8286 |
| ban_Latn | 63.59% | 0.7955 | 0.7068 |
| lim_Latn | 63.09% | 0.9844 | 0.7689 |
| sun_Latn | 55.07% | 0.9515 | 0.6976 |
| knc_Latn | 53.56% | 1.0000 | 0.6976 |
| zsm_Latn | 51.25% | 0.8559 | 0.6412 |
| knc_Arab | 45.84% | 1.0000 | 0.6286 |
| bho_Deva | 35.71% | 0.9972 | 0.5258 |
| arz_Arab | 27.88% | 0.8968 | 0.4254 |
Notes
- Language Codes: ISO 639-3 language code + ISO 15924 script code
- Format:
{lang}_{Script}(e.g.,eng_Latnfor English in Latin script)
- Format:
- FLORES Evaluation: FLORES+ dev set (1012 sentences per language)
- Removed Languages: Languages with high confusion or insufficient training data:
crh_Latn(Crimean Tatar)ltz_Latn(Luxembourgish)