Language list
George Rhoten edited this page Aug 1, 2025 · 53 revisions
Our end goal is to cover all languages. If you are looking for a prioritized list of languages to contribute to, see the table below.
The list is derived from the CLDR database and covers the languages marked as modern in the "Target level" column.
The Data Quantity column reflects the number of lexemes available in Wikidata as of July 30, 2025.
Symbol | Description |
---|---|
✅ | >10000 lexemes |
>1000 lexemes | |
❌ | <1000 lexemes |
N/A | Not applicable or not needed |
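
As a rough illustration of how the legend thresholds map a lexeme count to a tier, here is a minimal Python sketch. The function name `classify_lexeme_count` and the tier labels (`high`, `medium`, `low`) are hypothetical and not part of the project. The legend does not say how counts of exactly 1000 or 10000 are treated, so the boundary handling below is an assumption.

```python
def classify_lexeme_count(count: int) -> str:
    """Map a Wikidata lexeme count to a data-quantity tier.

    Tiers follow the legend above:
      high   -> more than 10000 lexemes
      medium -> more than 1000 lexemes
      low    -> everything else (assumption: includes exactly 1000)
    """
    if count < 0:
        raise ValueError("lexeme count cannot be negative")
    if count > 10000:
        return "high"
    if count > 1000:
        return "medium"
    return "low"


if __name__ == "__main__":
    # Made-up counts for illustration only, not real Wikidata figures.
    for sample in (25000, 5000, 500):
        print(sample, classify_lexeme_count(sample))
```

Languages marked N/A in the table below would bypass a check like this entirely, since the legend describes lexeme data as not applicable or not needed for them.
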
Code | Script code | Description | Data Quantity | Supported |
---|---|---|---|---|
en | Latn | English | ✅ | ✅ |
zh | Hans | Chinese (Simplified, Mandarin) | N/A | ✅ |
es | Latn | Spanish | ✅ | ✅ |
fr | Latn | French | ✅ | ✅ |
pt | Latn | Portuguese | ✅ | |
hi | Deva | Hindi | ✅ | |
ar | Arab | Arabic (Modern Standard) | ✅ | |
ru | Cyrl | Russian | ✅ | ✅ |
de | Latn | German | ✅ | ✅ |
ja | Jpan | Japanese | N/A | ✅ |
it | Latn | Italian | ✅ | ✅ |
id | Latn | Indonesian | N/A | ✅ |
vi | Latn | Vietnamese | N/A | ✅ |
pl | Latn | Polish | ❌ | |
ko | Kore | Korean | ✅ | |
tr | Latn | Turkish | ✅ | |
nl | Latn | Dutch | ✅ | |
zh | Hant | Chinese (Traditional, Mandarin) | N/A | ✅ |
sv | Latn | Swedish | ✅ | ✅ |
ro | Latn | Romanian | ❌ | ❌ |
bn | Beng | Bangla (Bengali) | ✅ | ❌ |
th | Thai | Thai | N/A | ✅ |
cs | Latn | Czech | ✅ | ❌ |
hu | Latn | Hungarian | ❌ | ❌ |
no | Latn | Norwegian (Bokmål) | ✅ | ✅ |
el | Grek | Greek | ✅ | ❌ |
fi | Latn | Finnish | ❌ | |
da | Latn | Danish | ✅ | ✅ |
sk | Latn | Slovak | ✅ | ❌ |
uk | Cyrl | Ukrainian | ✅ | ❌ |
bg | Cyrl | Bulgarian | ❌ | ❌ |
hr | Latn | Croatian | ❌ | ❌ |
iw | Hebr | Hebrew | ✅ | ✅ |
lt | Latn | Lithuanian | ❌ | ❌ |
sl | Latn | Slovenian | ❌ | ❌ |
ms | Latn | Malay | N/A | ✅ |
ca | Latn | Catalan | ❌ | ❌ |
kk | Cyrl | Kazakh | ❌ | ❌ |
fa | Arab | Persian | ✅ | ❌ |
ur | Arab | Urdu | ❌ | |
sw | Latn | Swahili | ❌ | ❌ |
lv | Latn | Latvian | ❌ | ❌ |
et | Latn | Estonian | ✅ | ❌ |
te | Telu | Telugu | ❌ | ❌ |
ta | Taml | Tamil | ❌ | ❌ |
mr | Deva | Marathi | ❌ | ❌ |
fil | Latn | Filipino | ❌ | ❌ |
gu | Gujr | Gujarati | ❌ | ❌ |
is | Latn | Icelandic | ❌ | ❌ |
kn | Knda | Kannada | ❌ | ❌ |
ml | Mlym | Malayalam | ✅ | ❌ |
sr | Cyrl | Serbian (Cyrillic) | ❌ | ✅ |
pa | Guru | Punjabi | ❌ | |
or | Orya | Odia | ❌ | ❌ |
my | Mymr | Burmese (Myanmar) | ❌ | ❌ |
uz | Latn | Uzbek | ❌ | ❌ |
mk | Cyrl | Macedonian | ❌ | ❌ |
az | Latn | Azerbaijani | ❌ | ❌ |
hy | Armn | Armenian | ❌ | ❌ |
as | Beng | Assamese | ❌ | ❌ |
eu | Latn | Basque | ✅ | ❌ |
si | Sinh | Sinhala | ❌ | ❌ |
af | Latn | Afrikaans | ❌ | ❌ |
ka | Geor | Georgian | ❌ | ❌ |
ne | Deva | Nepali | ❌ | ❌ |
sq | Latn | Albanian | ❌ |