Adapters for
Language modeling for the Afrikaans language on the CC-100 corpus. Afrikaans
Language modeling for the Albanian language on the CC-100 corpus. Albanian
Language modeling for the Amharic language on the CC-100 corpus. Amharic
Language modeling for the Amharic language on Wikipedia. Amharic
Language modeling for the Arabic language on the CC-100 corpus. Arabic
Language modeling for the Arabic language on Wikipedia. Arabic is a Semitic language that first emerged in the 1st to 4th centuries CE. It is now the lingua franca of the Arab world. Arabic
Language modeling for the Armenian language on the CC-100 corpus. Armenian
Language modeling for the Armenian language on Wikipedia. Armenian
Language modeling for the Azerbaijani language on the CC-100 corpus. Azerbaijani
Language modeling for the Basque language on the CC-100 corpus. Basque
Language modeling for the Basque language on Wikipedia. Basque
Language modeling for the Belarusian language on the CC-100 corpus. Belarusian
Language modeling for the Bengali language on the CC-100 corpus. Bengali
Language modeling for the Bengali language on Wikipedia. Bengali
Language modeling for the Bhojpuri language on Wikipedia. Bhojpuri
Language modeling for the Bulgarian language on the CC-100 corpus. Bulgarian
Language modeling for the Burmese language on the CC-100 corpus. Burmese
Language modeling for the Burmese language on Wikipedia. The Burmese language is a Sino-Tibetan language spoken in Myanmar where it is an official language and the language of the Bamar people,... Burmese
Language modeling for the Buryat language on Wikipedia. Buryat
Language modeling for the Cantonese language on Wikipedia. Cantonese
Language modeling for the Catalan language on the CC-100 corpus. Catalan
Language modeling for the Central Khmer language on the CC-100 corpus. Central Khmer
Language modeling for the Chinese (traditional) language on the CC-100 corpus. Chinese
Language modeling for the Chinese language on Wikipedia. Chinese is a family of East Asian analytic languages that form the Sinitic branch of the Sino-Tibetan languages. Chinese
Language modeling for the Croatian language on the CC-100 corpus. Croatian
Language modeling for the Czech language on the CC-100 corpus. Czech
Language modeling for the Czech language on Wikipedia. Czech
Language modeling for the Danish language on the CC-100 corpus. Danish
Language modeling for the Dutch language on the CC-100 corpus. Dutch
Language modeling for the English language on the CC-100 corpus. English
Language modeling for the English language on Wikipedia. English is a West Germanic language that was first spoken in early medieval England and eventually became a global lingua franca. English
Language modeling for the Erzya language on Wikipedia. Erzya
Language modeling for the Esperanto language on the CC-100 corpus. Esperanto
Language modeling for the Estonian language on the CC-100 corpus. Estonian
Language modeling for the Estonian language on Wikipedia. Estonian is a Uralic language of the Finnic branch spoken in Estonia. Estonian
Language modeling for the Finnish language on the CC-100 corpus. Finnish
Language modeling for the Finnish language on Wikipedia. Finnish, whose endonym is suomi or suomen kieli is a Uralic language of the Finnic branch spoken by the majority of the population in... Finnish
Language modeling for the French language on the CC-100 corpus. French
Language modeling for the French language on Wikipedia. French or la langue française is a Romance language of the Indo-European family. French
Language modeling for the Galician language on the CC-100 corpus. Galician
Language modeling for the Georgian language on the CC-100 corpus. Georgian
Language modeling for the Georgian language on Wikipedia. Georgian
Language modeling for the German language on the CC-100 corpus. German
Language modeling for the German language on Wikipedia. German is a West Germanic language that is mainly spoken in Central Europe. German
Language modeling for the Greek language on the CC-100 corpus. Greek
Language modeling for the Greek language on Wikipedia. Greek is an independent branch of the Indo-European family of languages, native to Greece, Cyprus, Albania and other parts of the Eastern... Greek
Language modeling for the Guarani language on Wikipedia. Guarani specifically the primary variety known as Paraguayan Guarani, is an indigenous language of South America that belongs to the... Guarani
Language modeling for the Gujarati language on the CC-100 corpus. Gujarati
Language modeling for the Haitian Creole language on Wikipedia. Haitian Creole is a French-based creole language spoken by 10–12 million people worldwide, and the only language of most Haitians. Haitian Creole
Language modeling for the Hausa language on the CC-100 corpus. Hausa
Language modeling for the Hebrew language on the CC-100 corpus. Hebrew
Language modeling for the Hindi language on the CC-100 corpus. Hindi
Language modeling for the Hindi language on Wikipedia. Hindi, or more precisely Modern Standard Hindi, is an Indo-Aryan language spoken in India. Hindi
Language modeling for the Hungarian language on the CC-100 corpus. Hungarian
Language modeling for the Hungarian language on Wikipedia. Hungarian
Language modeling for the Icelandic language on the CC-100 corpus. Icelandic
Language modeling for the Icelandic language on Wikipedia. Icelandic is a North Germanic language spoken by about 314,000 people, the vast majority of whom live in Iceland where it is the national... Icelandic
Language modeling for the Ilocano language on Wikipedia. Ilocano (also Ilokano) is an Austronesian language spoken in the Philippines. Ilocano
Language modeling for the Indonesian language on the CC-100 corpus. Indonesian
Language modeling for the Indonesian language on Wikipedia. Indonesian is the official language of Indonesia. It is a standardised variety of Malay, an Austronesian language that has been used as... Indonesian
Language modeling for the Irish language on the CC-100 corpus. Irish
Language modeling for the Italian language on the CC-100 corpus. Italian
Language modeling for the Italian language on Wikipedia. Italian or lingua italiana, is a Romance language of the Indo-European language family. Italian descended from the Vulgar Latin of the... Italian
Language modeling for the Japanese language on the CC-100 corpus. Japanese
Language modeling for the Japanese language on Wikipedia. Japanese is an East Asian language spoken by about 128 million people, primarily in Japan, where it is the national language. Japanese
Language modeling for the Javanese language on Wikipedia. Javanese is the language of the Javanese people from the central and eastern parts of the island of Java, in Indonesia. Javanese
Language modeling for the Kannada language on the CC-100 corpus. Kannada
Language modeling for the Kazakh language on the CC-100 corpus. Kazakh
Language modeling for the Kirghiz language on the CC-100 corpus. Kirghiz
Language modeling for the Komi language on Wikipedia. Komi
Language modeling for the Korean language on the CC-100 corpus. Korean
Language modeling for the Korean language on Wikipedia. The Korean language is an East Asian language spoken by about 77 million people. Korean
Language modeling for the Kurdish language on the CC-100 corpus. Kurdish
Language modeling for the Lao language on the CC-100 corpus. Lao
Language modeling for the Latin language on the CC-100 corpus. Latin
Language modeling for the Latin language on Wikipedia. Latin
Language modeling for the Latvian language on the CC-100 corpus. Latvian
Language modeling for the Latvian language on Wikipedia. Latvian
Language modeling for the Lithuanian language on the CC-100 corpus. Lithuanian
Language modeling for the Macedonian language on the CC-100 corpus. Macedonian
Language modeling for the Malay language on the CC-100 corpus. Malay
Language modeling for the Malayalam language on the CC-100 corpus. Malayalam
Language modeling for the Marathi language on the CC-100 corpus. Marathi
Language modeling for the Meadow Mari language on Wikipedia. Meadow Mari or Meadow-Eastern Mari or Eastern Mari is a standardised dialect of the Mari language used by about half a million people... Meadow Mari
Language modeling for the Min Dong language on Wikipedia. Eastern Min or Min Dong, is a branch of the Min group of Sinitic languages of China. Min Dong
Language modeling for the Mingrelian language on Wikipedia. Megrelian is a Kartvelian language spoken in Western Georgia (regions of Samegrelo and Abkhazia), primarily by the Megrelians. Mingrelian
Language modeling for the Mongolian language on the CC-100 corpus. Mongolian
Language modeling for the Māori language on Wikipedia. Māori, also known as te reo ('the language'), is an Eastern Polynesian language spoken by the Māori people, the indigenous population of New Zealand. Māori
Language modeling for the Nepali language on the CC-100 corpus. Nepali
Language modeling for the Northern Sami language on Wikipedia. Northern Sami
Language modeling for the Norwegian language on the CC-100 corpus. Norwegian
Language modeling for the Oriya language on the CC-100 corpus. Oriya
Language modeling for the Pashto language on the CC-100 corpus. Pashto
Language modeling for the Persian language on the CC-100 corpus. Persian
Language modeling for the Persian language on Wikipedia. Persian
Language modeling for the Polish language on the CC-100 corpus. Polish
Language modeling for the Portuguese language on the CC-100 corpus. Portuguese
Language modeling for the Portuguese language on Wikipedia. Portuguese
Language modeling for the Punjabi language on the CC-100 corpus. Punjabi
Language modeling for the Quechuan language on Wikipedia. Quechua, usually called Runasimi ("people's language") in Quechuan languages, is an indigenous language family spoken by the Quechua... Quechua
Language modeling for the Romanian language on the CC-100 corpus. Romanian
Language modeling for the Russian language on the CC-100 corpus. Russian
Language modeling for the Russian language on Wikipedia. Russian is an East Slavic language, which is an official language in Russia, Belarus, Kazakhstan, Kyrgyzstan, as well as being widely used... Russian
Language modeling for the Sanskrit language on the CC-100 corpus. Sanskrit
Language modeling for the Serbian language on the CC-100 corpus. Serbian
Language modeling for the Sinhala language on the CC-100 corpus. Sinhala
Language modeling for the Slovak language on the CC-100 corpus. Slovak
Language modeling for the Slovenian language on the CC-100 corpus. Slovenian
Language modeling for the Somali language on the CC-100 corpus. Somali
Language modeling for the Spanish language on the CC-100 corpus. Spanish
Language modeling for the Spanish language on Wikipedia. Spanish, or Castilian, is a Romance language that originated in the Iberian Peninsula and today is a global language with more than 483... Spanish
Language modeling for the Swahili language on the CC-100 corpus. Swahili
Language modeling for the Swahili language on Wikipedia. Swahili, also known by its native name Kiswahili, is a Bantu language and the first language of the Swahili people. Swahili
Language modeling for the Swedish language on the CC-100 corpus. Swedish
Language modeling for the Tagalog language on the CC-100 corpus. Tagalog
Language modeling for the Tamil language on the CC-100 corpus. Tamil
Language modeling for the Tamil language on Wikipedia. Tamil, About this soundpronunciation is a Dravidian language predominantly spoken by the Tamil people of India and Sri Lanka, and by the... Tamil
Language modeling for the Telugu language on the CC-100 corpus. Telugu
Language modeling for the Thai language on the CC-100 corpus. Thai
Language modeling for the Thai language on Wikipedia. Thai, Central Thai, is the official language of Thailand and the first language of the Central Thai people. Thai
Language modeling for the Turkish language on the CC-100 corpus. Turkish
Language modeling for the Turkish language on Wikipedia. Turkish, also referred to as Istanbul Turkish or Turkey Turkish, is the most widely spoken of the Turkic languages, with around 70 to 80... Turkish
Language modeling for the Turkmen language on Wikipedia. Turkmen is an Oghuz Turkic language spoken by the Turkmens of Central Asia and the official language of Turkmenistan. Turkmen
Language modeling for the Ukrainian language on the CC-100 corpus. Ukrainian
Language modeling for the Urdu language on the CC-100 corpus. Urdu
Language modeling for the Uzbek language on the CC-100 corpus. Uzbek
Language modeling for the Vietnamese language on the CC-100 corpus. Vietnamese
Language modeling for the Vietnamese language on Wikipedia. Vietnamese is an Austroasiatic language that originated in Vietnam, where it is the national and official language. Spoken natively by... Vietnamese
Language modeling for the Welsh language on the CC-100 corpus. Welsh
Language modeling for the Wolof language on Wikipedia. Wolof
Add your language to AdapterHub, it's super awesome!