Filter by:
Written Language (75)
Finnish (34)
Swedish (32)
English (13)
Estonian (11)
German (9)
French (6)
Croatian (4)
Latvian (4)
Lithuanian (4)
Romanian (4)
Greek (3)
Hungarian (3)
Italian (3)
Slovenian (3)
Czech (2)
Hill Mari (2)
Moksha (2)
Polish (2)
Danish (1)
Eastern Mari (1)
Erzya (1)
Finland Swedish (1)
Ingrian (1)
Karelian (1)
Khanty (1)
Kildin Sami (1)
Livonian (1)
Mansi (1)
Nenets (1)
Northern Sami (1)
Olonets (1)
Portuguese (1)
Russian (1)
Selkup (1)
Spanish (1)
Swahili (1)
Ter Sami (1)
Tundra Nenets (1)
Turkish (1)
Veps (1)
True (24)
Human Use (9)
Nlp Applications (9)
Other (24)
Text/plain (1)
Resource Type:
Corpus: | |
Lexical/Conceptual: | |
Tool/Service: | |
Language Description: |
Media Type:
Text: | |
Audio: | |
Image: | |
Video: | |
Text Numerical: | |
Text N-Gram: |
75 Language Resources (Page 1 of 4)
« Previous | Next »Order by:
ACCURAT balanced test corpus for under resourced languages
67
367
- Croatian
- English
- Estonian
- German
- Greek
- Latvian
- Lithuanian
- Romanian
- Slovenian
ACCURAT corpus of comparable sentences
79
421
- Croatian
- English
- Estonian
- German
- Greek
- Latvian
- Lithuanian
- Romanian
- Slovenian
ACCURAT corpus of Wikipedia texts
68
432
- Croatian
- English
- Estonian
- German
- Greek
- Latvian
- Lithuanian
- Romanian
- Slovenian
Fenno-ugrica, Kielipankki Version
0
86
- Eastern Mari
- Erzya
- Hill Mari
- Ingrian
- Khanty
- Mansi
- Moksha
- Selkup
- Tundra Nenets
- Veps
« Previous | Next »