Filter by:
Multilingual (17)
Written Language (17)
English (12)
Estonian (9)
German (9)
Latvian (6)
Lithuanian (6)
Finnish (5)
Swedish (5)
Croatian (4)
French (4)
Romanian (4)
Greek (3)
Slovenian (3)
Hungarian (2)
Italian (2)
Polish (2)
Russian (2)
Czech (1)
Danish (1)
Eastern Mari (1)
Erzya (1)
Hill Mari (1)
Ingrian (1)
Khanty (1)
Mansi (1)
Moksha (1)
Portuguese (1)
Selkup (1)
Tundra Nenets (1)
Turkish (1)
Veps (1)
Corpus (14)
Text (17)
Nlp Applications (8)
Human Use (2)
Text Mining (1)
Resource Type:
Corpus: | |
Lexical/Conceptual: | |
Tool/Service: | |
Language Description: |
Media Type:
Text: | |
Audio: | |
Image: | |
Video: | |
Text Numerical: | |
Text N-Gram: |
17 Language Resources
Order by:
ACCURAT balanced test corpus for under resourced languages
67
368
- Croatian
- English
- Estonian
- German
- Greek
- Latvian
- Lithuanian
- Romanian
- Slovenian
ACCURAT corpus of comparable sentences
79
421
- Croatian
- English
- Estonian
- German
- Greek
- Latvian
- Lithuanian
- Romanian
- Slovenian
ACCURAT corpus of Wikipedia texts
68
432
- Croatian
- English
- Estonian
- German
- Greek
- Latvian
- Lithuanian
- Romanian
- Slovenian
English-Estonian cross-linked collection of comparable sentences from Wikipedia
6
97
- English
- Estonian
English-Latvian cross-linked collection of comparable sentences from Wikipedia
11
124
- English
- Latvian
English-Lithuanian cross-linked collection of comparable sentences from Wikipedia
7
88
- English
- Lithuanian
Fenno-ugrica, Kielipankki Version
0
86
- Eastern Mari
- Erzya
- Hill Mari
- Ingrian
- Khanty
- Mansi
- Moksha
- Selkup
- Tundra Nenets
- Veps
Latvian-Lithuanian cross-linked collection of comparable sentences from Wikipedia
4
91
- Latvian
- Lithuanian
Opus, Helsinki Korp Version
0
133
- Czech
- Danish
- English
- Estonian
- Finnish
- French
- German
- Greek, Modern (1453-)
- Hungarian
- Italian
- Polish
- Portuguese
- Russian
- Spanish; Castilian
- Swedish
- Turkish
The Helsinki Korp Europarl Bilingual Corpora
0
40
- English
- Estonian
- Finnish
- French
- German
- Spanish; Castilian
- Swedish
The Helsinki Korp JRC-Acquis Bilingual Parallel Corpora
0
22
- English
- Estonian
- Finnish
- French
- German
- Hungarian
- Italian
- Polish
- Spanish; Castilian
- Swedish