Filter by:
Multilingual (20)
English (15)
Estonian (13)
German (10)
Latvian (8)
Lithuanian (7)
Croatian (6)
Finnish (6)
Romanian (6)
French (5)
Slovenian (5)
Swedish (5)
Russian (4)
Czech (3)
Danish (3)
Greek (3)
Hungarian (3)
Italian (3)
Polish (3)
Portuguese (3)
Bulgarian (2)
Slovak (2)
Spanish (2)
Basque (1)
Dutch; Flemish (1)
Eastern Mari (1)
Erzya (1)
Hill Mari (1)
Ingrian (1)
Irish (1)
Khanty (1)
Maltese (1)
Mansi (1)
Moksha (1)
Selkup (1)
Tundra Nenets (1)
Turkish (1)
Veps (1)
Corpus (10)
Text (20)
True (1)
Nlp Applications (4)
Human Use (2)
Text Mining (1)
Written Language (11)
Europe (1)
1996-2011 (1)
Resource Type:
Corpus: | |
Lexical/Conceptual: | |
Tool/Service: | |
Language Description: |
Media Type:
Text: | |
Audio: | |
Image: | |
Video: | |
Text Numerical: | |
Text N-Gram: |
20 Language Resources
Order by:
ACCURAT balanced test corpus for under resourced languages
67
367
- Croatian
- English
- Estonian
- German
- Greek
- Latvian
- Lithuanian
- Romanian
- Slovenian
ACCURAT corpus of comparable sentences
79
419
- Croatian
- English
- Estonian
- German
- Greek
- Latvian
- Lithuanian
- Romanian
- Slovenian
ACCURAT corpus of Wikipedia texts
68
430
- Croatian
- English
- Estonian
- German
- Greek
- Latvian
- Lithuanian
- Romanian
- Slovenian
Bilingual term pairs extracted from comparable news feeds resources using the TaaS Bilingual Term Extraction System.
0
111
- English
- German
- Latvian
Bilingual term pairs extracted from comparable Web resources using the TaaS Bilingual Term Extraction System
0
406
- Bulgarian
- Croatian
- Czech
- Danish
- Dutch; Flemish
- English
- Estonian
- Finnish
- French
- German
- Greek, Modern (1453-)
- Hungarian
- Italian
- Latvian
- Lithuanian
- Polish
- Portuguese
- Romanian
- Russian
- Slovak
- Slovenian
- Spanish
- Swedish
Bilingual term pairs extracted from Wikipedia using the TaaS Bilingual Term Extraction System
0
167
- Bulgarian
- Croatian
- Danish
- English
- Estonian
- Greek, Modern (1453-)
- Irish
- Latvian
- Lithuanian
- Maltese
- Romanian
- Slovak
- Slovenian
Collection of comparable Lithuanian, Latvian and Estonian laws and legislations
0
112
- English
- Estonian
- Latvian
- Lithuanian
Fenno-ugrica, Kielipankki Version
0
86
- Eastern Mari
- Erzya
- Hill Mari
- Ingrian
- Khanty
- Mansi
- Moksha
- Selkup
- Tundra Nenets
- Veps
Opus, Helsinki Korp Version
0
133
- Czech
- Danish
- English
- Estonian
- Finnish
- French
- German
- Greek, Modern (1453-)
- Hungarian
- Italian
- Polish
- Portuguese
- Russian
- Spanish; Castilian
- Swedish
- Turkish
The Helsinki Korp Europarl Bilingual Corpora
0
40
- English
- Estonian
- Finnish
- French
- German
- Spanish; Castilian
- Swedish
The Helsinki Korp JRC-Acquis Bilingual Parallel Corpora
0
22
- English
- Estonian
- Finnish
- French
- German
- Hungarian
- Italian
- Polish
- Spanish; Castilian
- Swedish