Filter by:
Estonian (11)
English (7)
German (7)
Finnish (6)
Swedish (4)
Croatian (3)
French (3)
Greek (3)
Latvian (3)
Lithuanian (3)
Romanian (3)
Slovenian (3)
Hungarian (2)
Italian (2)
Polish (2)
Czech (1)
Danish (1)
Portuguese (1)
Russian (1)
Turkish (1)
CC - BY (11)
Proprietary (1)
Written Language (11)
Text (11)
True (1)
Nlp Applications (4)
Human Use (2)
Text Mining (1)
Parallel (7)
Comparable (2)
1996-2011 (2)
Resource Type:
Corpus: | |
Lexical/Conceptual: | |
Tool/Service: | |
Language Description: |
Media Type:
Text: | |
Audio: | |
Image: | |
Video: | |
Text Numerical: | |
Text N-Gram: |
11 Language Resources
Order by:
ACCURAT balanced test corpus for under resourced languages
67
370
- Croatian
- English
- Estonian
- German
- Greek
- Latvian
- Lithuanian
- Romanian
- Slovenian
ACCURAT corpus of comparable sentences
79
428
- Croatian
- English
- Estonian
- German
- Greek
- Latvian
- Lithuanian
- Romanian
- Slovenian
ACCURAT corpus of Wikipedia texts
68
438
- Croatian
- English
- Estonian
- German
- Greek
- Latvian
- Lithuanian
- Romanian
- Slovenian
Opus, Helsinki Korp Version
0
134
- Czech
- Danish
- English
- Estonian
- Finnish
- French
- German
- Greek, Modern (1453-)
- Hungarian
- Italian
- Polish
- Portuguese
- Russian
- Spanish; Castilian
- Swedish
- Turkish
The Helsinki Korp Europarl Bilingual Corpora
0
40
- English
- Estonian
- Finnish
- French
- German
- Spanish; Castilian
- Swedish
The Helsinki Korp JRC-Acquis Bilingual Parallel Corpora
0
22
- English
- Estonian
- Finnish
- French
- German
- Hungarian
- Italian
- Polish
- Spanish; Castilian
- Swedish