Filter by:
Greek (5)
Croatian (5)
English (5)
German (5)
Estonian (3)
Latvian (3)
Lithuanian (3)
Romanian (3)
Slovenian (3)
Arabic (2)
Chinese (2)
Czech (2)
Danish (2)
Dutch (2)
Finnish (2)
French (2)
Italian (2)
Japanese (2)
Korean (2)
Norwegian (2)
Polish (2)
Portuguese (2)
Russian (2)
Spanish (2)
Swedish (2)
Thai (2)
Turkish (2)
Vietnamese (2)
Hindi (1)
Persian (1)
Corpus (5)
CC - BY (3)
ELRA_END_USER (2)
True (1)
Nlp Applications (3)
Text Mining (1)
Multilingual (3)
Monolingual (2)
Comparable (2)
Parallel (1)
Written Language (3)
Brazil (2)
Modern (1453-) (2)
Resource Type:
Corpus: | |
Lexical/Conceptual: | |
Tool/Service: | |
Language Description: |
Media Type:
Text: | |
Audio: | |
Image: | |
Video: | |
Text Numerical: | |
Text N-Gram: |
5 Language Resources
Order by:
ACCURAT balanced test corpus for under resourced languages
67
368
- Croatian
- English
- Estonian
- German
- Greek
- Latvian
- Lithuanian
- Romanian
- Slovenian
ACCURAT corpus of comparable sentences
79
421
- Croatian
- English
- Estonian
- German
- Greek
- Latvian
- Lithuanian
- Romanian
- Slovenian
ACCURAT corpus of Wikipedia texts
68
432
- Croatian
- English
- Estonian
- German
- Greek
- Latvian
- Lithuanian
- Romanian
- Slovenian
Collins Multilingual database (MLD) – PhraseBank with audio files
0
86
- Arabic
- Chinese
- Croatian
- Czech
- Danish
- Dutch
- English
- Finnish
- French
- German
- Greek
- Hindi
- Italian
- Japanese
- Korean
- Norwegian
- Persian
- Polish
- Portuguese
- Russian
- Spanish
- Swedish
- Thai
- Turkish
- Vietnamese
Collins Multilingual database (MLD) – WordBank with audio files
0
84
- Arabic
- Chinese
- Croatian
- Czech
- Danish
- Dutch
- English
- Finnish
- French
- German
- Greek
- Italian
- Japanese
- Korean
- Norwegian
- Polish
- Portuguese
- Russian
- Spanish
- Swedish
- Thai
- Turkish
- Vietnamese