Filter by:
Latvian (10)
English (8)
Lithuanian (7)
Romanian (6)
Estonian (5)
German (5)
Slovenian (5)
Croatian (4)
Greek (3)
Bulgarian (2)
Czech (2)
Danish (2)
Dutch; Flemish (2)
Finnish (2)
French (2)
Hungarian (2)
Italian (2)
Polish (2)
Portuguese (2)
Slovak (2)
Spanish (2)
Swedish (2)
Maltese (1)
Written Language (10)
Text (10)
True (3)
Nlp Applications (10)
Human Use (1)
Text Mining (1)
Parallel (5)
Comparable (4)
1996-2011 (1)
Resource Type:
Corpus: | |
Lexical/Conceptual: | |
Tool/Service: | |
Language Description: |
Media Type:
Text: | |
Audio: | |
Image: | |
Video: | |
Text Numerical: | |
Text N-Gram: |
10 Language Resources
Order by:
ACCURAT balanced test corpus for under resourced languages
67
368
- Croatian
- English
- Estonian
- German
- Greek
- Latvian
- Lithuanian
- Romanian
- Slovenian
ACCURAT corpus of comparable sentences
79
424
- Croatian
- English
- Estonian
- German
- Greek
- Latvian
- Lithuanian
- Romanian
- Slovenian
ACCURAT corpus of Wikipedia texts
68
435
- Croatian
- English
- Estonian
- German
- Greek
- Latvian
- Lithuanian
- Romanian
- Slovenian
English-Latvian cross-linked collection of comparable sentences from Wikipedia
11
124
- English
- Latvian
Europarl Parallel Corpus
0
176
- Bulgarian
- Czech
- Danish
- Dutch; Flemish
- English
- Estonian
- Finnish
- French
- German
- Greek, Modern (1453-)
- Hungarian
- Italian
- Latvian
- Lithuanian
- Polish
- Portuguese
- Romanian
- Slovak
- Slovenian
- Spanish
- Swedish
JRC-Acquis Multilingual Parallel Corpus
0
146
- Bulgarian
- Czech
- Danish
- Dutch; Flemish
- English
- Estonian
- Finnish
- French
- German
- Greek, Modern (1453-)
- Hungarian
- Italian
- Latvian
- Lithuanian
- Maltese
- Polish
- Portuguese
- Romanian
- Slovak
- Slovenian
- Spanish
- Swedish
Latvian-Lithuanian cross-linked collection of comparable sentences from Wikipedia
4
91
- Latvian
- Lithuanian