Filter by:
English (16)
German (16)
Estonian (8)
Finnish (5)
French (5)
Latvian (5)
Swedish (5)
Croatian (4)
Czech (4)
Lithuanian (4)
Romanian (4)
Slovenian (4)
Greek (3)
Hungarian (3)
Italian (3)
Polish (3)
Portuguese (3)
Danish (2)
Russian (2)
Basque (1)
Bulgarian (1)
Dutch; Flemish (1)
Slovak (1)
Spanish (1)
Turkish (1)
CC - BY (16)
Text (16)
Attribution (9)
True (1)
Nlp Applications (3)
Human Use (1)
Text Mining (1)
Multilingual (10)
Bilingual (6)
Parallel (9)
Comparable (2)
Written Language (8)
Text/tsv (4)
1996-2011 (1)
Resource Type:
Corpus: | |
Lexical/Conceptual: | |
Tool/Service: | |
Language Description: |
Media Type:
Text: | |
Audio: | |
Image: | |
Video: | |
Text Numerical: | |
Text N-Gram: |
14 Language Resources
Order by:
ACCURAT balanced test corpus for under resourced languages
68
371
- Croatian
- English
- Estonian
- German
- Greek
- Latvian
- Lithuanian
- Romanian
- Slovenian
ACCURAT corpus of comparable sentences
79
434
- Croatian
- English
- Estonian
- German
- Greek
- Latvian
- Lithuanian
- Romanian
- Slovenian
ACCURAT corpus of Wikipedia texts
68
444
- Croatian
- English
- Estonian
- German
- Greek
- Latvian
- Lithuanian
- Romanian
- Slovenian
Bilingual term pairs extracted from comparable news feeds resources using the TaaS Bilingual Term Extraction System.
0
113
- English
- German
- Latvian
Bilingual term pairs extracted from comparable Web resources using the TaaS Bilingual Term Extraction System
0
417
- Bulgarian
- Croatian
- Czech
- Danish
- Dutch; Flemish
- English
- Estonian
- Finnish
- French
- German
- Greek, Modern (1453-)
- Hungarian
- Italian
- Latvian
- Lithuanian
- Polish
- Portuguese
- Romanian
- Russian
- Slovak
- Slovenian
- Spanish
- Swedish