Filter by:
English (213)
German (100)
Russian (86)
French (81)
Estonian (67)
Latvian (63)
Finnish (61)
Swedish (51)
Icelandic (45)
Spanish (34)
Portuguese (33)
Italian (32)
Danish (30)
Lithuanian (30)
Hungarian (29)
Polish (25)
Latin (24)
Czech (20)
Bulgarian (18)
Romanian (17)
Spanish; Castilian (16)
Norwegian (14)
Basque (13)
Slovenian (12)
Dutch (11)
Croatian (10)
Dutch; Flemish (10)
Slovak (9)
Greek (8)
Maltese (7)
Erzya (6)
Northern Sami (6)
Faroese (5)
Japanese (5)
Moksha (5)
Arabic (4)
Catalan (4)
Chinese (4)
Galician (4)
Khanty (4)
Ingrian (3)
Persian (3)
Sign Languages (3)
Tundra Nenets (3)
Albanian (2)
Avaric (2)
Bengali (2)
Chukchi (2)
Chuvash (2)
Eastern Mari (2)
Even (2)
Evenki (2)
Gujarati (2)
Hill Mari (2)
Hindi (2)
Kalmyk; Oirat (2)
Komi Zyrian (2)
Koryak (2)
Kurdish (2)
Lak (2)
Ludian (2)
Mansi (2)
Panjabi (2)
Sami languages (2)
Selkup (2)
Serbian (2)
Sinhalese (2)
Tabassaran (2)
Tajik (2)
Tamil (2)
Tatar (2)
Turkish (2)
Udmurt (2)
Urdu (2)
Uzbek (2)
Veps (2)
Votic (2)
Armenian (1)
Assamese (1)
Baltic languages (1)
Celtic languages (1)
Gaelic (1)
Inari Sami (1)
Indic languages (1)
Ingrian Finnish (1)
Ingrian Finnish (1)
Irish (1)
Proprietary (147)
CC - BY (47)
ELRA_END_USER (20)
Under Negotiation (17)
ELRA_VAR (14)
CLARIN_RES (12)
CC - BY - NC - SA (11)
ELRA_EVALUATION (6)
Other (6)
CC - BY - SA (5)
CC - BY - NC (4)
CLARIN_ACA - NC (4)
CLARIN_ACA (3)
CC - BY - ND (2)
GFDL (2)
CC - ZERO (1)
LGPL (1)
MS Commons - BY (1)
Attribution (36)
Commercial Use (15)
No Redistribution (13)
Other (13)
Share Alike (7)
Evaluation Use (6)
Inform Licensor (5)
Redeposit (5)
No Derivatives (4)
Nlp Applications (159)
Human Use (155)
Information Retrieval (144)
Machine Translation (13)
Annotation (2)
Other (2)
Text Mining (2)
Event Extraction (1)
Lemmatization (1)
Parsing (1)
Pos Tagging (1)
Health (7)
Economics (6)
Humanities (6)
Communications (5)
Environment (5)
Energy (4)
Finance (4)
Science (4)
Social questions (4)
Taxation (4)
Community law (3)
Education (3)
Law (3)
Marketing (3)
Social affairs (3)
Teaching (3)
Wood industry (3)
Accounting (2)
Chemistry (2)
Civil law (2)
Consumption (2)
Documentation (2)
Medicine (2)
Tariff policy (2)
Transport (2)
Accomodation (1)
Europarl (1)
Legal news (1)
News (1)
Renewable energy (1)
Wikipedia (1)
Animal product (1)
Budget (1)
Criminal law (1)
Defence (1)
Economy (1)
Family (1)
Fisheries (1)
Food technology (1)
Foodstuff (1)
Forestry (1)
General (1)
Geography (1)
Industry (1)
Land transport (1)
Management (1)
Politics (1)
Prices (1)
Trade (1)
Transport policy (1)
Resource Type:
Corpus: | |
Lexical/Conceptual: | |
Tool/Service: | |
Language Description: |
Media Type:
Text: | |
Audio: | |
Image: | |
Video: | |
Text Numerical: | |
Text N-Gram: |
291 Language Resources (Page 1 of 15)
« Previous | Next »Order by:
ACCURAT balanced test corpus for under resourced languages
67
368
- Croatian
- English
- Estonian
- German
- Greek
- Latvian
- Lithuanian
- Romanian
- Slovenian
ACCURAT corpus of comparable sentences
79
421
- Croatian
- English
- Estonian
- German
- Greek
- Latvian
- Lithuanian
- Romanian
- Slovenian
ACCURAT corpus of Wikipedia texts
68
432
- Croatian
- English
- Estonian
- German
- Greek
- Latvian
- Lithuanian
- Romanian
- Slovenian
ARCADE II Evaluation Package
0
213
- Arabic
- Chinese
- English
- French
- German
- Greek, Modern (1453-)
- Italian
- Japanese
- Persian
- Russian
- Spanish
Art Lexicon: Painting, Sculpture, Graphics, Architecture and Industrial Artist in Estonian, English, French, German and Swedish
0
159
- English
- Estonian
- French
- German
Bilingual term pairs extracted from comparable news feeds resources using the TaaS Bilingual Term Extraction System.
0
111
- English
- German
- Latvian
Bilingual term pairs extracted from comparable Web resources using the TaaS Bilingual Term Extraction System
0
408
- Bulgarian
- Croatian
- Czech
- Danish
- Dutch; Flemish
- English
- Estonian
- Finnish
- French
- German
- Greek, Modern (1453-)
- Hungarian
- Italian
- Latvian
- Lithuanian
- Polish
- Portuguese
- Romanian
- Russian
- Slovak
- Slovenian
- Spanish
- Swedish
Bilingual term pairs extracted from Wikipedia using the TaaS Bilingual Term Extraction System
0
167
- Bulgarian
- Croatian
- Danish
- English
- Estonian
- Greek, Modern (1453-)
- Irish
- Latvian
- Lithuanian
- Maltese
- Romanian
- Slovak
- Slovenian
« Previous | Next »