Filter by:
English (923)
Spanish (344)
German (209)
French (202)
Finnish (80)
Italian (70)
Estonian (66)
Russian (66)
Swedish (63)
Portuguese (58)
Icelandic (46)
Latvian (45)
Danish (43)
Polish (34)
Spanish; Castilian (33)
Romanian (32)
Czech (30)
Lithuanian (30)
Hungarian (28)
Basque (23)
Vietnamese (21)
Arabic (20)
Bulgarian (20)
Latin (19)
Norwegian (18)
Catalan (16)
Chinese (16)
Dutch; Flemish (16)
Slovenian (16)
Croatian (15)
Dutch (15)
Slovak (11)
Turkish (11)
Galician (10)
Greek (10)
Japanese (10)
Maltese (10)
Hindi (8)
Korean (7)
Persian (6)
Faroese (5)
Northern Sami (4)
Tamil (4)
Thai (4)
Bengali (3)
Erzya (3)
Kurdish (3)
Malayalam (3)
Panjabi (3)
Serbian (3)
Sign Languages (3)
Urdu (3)
Uzbek (3)
Afrikaans (2)
Albanian (2)
Chuvash (2)
Esperanto (2)
Gujarati (2)
Indonesian (2)
Irish (2)
Kannada (2)
Khanty (2)
Modern Greek (2)
Moksha (2)
Pushto (2)
Sinhalese (2)
Swahili (2)
Tatar (2)
Telugu (2)
Udmurt (2)
Welsh (2)
Amharic (1)
Armenian (1)
Assamese (1)
Avaric (1)
Chukchi (1)
English (1)
Englishu (1)
Even (1)
Evenki (1)
Gaelic (1)
Georgian (1)
Hebrew (1)
Hill Mari (1)
Ingrian (1)
Kalmyk; Oirat (1)
Kashmiri (1)
Kildin Sami (1)
Komi Zyrian (1)
Koryak (1)
Lak (1)
Under Negotiation (12)
ELRA_VAR (438)
ELRA_END_USER (229)
Proprietary (137)
CC - BY (95)
Under Negotiation (33)
GPL (32)
ELRA_EVALUATION (29)
CC - BY - SA (27)
CLARIN_RES (17)
CC - BY - NC - SA (15)
Other (10)
CC - BY - NC (5)
CLARIN_ACA - NC (5)
CLARIN_ACA (3)
CC - BY - ND (2)
LGPL (2)
BSD - Style (1)
CC - ZERO (1)
MS Commons - BY (1)
Commercial Use (452)
Attribution (79)
No Redistribution (37)
Evaluation Use (30)
Share Alike (21)
Inform Licensor (20)
Other (18)
No Derivatives (15)
Redeposit (6)
Nlp Applications (170)
Human Use (135)
Information Retrieval (117)
Text Mining (24)
Machine Translation (14)
Pos Tagging (12)
Linguistic Research (10)
Parsing (10)
Lemmatization (5)
Event Extraction (4)
Annotation (3)
Speech Synthesis (3)
Other (2)
Speech Analysis (2)
Web Services (2)
Lexicon Access (1)
Opinion Mining (1)
Spell Checking (1)
Text Generation (1)
Written Language (244)
Spoken Language (20)
Voice (17)
Body Gesture (12)
Facial Expression (11)
Sign Language (6)
Text/xml (25)
Text/plain (17)
Plain text (8)
Text/tsv (8)
Text / plain (4)
Text / xml (4)
Text (2)
20 (1)
MS Excel (1)
US- ASCII (1)
XML (1)
Rdf+xml (1)
Text/turtle (1)
Txt (1)
Wav (1)
Xml (1)
Environment (15)
General (14)
Labour legislation (10)
Health (6)
Law (5)
Communications (4)
Economics (4)
Energy (4)
Humanities (4)
Medicine (4)
Taxation (4)
Community law (3)
Finance (3)
Science (3)
Social affairs (3)
Social questions (3)
Accounting (2)
Civil law (2)
Computer science (2)
Documentation (2)
Economy (2)
Education (2)
Law_politics (2)
Marketing (2)
Movies (2)
Tariff policy (2)
Teaching (2)
Transport (2)
Wood industry (2)
Biodiversity (1)
Europarl (1)
General (1)
Legal news (1)
Medical History (1)
News (1)
Political (1)
Renewable energy (1)
Wikipedia (1)
Animal product (1)
Budget (1)
Camera (1)
Consumption (1)
Criminal law (1)
Defence (1)
Family (1)
Fisheries (1)
Food technology (1)
Foodstuff (1)
Geography (1)
Land transport (1)
Laws (1)
Management (1)
Physics (1)
Politics (1)
Prices (1)
Trade (1)
Transport policy (1)
1996-2011 (4)
1410-1681 (1)
1540-1750 (1)
1800-2000 (1)
1840 - 2013 (1)
1967-2008 (1)
1970-1989 (1)
1986-1994 (1)
2003 (1)
2011-2012 (1)
Early 1990s (1)
Years 2010-2011 (1)
Ca. 730–1710 (1)
Castilian (309)
Flemish (7)
Brazil (4)
Valencian (4)
Punjabi (3)
American English (2)
British English (2)
Legalese (2)
Mandarin Chinese (2)
Modern (1453-) (2)
American English (1)
American Finnish (1)
American Spanish (1)
American Spanish (1)
Australian (1)
British English (1)
European Spanish (1)
European Spanish (1)
Finland Swedish (1)
Indian English (1)
Mandarin Chinese (1)
Middle English (1)
Native Finnish (1)
New Zealand (1)
Read-aloud text (1)
Scottish (1)
Scottish English (1)
Scottish Gaelic (1)
Southern English (1)
Resource Type:
Corpus: | |
Lexical/Conceptual: | |
Tool/Service: | |
Language Description: |
Media Type:
Text: | |
Audio: | |
Image: | |
Video: | |
Text Numerical: | |
Text N-Gram: |
923 Language Resources (Page 1 of 47)
« Previous | Next »Order by:
ACCURAT balanced test corpus for under resourced languages
68
372
- Croatian
- English
- Estonian
- German
- Greek
- Latvian
- Lithuanian
- Romanian
- Slovenian
ACCURAT corpus of comparable sentences
79
436
- Croatian
- English
- Estonian
- German
- Greek
- Latvian
- Lithuanian
- Romanian
- Slovenian
ACCURAT corpus of Wikipedia texts
68
446
- Croatian
- English
- Estonian
- German
- Greek
- Latvian
- Lithuanian
- Romanian
- Slovenian
ACL RD-TEC: A Reference Dataset for Terminology Extraction and Classification Research in Computational Linguistics
0
243
- English