| | Past | Present | Future |
|---|---|---|---|
| Corpus | Raw text | 27 Mw (newspapers) | 100 Mw (2010) |
| | Word-forms tagged with POS and lemma | 27 Mw (newspapers); 30 Kw (hand corrected) | 100 Mw (2010); 2 Mw (hand corrected) |
| | Syntactically tagged text | 30 Kw | 1 Mw (2010) |
| | Semantically tagged text | 4 Kw (meanings) | 1 Mw (2010?), meanings and semantically analyzed |
| | Multilingual and parallel corpus | 1 Mw (Spanish-Basque) ... | 100 Mw (2010) |
| Lexicon | EDBL lexical database. Lexical support for constructing general applications, including POS and morphological information. | 80,000 entries. Enrichment of the lexical database: multiword lexical units, verb subcategorization. | Improving design. Enrichment of the lexical database: multiword lexical units, verb subcategorization, semantics. |
| | Machine-readable dictionaries | Machine-readable dictionaries | Machine-readable dictionaries |
| Morpho | Morphological description | | |
| Syntax | Syntax description | Syntax description: clause boundaries, postpositions, verb subcategorization, dependencies | Syntax description: broad coverage, different formalisms (Unification, CG, Dependency grammar) |
| Sem | Lexical-semantic multilingual knowledge base. Taxonomy of concepts (such as WordNet): BasqueWN | Automatic acquisition from corpora in other languages. Enriching & optimising BasqueWN. | Enriching & optimising EuskalWN: terms, entity names (100 K entries) |