Gero Corpus Historikoa
Short description:
Datasets for modernising historical Basque words
Authors (Ixa members):
Link (general):
Link (download):
Contact:
ixa@ehu.eus
Description:
Datasets for modernising historical Basque words.
The lexicons were automatically extracted from this corpus:
http://klasikoak.armiarma.eus/idazlanak/A/AxularGero.htm
Some paragraphs from this corpus were selected and annotated using BRAT.
Functionality:
Training/dev. corpus and test corpus:
- train-gero: training word-form lexicon for the Gero book (non-standard words only)
- train-gero-std: training word-form lexicon for the Gero book (including standard words, half of them)
- test-gero: test word-form lexicon for the Gero book (non-standard words only)
Innovation:
Selection and manual annotation
Ownership:
Ixa group
License:
CC BY
Notes:
These resources are available under CC BY: if you use them, please cite the reference above.