Eihera

Short description: 
Named entity recognizer and classifier for Basque
Contact: 
i.alegria abildua/at ehu puntu/dot es
Description: 
Eihera is a system for named entity recognition and classification in written Basque. The system was developed in four steps: first, a recognizer was built from linguistic information encoded in finite-state transducers; second, semi-automatically annotated corpora were generated from the output of these transducers; third, different machine learning (ML) techniques were trained on these corpora to obtain the best possible recognizer; and finally, the resulting recognizers were combined.
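The description does not say how the recognizers are combined. As an illustration only, the sketch below assumes token-level B/I/O tags and a simple majority vote; the function name combine_by_vote, the tag scheme, and the sample tag sequences are all hypothetical and not taken from Eihera.

```python
from collections import Counter


def combine_by_vote(predictions, fallback="O"):
    """Combine per-token tags from several recognizers by majority vote.

    predictions: list of tag sequences, one per recognizer (assumed format).
    fallback: tag used when no tag reaches a strict majority.
    """
    if not predictions:
        return []
    length = len(predictions[0])
    if any(len(p) != length for p in predictions):
        raise ValueError("all recognizers must tag the same tokens")

    combined = []
    for tags in zip(*predictions):
        top, top_count = Counter(tags).most_common(1)[0]
        # Accept the most frequent tag only if more than half agree.
        combined.append(top if top_count > len(tags) / 2 else fallback)
    return combined


# Hypothetical outputs of one rule-based and two ML recognizers.
rule_based = ["B", "I", "O", "O", "B"]
ml_a = ["B", "I", "O", "B", "B"]
ml_b = ["B", "O", "O", "O", "B"]

print(combine_by_vote([rule_based, ml_a, ml_b]))
# ['B', 'I', 'O', 'O', 'B']
```

Voting token by token can yield inconsistent sequences (for example an I tag with no preceding B), so a real combination step would likely need a repair pass or a vote over whole entity spans instead.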
Functionality: 
Eihera classifies the named entities into three classes: person, organization and location.
Technology: 
Finite-state technology and machine learning.
Modules: 
Rule-based recognition, ML-based recognition, rule-based classification, and ML-based classification. Eustagger runs beforehand as a preprocessing step.
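To make the split between recognition and classification concrete, here is a minimal sketch of how the two stages might fit together. Everything in it is an assumption for illustration: the B/I/O tag scheme, the names EntityClass, spans_from_bio and classify, and the toy lexicon lookup that stands in for a rule-based classifier. The lemmas are written by hand here, standing in for whatever the preprocessing step provides.

```python
from enum import Enum


class EntityClass(Enum):
    PERSON = "person"
    ORGANIZATION = "organization"
    LOCATION = "location"


def spans_from_bio(tags):
    """Return (start, end) token spans, end exclusive, from B/I/O tags."""
    spans, start = [], None
    for i, tag in enumerate(tags):
        if tag == "B" or (tag == "I" and start is None):
            if start is not None:
                spans.append((start, i))  # close the previous entity
            start = i
        elif tag == "O" and start is not None:
            spans.append((start, i))
            start = None
    if start is not None:
        spans.append((start, len(tags)))
    return spans


def classify(lemmas, span, lexicon):
    """Toy stand-in for classification: look up the lemmatized span."""
    key = " ".join(lemmas[span[0]:span[1]])
    return lexicon.get(key)  # None when the entity is unknown


# Hypothetical input: "Miren Agirre Bilbon bizi da" (lives in Bilbao).
tokens = ["Miren", "Agirre", "Bilbon", "bizi", "da"]
lemmas = ["Miren", "Agirre", "Bilbo", "bizi", "izan"]
tags = ["B", "I", "B", "O", "O"]  # assumed output of the recognition stage

TOY_LEXICON = {
    "Miren Agirre": EntityClass.PERSON,
    "Bilbo": EntityClass.LOCATION,
}

entities = [
    (" ".join(tokens[s:e]), classify(lemmas, (s, e), TOY_LEXICON))
    for s, e in spans_from_bio(tags)
]
print(entities)
```

Looking up lemmas rather than surface forms matters for an inflected language such as Basque: the surface form "Bilbon" would not match a lexicon entry for "Bilbo".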
Innovation: 
It is the first NERC system for Basque.
Development: 
Several projects funded by the Basque Government and the Spanish R&D agency.