Conversational Question Answering dataset in Basque
Aitor Agirre
Arantxa Otegi - arantza.otegi[abildua/at]ehu.eus
ElkarHizketak is a low-resource conversational Question Answering (QA) dataset in Basque created by volunteer Basque speakers. The dataset contains close to 400 dialogues and more than 1,600 questions and answers, and its small size presents a realistic low-resource scenario for conversational QA systems. The dataset is built on top of Wikipedia sections about popular people and organizations. Each dialogue involves two crowd workers: (1) a student, who asks questions after reading a short introduction about the person but without seeing the section text; and (2) a teacher, who answers each question by selecting a span of text from the section.
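The student/teacher setup described above implies that every answer is a span of the section text. The following is a minimal sketch of how such a dialogue could be represented, not the dataset's actual schema: the class names, field names, helper functions, and the English placeholder text are all assumptions made for illustration.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Turn:
    """One student question with the teacher's answer span (hypothetical layout)."""
    question: str
    answer_start: int  # character offset into the section text (inclusive)
    answer_end: int    # character offset into the section text (exclusive)


@dataclass
class Dialogue:
    """A dialogue about one Wikipedia section (hypothetical layout)."""
    intro: str          # short introduction shown to the student
    section_text: str   # full section, visible only to the teacher
    turns: List[Turn] = field(default_factory=list)


def make_turn(section_text: str, question: str, answer: str) -> Turn:
    """Build a turn by locating the answer string inside the section text."""
    start = section_text.find(answer)
    if start < 0:
        raise ValueError("answer must be a span of the section text")
    return Turn(question, start, start + len(answer))


def answer_text(dialogue: Dialogue, turn: Turn) -> str:
    """Recover the answer string from its character offsets."""
    return dialogue.section_text[turn.answer_start:turn.answer_end]


# Invented placeholder content, not taken from the dataset.
section = "Jane Doe was born in Bilbao. She founded an example organization in 1980."
dialogue = Dialogue(intro="Jane Doe is an example person.", section_text=section)
dialogue.turns.append(make_turn(section, "Where was she born?", "Bilbao"))
dialogue.turns.append(make_turn(section, "When did she found it?", "1980"))

for turn in dialogue.turns:
    print(turn.question, "->", answer_text(dialogue, turn))
```

Storing character offsets rather than free text keeps every answer verifiable against the section, which matches the teacher's span-selection role.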
Copyright (C) by Ixa Taldea, University of the Basque Country UPV/EHU
Creative Commons Attribution-ShareAlike 4.0 International Public License (CC BY-SA 4.0)