Short description:
A dataset for accessing domain-specific FAQs via conversational QA
Authors (non-Ixa members):
Jan Deriu, Mark Cieliebak
Jon Ander Campos jonander.campos[abildua/at]

DoQA is a dataset for accessing domain-specific FAQs via conversational
QA. It contains 1,637 information-seeking dialogues in the cooking
domain (7,329 questions in total). Note that we use FAQs as a generic
concept that also covers Community Question Answering sites, as well as
corporate information in intranets that is maintained in textual form
similar to FAQs, often referred to as internal “knowledge bases”.

These dialogues are created by crowd workers who play one of the
following two roles: the user, who asks questions about a certain
cooking topic posted on Stack Exchange, and the domain expert, who
replies to the questions by selecting a short span of text from the
long textual reply in the original post. The expert can rephrase the
selected span to make it sound more natural.
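
As an illustration of the expert role described above, the following is a minimal Python sketch of how one dialogue turn could be represented: a question, a character span selected from the reply text, and an optional rephrasing. The class name `Turn`, its fields, and the sample passage are hypothetical and chosen only for this example; they do not reflect the actual file format of the DoQA release.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Turn:
    """One user question with the expert's selected span (hypothetical layout)."""
    question: str
    span_start: int                  # character offset into the reply text
    span_end: int                    # exclusive end offset
    rephrased: Optional[str] = None  # expert's optional rewording of the span

    def answer(self, passage: str) -> str:
        # Prefer the rephrased answer if the expert provided one,
        # otherwise return the selected span verbatim.
        if self.rephrased is not None:
            return self.rephrased
        return passage[self.span_start:self.span_end]


# Invented sample reply text, not taken from the dataset.
passage = (
    "Let the dough rest for an hour. "
    "Resting relaxes the gluten and makes rolling easier."
)
selected = "Resting relaxes the gluten"
start = passage.find(selected)

verbatim = Turn("Why should I rest the dough?", start, start + len(selected))
reworded = Turn(
    "Why should I rest the dough?",
    start,
    start + len(selected),
    rephrased="Because resting relaxes the gluten.",
)

print(verbatim.answer(passage))
print(reworded.answer(passage))
```

Storing offsets rather than only the answer string keeps each answer traceable to the original post, while the optional rephrasing holds the more natural wording.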

DoQA enables the development and evaluation of conversational QA
systems that help users access the knowledge buried in domain specific

Copyright (C) by Ixa Taldea, University of the Basque Country UPV/EHU
Creative Commons Attribution-ShareAlike 4.0 International Public License (CC BY-SA 4.0)