ISBNdun kongresua

Machine translation (MT) has benefited from using synthetic training data originating from translating monolingual corpora, a technique known as backtranslation. Combining backtranslated data from different sources has led to better results than when using such data in isolation. In this work we analyse the impact that data translated with rule-based, phrase-based statistical and neural MT systems has on new MT systems.

Gehiago irakurriSelecting Backtranslated Data from Multiple Sources for Improved Neural Machine Translation -ri buruz

Give your Text Representation Models some Love: the Case for Basque

Gehiago irakurriGive your Text Representation Models some Love: the Case for Basque -ri buruz

Conversational Question Answering in Low Resource Scenarios: A Dataset and Case Study for Basque

Gehiago irakurriConversational Question Answering in Low Resource Scenarios: A Dataset and Case Study for Basque -ri buruz

Detection of Reading Absorption in User-Generated Book Reviews: Resources Creation and Evaluation

To detect how and when readers are experiencing engagement with a literary work, we bring together empirical literary studies and language technology via focusing on the affective state of absorption. The goal of our resource development is to enable the detection of different levels of reading absorption in millions of user-generated reviews hosted on social reading platforms. We present a corpus of social book reviews in English that we annotated with reading absorption categories.

Gehiago irakurriDetection of Reading Absorption in User-Generated Book Reviews: Resources Creation and Evaluation -ri buruz

Linguistic Appropriateness and Pedagogic Usefulness of Reading Comprehension Questions

Automatic generation of reading comprehension questions is a topic receiving growing interest in the NLP community, but there is currently no consensus on evaluation metrics and many approaches focus on linguistic quality only while ignoring the pedagogic value and appropriateness of questions. This paper overcomes such weaknesses by a new evaluation scheme where questions from the questionnaire are structured in a hierarchical way to avoid confronting human annotators with evaluation measures that do not make sense for a certain question.

Gehiago irakurriLinguistic Appropriateness and Pedagogic Usefulness of Reading Comprehension Questions -ri buruz

Domain Adapted Distant Supervision for Pedagogically Motivated Relation Extraction

In this paper we present a relation extraction system that given a text extracts pedagogically motivated relation types, as a previousstep to obtaining a semantic representation of the text which will make possible to automatically generate questions for reading comprehension. The system maps pedagogically motivated relations with relations from ConceptNet and deploys Distant Supervisionfor relation extraction. We run a study on a subset of those relationships in order to analyse the viability of our approach.

Gehiago irakurriDomain Adapted Distant Supervision for Pedagogically Motivated Relation Extraction -ri buruz

Hizkuntzak

Nor gara?

Zer egiten dugu?

Beste batzuk

Exploring the Enrichment of Basque WordNet with a Sentiment Lexicon

Towards modeling SUMO attributes through WordNet adjectives: a Case Study on Qualities.

A Call for More Rigor in Unsupervised Cross-lingual Learning

On the cross-lingual transferability of monolingual representations

Selecting Backtranslated Data from Multiple Sources for Improved Neural Machine Translation