Distância diacrónica automática entre variantes diatópicas do português e do espanhol

O objetivo deste trabalho é aplicar uma metodo- logia baseada na perplexidade, para calcular automa- ticamente a distância interlinguística entre diferentes períodos históricos de variantes diatópicas de idiomas.

Morfeus+: Word Parsing in Basque beyond Morphological Segmentation

This work describes the formalization of a word structure grammar that represents the complex morphological and morphosyntactic information embedded within the word forms of an agglutinative language (Basque), giving a comprehensive linguistic description of the main morphological phenomena, such as affixation, derivation, and composition, and also taking into account the modeling of both standard and non standard words. We have identified the relevant issues to be addressed in the representation of such a grammar.

Contextualized Translations of Phrasal Verbs with Distributional Compositional Semantics and Monolingual Corpora

This article describes a compositional distributional method to generate contextualized senses of words and identify their appropriate translations in the target language using monolingual corpora. Word translation is modeled in the same way as contextualization of word meaning, but in a bilingual vector space. The contextualization of meaning is carried out by means of distributional composition within a structured vector space with syntactic dependencies, and the bilingual space is created by means of transfer rules and a bilingual dictionary.

Measuring Language Distance of Isolated European Languages

Phylogenetics is a sub-field of historical linguistics whose aim is to classify a group of languages by considering their distances within a rooted tree that stands for their historical evolution. A few European languages do not belong to the Indo-European family or are otherwise isolated in the European rooted tree. Although it is not possible to establish phylogenetic links using basic strategies, it is possible to calculate the distances between these isolated languages and the rest using simple corpus-based techniques and natural language processing methods. The objective of this


RSS - Aldizkaria-rako harpidetza egin