Baliabideak eta tresnak



  • Adafre S., Rijke M. (2006). Finding Similar Sentences across Multiple Languages in Wikipedia. Proceedings of the workshop on NEW TEXT Wikis and blogs and other dynamic text sources (pp. 61-69)
  • Auer S., Bizer C., Kobilarov G. Lehmann J., Cyganiak R., Ives Z. (2007) DBpedia: A Nucleus for a Web of Open Data. Lecture Notes in Computer Science, 2007, Volume 4825/2007, 722-735
  • Bangalore, S., Haffner, P., & Kanthak, S. (2007). Statistical Machine Translation through Global Lexical Selection and Sentence Reconstruction. Proceedings of the 45th Annual Meeting of the Association for Computational Linguistics (ACL) (pp. 152?159).
  • Barrière C. & Isabelle P. (2011). Searching parallel corpora for contextually equivalent terms. Proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; pp.105-112.
  • Belz A. & Kow E. (2011). Unsupervised alignment of comparable data and text resources. ACL 2011: Proceedings of the Fourth Workshop on Building and Using Comparable Corpora, on Portland, Oregon, USA, 24 June 2011; pp.102-109
  • Brown, P. F., Pietra, S. A. D., Mercer, R. L., & Pietra, V. J. D. (1993). The Mathematics of Statistical Machine Translation: Parameter Estimation. Computational Linguistics, 19(2), 263?311.
  • Carpuat, M., & Wu, D. (2007). Improving Statistical Machine Translation Using Word Sense Disambiguation. Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP) (pp. 61?72).
  • Cettolo M., Federico M., & Bertoldi N. (2010). Mining parallel fragments from comparable texts. Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, on Paris, France; pp.227-234
  • Cucerzan S. (2007) Large-Scale Name Entity Disambiguation Based on Wikipedia Data. Proceedings of the 2007 Joint Conference EMNLP and CoNLL; pp. 708-716.
  • España-Bonet, C., Giménez, J., & Màrquez L. (2009). Discriminative Phrase-Based Models for Arabic Machine Translation. ACM Transactions on Asian Language Information Processing Journal (TALIP), vol. 8, No. 4, pag. 1-20.
  • Fišer D., Ljubeši? N, Vintar S., & Pollak S (2011). Building and using comparable corpora for domain-specific bilingual lexicon extraction. ACL 2011: Proceedings of the Fourth Workshop on Building and Using Comparable Corpora, on Portland, Oregon, USA, 24 June 2011; pp.19-26
  • Giménez, J., & Màrquez, L. (2008). Discriminative Phrase Selection for Statistical Machine Translation. In Learning Machine Translation, NIPS Workshop series. MIT Press.
  • Gurrutxaga A, Leturia I, Pociello E, San Vicente I, Saralegi X. (2010) Exploiting the Internet to build language resources for less resourced languages. In: SALTMIL 2010 workshop. Valetta, Malta.
  • Hálek O., Rosa R., Tamchyna A., and Bojar O. (2011). Named Entities from Wikipedia for Machine Translation. In Markéta Lopatková, editor, ITAT 2011 Information Technologies - Applications and Theory, volume 788, pages 23-30
  • Hewavitharana S. & Vogel S. (2011). Extracting parallel phrases from comparable data. ACL 2011: Proceedings of the Fourth Workshop on Building and Using Comparable Corpora, on Portland, Oregon, USA, 24 June 2011; pp.61-68.
  • Hudík, T. & Ruopp, A. (2011). The integration of Moses into localization industry. Proceedings of the 15th conference of the European Association for Machine Translation (EAMT).
  • Jehl L., Osborne M. (2010) Machine Translation for Twitter. Linguistics and English Language Masters thesis collection. The University of Edinburgh.
  • Kaufmann J. and Kalita J. (2011) ?Syntactic normalization of twitter messages,? in International Conference on Natural Language Processing. (ICON 2011). New Delhi: McMillan, India, 2010, pp. 149?158
  • Koehn, P., Och, F. J., & Marcu, D. (2003). Statistical Phrase-Based Translation. Proceedings of the Joint Conference on Human Language Technology and the North American Chapter of the Association for Computational Linguistics (HLT-NAACL).
  • Krstovski K. & Smith D. (2011). A minimally supervised approach for detecting and ranking document translation pairs. Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, on UK, July 30-31, 2011; pp.207-216
  • Leturia I, San Vicente I, Saralegi X. (2009) Search engine based approaches for collecting domain-specific Basque-English comparable corpora from the Internet. In: 5th International Web as Corpus Workshop (WAC5). Donostia-San Sebastian, Spain.
  • Petrovic S., Osborne M., Lavrenko V. (2010). The Edinburgh Twitter corpus. In Proceedings of the NAACL HLT 2010 Workshop on Computational. Linguistics in a World of Social Media , pages 25?26.
  • Roturier, J. (2009). Deploying novel MT technology to raise the bar for quality: a review of key advantages and challenges. MT Summit XII: proceedings of the twelfth Machine Translation Summit, August 26-30, 2009.
  • X. Saralegi, I. San Vicente, M. López de Lacalle, 2008. Mining Term Translations from Domain Restricted Comparable Corpora. SEPLN 2008. Madrid
  • Stroppa, N., van den Bosch, A., & Way, A. (2007). Exploiting Source Similarity for SMT using Context-Informed Features. Proceedings of the 11th Conference on Theoretical and Methodolog-ical Issues in Machine Translation (TMI) (pp. 231?240).
  • Sun, Y., Li, J., & Li. Y. (2011) Deploying MT into a localisation workflow: pains and gains. MT Summit XIII: the Thirteenth Machine Translation Summit organized by the Asia-Pacific Association for Machine Translation (AAMT)
  • Vasi?jevs, A., Skadi?š, R. & Skadi?a, I. (2011). Towards application of user-tailored machine translation in localization. Proceedings of the Third Joint EM+/CNGL Workshop ?Bringing MT to the User: Research Meets Translators?
  • Vickrey, D., Biewald, L., Teyssier, M., & Collen, D. (2005). Word-Sense Disambiguation for Machine Translation.Proceedings of the Joint Conference on Human Language Technology and Empirical Methods in Natural Language Processing (HLT-EMNLP).
  • Vuli? I., De Smet W., & Moens M. (2011). Identifying word translations from comparable corpora using latent topic models. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Short papers, on Portland, Oregon, June 19-24, 2011; pp.479-484
  • Wu F., Hoffmann R., Weld D. (2008) Information extraction from Wikipedia: moving down the long tail. Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining


IXA taldea - EHU-UPV

Mikel Artetxe-k saria jaso du Bartzelonako HP Hackatoian 2017-Mar-01

Mintegia: Jarreren sailkapena sare sozialetan (A. Zubiaga, 2017-02-14) 2017-Ots-10

Mintegia: Denbora-egituren tratamendu automatikoa (B. Altuna, 2017-02-07) 2017-Ots-07

Mintegia: Itzulpen Automatiko Neuronala: TAdeep eta Modela proiektuak (G. Labaka, 2017-01-31) 2017-Urt-31

IXA-ko berri gehiago ikusi   

20/07/2016 - Anna Raboshchuck PhD Dissertation

PhD scholarship on Deep Learning applied to Speech Technologies

Joan Bruna & Oriol Vinyals Seminar on Deep Learning

27/04/2016 - Hermann Ney - Human Language Technology and Machine Learning

UPC-ko berri gehiago ikusi