References


Aduriz I., Alegria I., Artola X., Ezeiza N., Sarasola K. 1997. A spelling corrector for Basque based on morphology. Literary & Linguistic Computing, Vol. 12, No. 1, 31-38. Oxford University Press, Oxford. ISSN 0268-1145

Agirre E., Aldezabal I., Alegria I., Arregi X., Arriola J., Artola X., Díaz de Ilarraza A., Ezeiza N., Gojenola K., Sarasola K., Soroa A. 2001. Developing Language Technology for a Minority Language: Progress and Strategy. ELSNews. 10.1 www.elsnet.org/publications/elsnews/10.1.pdf

Agirre E., Atutxa A., Gojenola K., Sarasola K. 2004b. Exploring portability of syntactic information from English to Basque. Proceedings of the 4th International Conference on Language Resources and Evaluation (LREC). Lisbon, Portugal.

Agirre E., Atutxa A., Gojenola K., Sarasola K., Terrón Robles D. 2005a. PP Attachment for Basque based on English parses. Proceedings of the Workshop on Cross-lingual Knowledge Induction. Eurolan 2005. Romania

Agirre E., Lopez de Lacalle Lekuona O. 2004a. Clustering WordNet Word Senses. Nicolas Nicolov, Kalina Bontcheva, Galia Angelova and Ruslan Mitkov (eds.). Recent Advances in Natural Language Processing III: Selected Papers from RANLP 2003. John Benjamins Publishers, Amsterdam. A longer version appears in Proceedings of the Conference on Recent Advances on Natural Language Processing (RANLP'03). Bulgaria.

Agirre E., Lopez de Lacalle Lekuona O. 2004c. Publicly available topic signatures for all WordNet nominal senses. Proceedings of the 4th International Conference on Language Resources and Evaluation (LREC). Lisbon, Portugal.

Agirre E., Lopez de Lacalle Lekuona O., Martinez D. 2005b. Exploring feature spaces with SVD and unlabeled data for Word Sense Disambiguation. Proceedings of the Conference on Recent Advances on Natural Language Processing (RANLP'05). Bulgaria.

Al-Onaizan, Y. and K. Knight. 2002. "Machine transliteration of names in Arabic text." In Proceedings of the ACL-02 workshop on Computational approaches to Semitic languages, pp. 1-13.

Albrecht, J., & Hwa, R. (2007a). A Re-examination of Machine Learning Approaches for Sentence-Level MT Evaluation. Proceedings of the 45th Annual Meeting of the Association for Computational Linguistics (ACL) (pp. 880-887).

Albrecht, J., & Hwa, R. (2007b). Regression for Sentence-Level MT Evaluation with Pseudo References. Proceedings of the 45th Annual Meeting of the Association for Computational Linguistics (ACL) (pp. 296-303).

Alegria I., Aranzabe M., Ezeiza A., Ezeiza N., Urizar R. 2002. Using Finite State Technology in Natural Language Processing of Basque. Implementations and Applications of Automata, pp. 1-12. 2001. LNCS 2494. ISBN 3-540-00400-9

Alegria I., Arregi X., Artola X., Díaz de Ilarraza A., Labaka G., Lersundi M., Mayor A., Sarasola K. 2008. Strategies for sustainable MT for Basque: incremental design, reusability, standardization and open-source. Proceedings of the IJCNLP-08 Workshop on NLP for Less Privileged Languages. pages 59-64. Asian Federation of Natural Language Processing, Hyderabad, India, http://www.aclweb.org/anthology-new/I/I08/I08-3000.pdf

Alegria I., Artola X., Sarasola K. 1996. Automatic morphological analysis of Basque. Literary & Linguistic Computing, Vol. 11, No. 4, 193-203. Oxford University Press, Oxford.

Alegria I., Casillas A., Diaz de Ilarraza A., Igartua J., Labaka G., Lersundi M., Mayor A., Sarasola K. 2008. Spanish-to-Basque MultiEngine Machine Translation for a Restricted Domain. Proc. of the 8th Conference of the Association for Machine Translation in the Americas (AMTA-2008).

Alegria I., Díaz de Ilarraza A., Labaka G., Lersundi M., Mayor A., Sarasola K. 2005. Itzulpen automatikoa IXA taldean. Ordenagailuz Lagundutako itzulpena. EHUko Udako Ikastaroak 2005

Alegria I., Díaz de Ilarraza A., Labaka G., Lersundi M., Mayor A., Sarasola K. 2005b. An FST grammar for verb chain transfer in a Spanish-Basque MT System. FSMNLP Finite-State Methods and Natural Language Processing (poster session)

Alegria I., Díaz de Ilarraza A., Labaka G., Lersundi M., Mayor A., Sarasola K. 2007. Transfer-based MT from Spanish into Basque: reusability, standardization and open source. LNCS 4394. 374-384. Cicling 2007.

Alegria I., Díaz de Ilarraza A., Labaka G., Lersundi M., Mayor A., Sarasola K., M. Forcada, S. Ortiz, L. Padró 2005a. An Open Architecture for Transfer-based Machine Translation between Spanish and Basque. MT Summit. Workshop in Open Source MT

Alegria I., Gurrutxaga A., Lizaso P., Saralegi X., Ugartetxea S. & Urizar R. 2004. "A XML-Based Term Extraction Tool for Basque." In LREC 2004: 4th International Conference on Language Resources and Evaluation. [on line] http://ixa.si.ehu.es/Ixa/Argitalpenak/Artikuluak/1078851980/publikoak/Erauzterm_LREC

Alegria I., Gurrutxaga A., Lizaso P., Saralegi X., Ugartetxea S. & Urizar R. 2004. "Linguistic and Statistical Approaches to Basque Term Extraction." In GLAT-2004: The Production Of Specialized Texts. [on line] [05-06-10] <http://ixa.si.ehu.es/Ixa/Argitalpenak/Artikuluak/1079630425/publikoak/Term_Erauzketa.pdf>

Alegria Iñaki, Bertol Arrieta, Xavier Carreras, Arantza Díaz de Ilarraza, Larraitz Uria 2008. Chunk and Clause Identification for Basque by Filtering and Ranking with Perceptrons. SEPLN-2008 (Leganés, Madrid). Revista del procesamiento del lenguaje natural, No. 41, pp. 5-12; Sept. 2008; ISSN: 1135-5948

Alegria, I., Areta, N., Artola, X., Diaz de Ilarraza, A., Ezeiza, N., Gurrutxaga, A., Leturia, I., Saiz, R., Sologaistoa, A., Soroa, A., & Valverde, A. 2005. "Zientzia eta Teknologiaren Corpusa." In Mendebalde Kultur Alkartea, IX. Jardunaldiak: Euskera zientifiko-teknikoa. Bilbao.

Alegria I., Gurrutxaga A., Pagoaga A., Saralegi X., & Ugartetxea S. 2005. "Itzulpen-memorietatik terminologia elebiduna erauzteko teknikak." In Ordenagailuz Lagundutako itzulpena. San Sebastián: Cursos de Verano de la UPV [on line] [06-01-16] <http://www.elhuyar.org/ehu_ikastaroa/>

Alegria I., Gurrutxaga A., Saralegi X., & Ugartetxea S. 2005. "Erauzterm: euskarazko terminoak erauzteko tresna erdiautomatikoa." In Mendebalde Kultur Alkartea, IX. Jardunaldiak: Euskera zientifiko-teknikoa. Bilbao.

Alshawi, H. (1996). Head automata and bilingual tiling: Translation with minimal representations (invited talk). Proceedings of the Joint 16th International Conference on Computational Linguistics and the 34th Annual Meeting of the Association for Computational Linguistics (COLING-ACL) (pp. 167-176).

Alshawi, H., Bangalore, S., & Douglas, S. (1998). Automatic Acquisition of Hierarchical Transduction Models for Machine Translation. Proceedings of the Joint 17th International Conference on Computational Linguistics and the 36th Annual Meeting of the Association for Computational Linguistics (COLING-ACL) (pp. 41-47).

Amigó, Enrique, Jesús Giménez, Julio Gonzalo and Lluís Márquez. MT Evaluation: Human-like vs. Human Acceptable. Proceedings of COLING-ACL 2006.

Amigó, Enrique, Julio Gonzalo, Anselmo Peñas, Felisa Verdejo. QARLA: A Framework for the Evaluation of Text Summarization Systems. (ACL'05).

Aranzabe M., Arriola J.M., Díaz de Ilarraza A. 2004. Towards a Dependency Parser of Basque. Proceedings of the Coling 2004 Workshop on Recent Advances in Dependency Grammar. Geneva, Switzerland.

Babych, Bogdan and Tony Hartley. Extending the BLEU MT Evaluation Method with Frequency Weightings. (ACL '04).

Banerjee, S., Alon Lavie. METEOR: An Automatic Metric for MT Evaluation with Improved Correlation with Human Judgments. (ACL'05 Workshop on Intrinsic and Extrinsic Evaluation Measures for Machine Translation and/or Summarization).

Bangalore, S., Haffner, P., & Kanthak, S. (2007). Statistical Machine Translation through Global Lexical Selection and Sentence Reconstruction. Proceedings of the 45th Annual Meeting of the Association for Computational Linguistics (ACL) (pp. 152-159).

Baroni, M., Bernardini, S. (2004). BootCaT: Bootstrapping corpora and terms from the web. In Proceedings of LREC 2004. Lisbon, Portugal: ELDA, pp. 1313-1316.

Bengoetxea K., Gojenola K. 2007. Desarrollo de un analizador sintáctico estadístico basado en dependencias para el euskera. XXIII Congreso de la SEPLN.

Boas, Hans C. 2002. Bilingual FrameNet dictionaries for machine translation. In Proceedings of LREC 2002, pages 1364-1371, Las Palmas, Canary Islands.

Broder, A.Z. (2000). Identifying and filtering near-duplicate documents. In Proceedings of Combinatorial Pattern Matching: 11th Annual Symposium. Montreal, Canada: Springer, pp. 1-10.

Brown P., John Cocke, Stephen Della Pietra, Vincent Della Pietra, Fred Jelinek, Robert Mercer, and Paul Roossin. 1990. A statistical approach to machine translation. Computational Linguistics 16:79-85.

Brown, P. F., Pietra, S. A. D., Pietra, V. J. D., & Mercer, R. L. (1991a). A statistical approach to sense disambiguation in machine translation. HLT '91: Proceedings of the workshop on Speech and Natural Language (pp. 146-151). Morristown, NJ, USA: Association for Computational Linguistics.

Brown, P. F., Pietra, S. A. D., Pietra, V. J. D., & Mercer, R. L. (1991b). Word-sense Disambiguation Using Statistical Methods. Proceedings of the 29th annual meeting on Association for Computational Linguistics (pp. 264-270). Morristown, NJ, USA: Association for Computational Linguistics.

Brown, R. D. "Example-Based Machine Translation in the Pangloss System". In Proceedings of the 16th International Conference on Computational Linguistics (COLING-96), pp. 169-174. Copenhagen, Denmark, August 5-9, 1996.

Burbank, A., M. Carpuat, S. Clark, M. Dreyer, P. Fox, D. Groves, K. Hall, M. Hearne, D. Melamed, Y. Shen, A. Way, B. Wellington, and D. Wu (2005) "Final Report of the 2005 Language Engineering Workshop on Statistical Machine Translation by Parsing", http://www.clsp.jhu.edu/ws2005/groups/statistical/documents/finalreport.pdf .

Cabezas, C., & Resnik, P. (2005). Using WSD Techniques for Lexical Selection in Statistical Machine Translation (CS-TR-4736/LAMP-TR-124/UMIACS-TR-2005-42) (Technical Report). University of Maryland, College Park. http://lampsrv01.umiacs.umd.edu/pubs/TechReports/LAMP_124/LAMP_124.pdf.

Carl, M. & Way, A. (Eds.) Recent Advances in Example-Based Machine Translation. Springer. 2003.

Carpuat, M. and Dekai Wu. Word Sense Disambiguation vs. Statistical Machine Translation. In Proceedings of the 43rd Annual Conference of the Association for Computational Linguistics (ACL'05), Ann Arbor, MI, USA, 2005.

Carpuat, M., & Wu, D. (2005a). Evaluating the Word Sense Disambiguation Performance of Statistical Machine Translation. Proceedings of the Second International Joint Conference on Natural Language Processing (IJCNLP) (pp. 122-127).

Carpuat, M., & Wu, D. (2007a). How Phrase Sense Disambiguation outperforms Word Sense Disambiguation for Statistical Machine Translation. Proceedings of the 11th Conference on Theoretical and Methodological Issues in Machine Translation (TMI).

Carpuat, M., & Wu, D. (2007b). Improving Statistical Machine Translation Using Word Sense Disambiguation. Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP) (pp. 61-72).

Carpuat, M., Shen, Y., Xiaofeng, Y., & Wu, D. (2006). Toward Integrating Semantic Processing in Statistical Machine Translation. Proceedings of the International Workshop on Spoken Language Translation (IWSLT) (pp. 37-44).

Carreras Xavier, Lluís Màrquez, Vasin Punyakanok and Dan Roth. Learning and Inference for Clause Identification. 13th European Conference on Machine Learning (ECML'02). Helsinki, Finland. August 2002.

Carreras, X. and Lluís Màrquez. Introduction to the CoNLL-2005 Shared Task: Semantic Role Labelling. In Proceedings of CoNLL-2005, Ann Arbor, Michigan, USA.

Carreras, X. and Lluís Màrquez. Introduction to the CoNLL-2004 Shared Task: Semantic Role Labelling. In Proceedings of CoNLL-2004, Boston, USA.

Carreras, X., Chao, I., Padró, L., & Padró, M. (2004). Freeling: An open-source suite of language analyzers. Proceedings of the 4th LREC.

Carreras, X., Màrquez, L., & Castro, J. (2005). Filtering-ranking perceptron learning for partial parsing. Machine Learning, 59, 1-31.

Casillas A. 2000. Explotación de corpus alineados para el desarrollo de entornos de composición de documentos estructurados bilingües. Ph.D. Thesis. Universidad de Deusto.

Casillas A., I. Fernández, R. Martínez 2004. Sentence alignment for Spanish-Basque bitexts: word correspondences vs. markup similarity. Fifth International Conference on Intelligent Text Processing and Computational Linguistics - CICLing.

Chakrabarti Soumen, Martin van den Berg, Byron Dom: Focused Crawling: A New Approach to Topic-Specific Web Resource Discovery. Computer Networks 31(11-16): 1623-1640 (1999).

Chan, Y. S., Ng, H. T., & Chiang, D. (2007). Word Sense Disambiguation Improves Statistical Machine Translation. Proceedings of the 45th Annual Meeting of the Association for Computational Linguistics (ACL) (pp. 33-40).

Chen, J., Nie, J.-Y. (2000). Parallel Web Text Mining for Cross-Language IR. In Proceedings of RIAO-2000.

Chiang, D. (2005). A Hierarchical Phrase-based Model for Statistical Machine Translation. Proceedings of the 43rd Annual Meeting of the Association for Computational Linguistics (ACL) (pp. 263-270).

Chiang, D. (2007). Hierarchical Phrase-based Translation. Computational Linguistics, 33(2), 201-228

Chklovski, T., Mihalcea, R., Pedersen, T., & Purandare, A. (2004). The Senseval-3 Multilingual English-Hindi lexical sample task. Senseval-3: Third International Workshop on the Evaluation of Systems for the Semantic Analysis of Text (pp. 5-8). Barcelona, Spain: Association for Computational Linguistics.

Collins, M., Koehn, P., & Kucerova, I. (2005). Clause Restructuring for Statistical Machine Translation. Proceedings of the 43rd Annual Meeting of the Association for Computational Linguistics (ACL).

Corbí-Bellot, A. M., Mikel L. Forcada, Sergio Ortiz-Rojas, Juan Antonio Pérez-Ortiz, Gema Ramírez-Sánchez, Felipe Sánchez-Martínez, Iñaki Alegria, Aingeru Mayor, Kepa Sarasola 2005. An Open-Source Shallow-Transfer Machine Translation Engine for the Romance Languages of Spain. Proceedings of the EAMT 2005.

Corston-Oliver, S., Gamon, M., & Brockett, C. (2001). A Machine Learning Approach to the Automatic Evaluation of Machine Translation. Proceedings of the 39th Annual Meeting of the Association for Computational Linguistics (ACL) (pp. 140-147).

Cowan, B., Kucerova, I., & Collins, M. (2006). A Discriminative Model for Tree-to-Tree Translation. Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP).

Crego, J. M., de Gispert, A., Lambert, P., Costa-jussà, M. R., Khalilov, M., Banchs, R., Mariño, J. B., & Fonollosa, J. A. R. (2006). N-gram-based SMT System Enhanced with Reordering Patterns. Proceedings of the HLT-NAACL Workshop on Statistical Machine Translation (pp. 162-165).

Daille, Béatrice and Emmanuel Morin. 2005. "French-English Terminology Extraction from Comparable Corpora". IJCNLP 2005: 707-718.

Díaz de Ilarraza A., Igartua J., Sarasola K., Sologaistoa A., Casillas A., Martinez R. 2007. Spanish-Basque Parallel Corpus Structure: Linguistic Annotations and Translation Units. Proceedings of TSD 2007 Conference (Plzen, Czech Republic).

Díaz de Ilarraza A., Labaka G., Sarasola K. 2008. Statistical Post-Editing: A Valuable Method in Domain Adaptation of RBMT Systems. MATMT2008 workshop: Mixing Approaches to Machine Translation.

Díaz de Ilarraza A., Mayor A., Sarasola K. 2000. Building a Lexicon for an English-Basque Machine Translation System from Heterogeneous Wide-Coverage Dictionaries. MT 2000. University of Exeter, United Kingdom: 19-22 November 2000.

Díaz de Ilarraza A., Mayor A., Sarasola K. 2000. Reusability of Wide-Coverage Linguistic Resources in the Construction of a Multilingual Machine Translation System. MT 2000. University of Exeter, United Kingdom: 19-22 November 2000.

Díaz de Ilarraza A., Mayor A., Sarasola K. 2001. Inclusión del par castellano-euskara en un prototipo de traducción automática multilingue. Proceedings of the second international workshop on Spanish Language Processing and Language Technologies. Jaén 14-15 September 2001.

Díaz de Ilarraza A., Mayor A., Sarasola K. 2002. Adibideetan Oinarritutako Itzulpen Automatikoa (AOIA): azterketa bibliografikoa eta euskararekin lan egiteko proposamenak. Senez aldizkaria. ISBN: 847086287

Díaz de Ilarraza A., Mayor A., Sarasola K., Loinaz M., Chevreau K., Coch J. 2001. MultiMeteok euskaraz ere badaki. Elhuyar Zientzia eta Teknologia, No. 170, pp. 42-49; www.zientzianet.com/artikulua.asp?Artik_kod=4022

Doddington, G. Automatic Evaluation of Machine Translation Quality Using N-gram Co-Occurrence Statistics. Proc. of the 2nd International Conference on Human Language Technology, 2002.

Doyon J., C. Doran, C. Means, D. Parr. Automated Machine Translation Improvement Through Post-Editing Techniques: Analyst and Translator Experiments. AMTA 2008.

Dugast, L., Senellart, J., & Koehn, P. (2007). Statistical post-editing on SYSTRAN's rule-based translation system. ACL 2007: Proceedings of the Second Workshop on Statistical Machine Translation, June 23, 2007, Prague, Czech Republic; pp. 220-223.

Elming, J. (2006). Transformation-based correction of rule-based MT. 11th Annual Conference of the European Association for Machine Translation, Oslo, Norway

España-Bonet, Cristina, Jesús Giménez and Lluís Márquez. The UPC-lsi Discriminative Phrase Selection System: NIST MT Evaluation 2008. Proceedings of the 2008 NIST Open Machine Translation Evaluation Workshop.

Fairon Cédrick, Macé Kévin, Naets Hubert. GlossaNet 2: a linguistic search engine for RSS-based corpora. In: Proceedings of LREC 2008, Workshop WAC4, Marrakesh, 2008.

Farwell, David, Jesús Giménez, Edgar González, Reda Halkoum, Horacio Rodríguez and Mihai Surdeanu. The UPC System for Arabic-to-English Entity Translation. Proceedings of ACE 2007.

Fellbaum, C. (Ed.). (1998). WordNet. An Electronic Lexical Database. The MIT Press.

Ferrer i Cancho, R. (2004). The Euclidean distance between syntactically linked words. Physical Review E 70, 056135.

Ferrer i Cancho, R. (2006). Why do syntactic links not cross? Europhysics Letters 76 1228-1235.

Frederking R. and Sergei Nirenburg. 1994. Three heads are better than one. Proceedings of the fourth ANLP

Fung P. A Pattern Matching Method for Finding Noun and Proper Noun Translations from Noisy Parallel Corpora. ACL 1995: 236-243.

Fung, P. (1995) "Compiling Bilingual Lexicon Entries from a Non-Parallel English-Chinese Corpus." In Proceedings of the Third Workshop on Very Large Corpora, pp. 173-183, Boston, Massachusetts.

Fung, P. and Lo Yuen Yee (1998) "An IR Approach for Translating New Words from Nonparallel Comparable Texts." In COLING-ACL 1998: 414-420.

Galley, M., Graehl, J., Knight, K., Marcu, D., DeNeefe, S., Wang, W., & Thayer, I. (2006). Scalable Inference and Training of Context-Rich Syntactic Translation Models. Proceedings of the Joint 21st International Conference on Computational Linguistics and the 44th Annual Meeting of the Association for Computational Linguistics (COLING-ACL) (pp. 961-968).

Gamallo P. (2008) "Evaluating two different methods for the task of extracting bilingual lexicons from comparable corpora". In Proceedings of LREC 2008 Workshop on Comparable Corpora, Marrakech, Morocco, pp. 19-26. ISBN: 2-9517408-4-0.

Gamon, M., Anthony Aue and Martine Smets. Sentence-level MT evaluation without reference translations: Beyond language modeling. (EAMT '05).

Gildea, D. (2003). Loosely Tree-Based Alignment for Machine Translation. Proceedings of the 41st Annual Meeting of the Association for Computational Linguistics (ACL).

Gildea, D. (2004). Dependencies vs. Constituents for Tree-Based Alignment. Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP).

Gildea, D. and Daniel Jurafsky. Automatic Labelling of Semantic Roles. Computational Linguistics, 28(3):245-288, 2002.

Giménez J., Empirical Machine Translation and its Evaluation. PhD thesis, LSI program on Artificial Intelligence, Universitat Politècnica de Catalunya, July 2008.

Giménez, Jesús and Enrique Amigó. IQMT: A Framework for Automatic Machine Translation Evaluation. Proceedings of the 5th International Conference on Language Resources and Evaluation (LREC'06). Genoa, Italy, 22-28 May. 2006.

Giménez, Jesús and Lluís Márquez (2005). Combining Linguistic Data Views for Phrase-based SMT. ACL Workshop on ``Building and Using Parallel Texts: Data-Driven Machine Translation and Beyond''.

Giménez, Jesús and Lluís Márquez (2007a). Context-aware Discriminative Phrase Selection for Statistical Machine Translation. Proceedings of WMT 2007 (ACL'07).

Giménez, Jesús and Lluís Márquez (2007b). Linguistic Features for Automatic Evaluation of Heterogeneous MT Systems. Proceedings of WMT 2007 (ACL'07).

Giménez, Jesús and Lluís Márquez (2008a). Heterogeneous Automatic MT Evaluation Through Non-Parametric Metric Combinations. Proceedings of IJCNLP 2008.

Giménez, Jesús and Lluís Márquez (2008b). Towards Heterogeneous Automatic MT Error Analysis. Proceedings of the 6th International Conference on Language Resources and Evaluation (LREC'08).

Giménez, Jesús and Lluís Márquez (2008c). A Smorgasbord of Features for Automatic MT Evaluation. Proceedings of the 3rd ACL Workshop on Statistical Machine Translation (shared evaluation task), 2008.

Giménez, Jesús and Lluís Márquez (2008d). Discriminative Phrase Selection for Statistical Machine Translation. In Learning Machine Translation. NIPS Workshop Series. MIT Press, 2008.

Giménez, Jesús and Lluís Márquez. Combining Linguistic Data Views for Phrase-based SMT. ACL Workshop on ``Building and Using Parallel Texts: Data-Driven Machine Translation and Beyond''. Ann Arbor, Michigan, USA, June 29-30, 2005.

Giménez, Jesús and Lluís Màrquez. The LDV-COMBO system for SMT. Proceedings of the NAACL 2006 Workshop on Statistical Machine Translation.

Giménez, Jesús and Lluís Márquez. Low-cost Enrichment of Spanish WordNet with Automatically Translated Glosses: Combining General and Specialized Models. Proceedings of COLING-ACL 2006.

Giménez, Jesús, Enrique Amigó and Chiori Hori. Machine Translation Evaluation Inside QARLA. In Proceedings of the International Workshop on Spoken Language Technology (IWSLT'05). Pittsburgh, PA, USA October 24-25 2005.

Giménez, Jesús, Lluís Márquez and German Rigau. Automatic Translation of WordNet Glosses. Eurolan Cross-Language Knowledge Induction Workshop. Cluj-Napoca, Romania, July 25 - August 5, 2005.

Giménez, Jesús. IQMT: A Framework for Automatic Machine Translation Evaluation based on Human Likeness. Technical Report LSI-07-29-R.

Giménez, Jesús. Rich Linguistic Knowledge for Empirical Machine Translation. PhD Thesis Project. LSI Department, Technical University of Catalonia, 2005.

Groves, D. and A. Way. 2005. Hybrid Example-Based SMT: the Best of Both Worlds? In Proceedings of ACL 2005 Workshop on Building and Using Parallel Texts: Data-Driven Machine Translation and Beyond, Ann Arbor, MI (to appear).

Hewavitharana Sanjika and Stephan Vogel: Enhancing a Statistical Machine Translation System by using an Automatically Extracted Parallel Corpus from Comparable Sources, LREC 2008 Workshop on Comparable Corpora, Marrakech, Morocco, May 2008.

Huang Fei and Kishore Papineni. 2007. Hierarchical system combination for machine translation. Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, pp. 277-286.

Hutchins, J. and F. Gaspari. Online and free! Ten years of online machine translation: origins, developments, current use and future prospects. MT Summit XI, Copenhagen, Denmark, 10-14 September 2007, Proceedings; pp. 199-206.

Hutchins, J. Towards a definition of example-based machine translation. Second Workshop on Example-based Machine Translation [at] MT Summit X, 16 September 2005.

Isabelle, P., Goutte, C., & Simard, M. (2007). Domain adaptation of MT systems through automatic post-editing. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. pp. 255-261.

Ixa taldea 2005. Itzultzaile automatiko bat software librean. Erabili.com: www.erabili.com/zer_berri/berriak/1111148758/

Jin, P., Wu, Y., & Yu, S. (2007). SemEval-2007 Task 05: Multilingual Chinese-English Lexical Sample. Proceedings of the Fourth International Workshop on Semantic Evaluations (SemEval-2007) (pp. 19-23). Prague, Czech Republic: Association for Computational Linguistics.

Kauchak, D., & Barzilay, R. (2006). Paraphrasing for Automatic Evaluation. Proceedings of the Joint Conference on Human Language Technology and the North American Chapter of the Association for Computational Linguistics (HLT-NAACL) (pp. 455-462).

Kilgarriff Adam, Gregory Grefenstette: Introduction to the Special Issue on the Web as Corpus. Computational Linguistics 29(3): 333-348 (2003)

Knight, K. (1999). Decoding complexity in word-replacement translation models. Computational Linguistics, 25.

Koehn P. 2003. Europarl: A Multilingual Corpus for Evaluation of Machine Translation. Technical report, http://people.csail.mit.edu/-people/koehn/publications/europarl/.

Koehn Philipp. Statistical Significance Tests for Machine Translation Evaluation. In Proceedings of the 2004 Conference on Empirical Methods in Natural Language Processing (EMNLP 2004), Barcelona, Spain, pp. 388-395.

Koehn, P. (2004). Pharaoh: a beam search decoder for phrase-based statistical machine translation models. Proceedings of AMTA.

Koehn, P., & Knight, K. (2002). Chunkmt: Statistical machine translation with richer linguistic knowledge. Draft.

Kulesza, A., & Shieber, S. M. (2004). A learning approach to improving sentence-level MT evaluation. Proceedings of the 10th International Conference on Theoretical and Methodological Issues in Machine Translation (TMI) (pp. 75-84).

Labaka G., Stroppa N., Way A., Sarasola K. 2007. Comparing Rule-Based and Data-Driven Approaches to Spanish-to-Basque Machine Translation. MT-Summit XI, Copenhagen

Labaka Gorka, Arantza Díaz de Ilarraza, Kepa Sarasola 2008. Descripción de los sistemas presentados por IXA-EHU a la evaluación ALBAYCIN'08. V Jornadas en Tecnología del Habla. Red Temática de Tecnologías del Habla. pp. 93-95. Bilbao. ISBN: 978-84-9860-169-5

Lambert, Patrik, Jesús Giménez, Marta R. Costa-jussá, Enrique Amigó, Rafael E. Banchs, Lluís Márquez and J.A. R. Fonollosa. Machine Translation System Development based on Human Likeness. Proceedings of IEEE/ACL 2006 Workshop on Spoken Language Technology.

Lavie, A., Kenji Sagae, and Shyamsundar Jayaraman. The Significance of Recall in Automatic Metrics for MT Evaluation. (AMTA'04).

LC-STAR (IST-2001-32216). Lexica and Corpora for Speech-to-Speech Translation Components : http://www.lc-star.com

Leturia I., Gurrutxaga A., Alegria I., Ezeiza A. 2007. "CorpEus, a web as corpus tool designed for the agglutinative nature of Basque". WAC3 2007 (Web as a Corpus) workshop. Louvain.

Leturia I., I. San Vicente, X. Saralegi, M. Lopez de Lacalle. 2008. "Basque specialized corpora from the web: language-specific performance tweaks and improving topic precision". 6th International Conference on Language Resources and Evaluation (LREC) - WAC4 workshop. Marrakech.

Li, C.-H., Li, M., Zhang, D., Li, M., Zhou, M., & Guan, Y. (2007). A Probabilistic Approach to Syntax-based Reordering for Statistical Machine Translation. Proceedings of the 45th Annual Meeting of the Association for Computational Linguistics (ACL) (pp. 720-727).

Lin, C.-Y., & Och, F. J. (2004a). Automatic Evaluation of Machine Translation Quality Using Longest Common Subsequence and Skip-Bigram Statistics. Proceedings of the 42nd Annual Meeting of the Association for Computational Linguistics (ACL).

Lin, Chin-Yew and E.H. Hovy. Automatic Evaluation of Machine Translation Quality Using N-gram Co-Occurrence Statistics, NIST, 2002.

Lin, Chin-Yew and Franz Josef Och. ORANGE: a Method for Evaluating Automatic Evaluation Metrics for Machine Translation. (COLING '04).

Lin, D. (2004). A Path-Based Transfer Model for Machine Translation. Proceedings of the 20th International Conference on Computational Linguistics (COLING).

Liu, D., & Gildea, D. (2007). Source-Language Features and Maximum Correlation Training for Machine Translation Evaluation. Proceedings of the 2007 Meeting of the North American Chapter of the Association for Computational Linguistics (NAACL) (pp. 41-48).

Liu, Ding and Daniel Gildea. Syntactic Features for Evaluation of Machine Translation. (ACL'05 Workshop on Intrinsic and Extrinsic Evaluation Measures for Machine Translation and/or Summarization).

Marcu, D., Wang, W., Echihabi, A., & Knight, K. (2006). SPMT: Statistical Machine Translation with Syntactified Target Language Phrases. Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP).

Màrquez L., Mihai Surdeanu, Xavier Carreras and Pere R. Comas. Combination Strategies for Semantic Role Labeling. Journal of Artificial Intelligence Research, 29, 105-151, 2007.

Márquez, L., Comas, P., Giménez, J., & Català, N. (2005). Semantic role labelling as sequential tagging. Proceedings of CoNLL.

Matusov E., Nicola Ueffing, Hermann Ney. 2006. Computing Consensus Translation from Multiple Machine Translation Systems Using Enhanced Hypotheses Alignment. Proceedings of EACL 2006

Mayor A., Jordi Atserias, Eli Comelles 2005. TXALA un analizador libre de dependencias para el castellano. XXI SEPLN.

Mehay, D., Brew, C. (2007). BLEUATRE: Flattening Syntactic Dependencies for MT Evaluation. Proceedings of the 11th Conference on Theoretical and Methodological Issues in Machine Translation (TMI).

Melamed, D. (2001). Empirical Methods for Exploiting Parallel Texts. MIT Press.

Melamed, D. (2004). Statistical Machine Translation by Parsing. Proceedings of the 42nd Annual Meeting of the Association for Computational Linguistics (ACL).

Melamed, D., Clark, S., Way, A., Wu, D., Hall, K., Hearne, M., Carpuat, M., Dreyer, M., Groves, D., Shen, Y., Wellington, B., Burbank, A., & Fox, P. (2005). Final Report of Johns Hopkins 2003 Summer Workshop on Statistical Machine Translation by Parsing (Technical Report). Johns Hopkins University.

Melamed, D., Ryan Green, and Joseph P. Turian. Precision and Recall of Machine Translation. (HLT/NAACL '03).

Mellebeek, B., A. Khasin, K. Owczarzak, J. Van Genabith and A. Way. 2005. Improving online Machine Translation Systems. In Proceedings of the 10th Machine Translation Summit, Phuket, Thailand, pp. 290-297.

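Melli, G., Yang Wang, Yudong Liu, Mehdi M. Kashani, Zhongmin Shi, Baohua Gu, Anoop Sarkar and Fred Popowich. Description of SQUASH, the SFU Question Answering Summary Handler for the DUC-2005 Summarization Task. In Proceedings of Document Understanding Workshop, HLT/EMNLP Annual Meeting, 2005.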

Menno van Zaanen and Harold Somers. 2005. DEMO-CRAT: Deciding between Multiple Outputs Created by Automatic Translation. MT Summit X.

Morin, E., Daille, B., Takeuchi, K. and Kageura, K. (2007) "Bilingual Terminology Mining - Using Brain, not brawn comparable corpora." In Proceedings of the 45th Annual Meeting of the Association of Computational Linguistics, June 2007, Prague, Czech Republic, ACL, pp. 664-671.

Munteanu D.S., Alexander Fraser, Daniel Marcu. 2004. Improved Machine Translation Performance via Parallel Sentence Extraction from Comparable Corpora. In Proceedings of the Human Language Technology and North American Association for Computational Linguistics Conference (HLT/NAACL 2004), Boston, MA.

Narayanan, S. and S. Harabagiu. Question Answering based on Semantic Structures. In Proceedings of the International Conference on Computational Linguistics (COLING), 2004.

Och F. (2005). referenced in http://www.inf.ed.ac.uk/teaching/courses/emnlp/slides/emnlp14.pdf

Och F. and Hermann Ney. 2003. A Systematic Comparison of Various Statistical Alignment Models. Computational Linguistics 29:19-51.

Och, F.J. & Ney, H. 2002. "Improved Statistical Alignment Models". In Proc. of the 38th Annual Meeting of the ACL, pp. 440-447, Hong Kong.

Owczarzak, K., Groves, D., Genabith, J. V., & Way, A. (2006). Contextual Bitext-Derived Paraphrases in Automatic MT Evaluation. Proceedings of the 7th Conference of the Association for Machine Translation in the Americas (AMTA) (pp. 148-155).

Owczarzak, K., Van Genabith, J. & Way, A. (2007b). Labelled Dependencies in Machine Translation Evaluation. In Proceedings of the 2nd Workshop on Statistical Machine Translation, 45th Annual Meeting of the Association for Computational Linguistics (ACL-07), Prague, Czech Republic, pp.104-111.

Owczarzak, K., van Genabith, J., & Way, A. (2007a). Dependency-Based Automatic Evaluation for Machine Translation. Proceedings of SSST, NAACL-HLT/AMTA Workshop on Syntax and Structure in Statistical Translation (pp. 80-87).

Papineni K. et al, Bleu: a method for automatic evaluation of machine translation, IBM Research Report, RC22176, September 2001.

Paul, M., Finch, A., & Sumita, E. (2007). Reducing Human Assessments of Machine Translation Quality to Binary Classifiers. Proceedings of the 11th Conference on Theoretical and Methodological Issues in Machine Translation (TMI).

Pekar V., Mitkov R., Blagoev D., and Mulloni A. (2007). Finding Translations for Low-Frequency Words in Comparable Corpora. In Proceedings of the CONTEXT-07 Workshop on "Contextual Information in Semantic Space Models" (CoSMo-2007). Roskilde, Denmark. pp.17-25

Popovic, M., & Ney, H. (2007). Word Error Rates: Decomposition over POS classes and Applications for Error Analysis. Proceedings of the Second Workshop on Statistical Machine Translation (pp. 48-55). Prague, Czech Republic: Association for Computational Linguistics.

Pradhan, S., Hacioglu, K., Krugler, V., Ward, W., Martin, J., Jurafsky, D., Support Vector Learning for Semantic Argument Classification, Machine Learning journal, Special issue on Speech and Natural Language Processing, 2005

Quirk, C. (2004). Training a Sentence-Level Machine Translation Confidence Metric. Proceedings of the 4th International Conference on Language Resources and Evaluation (LREC) (pp. 825-828).

Quirk, C., Menezes, A., & Cherry, C. (2005). Dependency Treelet Translation: Syntactically Informed Phrasal SMT. Proceedings of the 43rd Annual Meeting of the Association for Computational Linguistics (ACL) (pp. 271-279).

Rapp, R. (1995) "Identifying word translations in non-parallel texts." In ACL, p.320-322.

Rapp, R. (1999) "Automatic identification of word translations from unrelated English and German corpora." In Proceedings of the 37th annual meeting of the Association for Computational Linguistics, p.519-526, June 20-26, 1999, College Park, Maryland.

Reeder, F., Miller, K., Doyon, J., & White, J. (2001). The Naming of Things and the Confusion of Tongues: an MT Metric. Proceedings of the Workshop on MT Evaluation "Who did what to whom?" at Machine Translation Summit VIII (pp. 55-59).

Resnik, P., Smith, N.A.: The web as a parallel corpus. Computational Linguistics 29 (2003) 349-380.

Russo-Lassner, G., Lin, J., & Resnik, P. (2005). A Paraphrase-Based Approach to Machine Translation Evaluation (LAMP-TR-125/CS-TR-4754/UMIACS-TR-2005-57) (Technical Report). University of Maryland, College Park. http://lampsrv01.umiacs.umd.edu/pubs/TechReports/LAMP\ 125/LAMP\ 125.pdf.

Sánchez-Martínez, F., Pérez-Ortiz, J. A., & Forcada, M. L. (2007). Integrating Corpus-based and Rule-based Approaches in an Open-source Machine Translation System. Proceedings of METIS-II Workshop: New Approaches to Machine Translation (pp. 73-82). Leuven, Belgium.

Saralegi, X., Leturia, I. (2007). Kimatu, a tool for cleaning non-content text parts from HTML docs. In Building and exploring web corpora, Proceedings of the 3rd Web as Corpus workshop. Louvain-la-Neuve, Belgium: Presses universitaires de Louvain, pp. 163--167.

Saralegi, X., I. San Vicente, M. López de Lacalle, 2008. "Mining Term Translations from Domain Restricted Comparable Corpora". SEPLN 2008. Madrid.

Sarasola K. 2000. Strategic priorities for the development of language technology in minority languages. Invited speaker. Proceedings of Workshop on "Developing language resources for minority languages: re-useability and strategic priorities". Second International Conference on Language Resources and Evaluation (LREC 2000). Athens, Greece 2000.

Sarasola K. 2004. Europa, la ingeniería informática tiene un mercado abierto: Tecnología de la lengua. Colegio Oficial de Ingenieros en Informática COIEE "Retos de los ingenieros en informática"

Schafer, C., & Yarowsky, D. (2003). Statistical machine translation using coercive two-level syntactic transduction. Proceedings of EMNLP.

Schütz J. Artificial Cognitive MT Post-Editing Intelligence. AMTA 2008.

Scott, B. The Logos Model: An Historical Perspective. In: Machine Translation 18 (2003), pp. 1-72.

Shao, Li and Ng, Hwee Tou (2004) "Mining New Word Translations from Comparable Corpora." In Proceedings of the 20th International Conference on Computational Linguistics (COLING 2004). (pp. 618-624). University of Geneva, Geneva, Switzerland.

Simard, M., Goutte, C., and Isabelle, P. (2007a). Statistical phrase-based post-editing. In Human Language Technologies 2007: The Conference of the North American Chapter of the Association for Computational Linguistics; Proceedings of the Main Conference, pages 508-515, Rochester, USA, April.

Simard, M., Ueffing, N., Isabelle, P., & Kuhn, R. (2007b). Rule-based translation with statistical phrase-based post-editing. ACL 2007: proceedings of the Second Workshop on Statistical Machine Translation, June 23, 2007, Prague, Czech Republic

Somers H. An overview of EBMT, in M. Carl and A. Way (eds) Recent Advances in Example-Based Machine Translation, Dordrecht (2003): Kluwer, pages 3-57.

Specia, L., Sankaran, B., & das Graças Volpe Nunes, M. (2008). n-Best Reranking for the Efficient Integration of Word Sense Disambiguation and Statistical Machine Translation. In Computational Linguistics and Intelligent Text Processing, vol. 4919/2008 of Lecture Notes in Computer Science, 399-410. Springer Berlin / Heidelberg.

Specia, L., Stevenson, M., & das Graças Volpe Nunes, M. (2007). Learning Expressive Models for Word Sense Disambiguation. Proceedings of the 45th Annual Meeting of the Association for Computational Linguistics (ACL) (pp. 41-48).

Stolcke, A. (2002). SRILM - an extensible language modeling toolkit. Proceedings of ICSLP.

Stroppa N., Groves D., Way A., Sarasola K. 2006. Example-Based Machine Translation of the Basque Language. AMTA. 7th conference of the Association for Machine Translation in the Americas. pp.232--241.

Stroppa, N., van den Bosch, A., & Way, A. (2007). Exploiting Source Similarity for SMT using Context-Informed Features. Proceedings of the 11th Conference on Theoretical and Methodological Issues in Machine Translation (TMI) (pp. 231-240).

Surdeanu, M., Roser Morante and Lluís Màrquez. Analysis of Joint Inference Strategies for the Semantic Role Labeling of Spanish and Catalan. In Proceedings of the 9th International Conference on Computational Linguistics and Intelligent Text Processing, CICLing-2008, LNCS 4919, 206-218, Haifa, Israel, 2008.

Surdeanu, M., Sanda Harabagiu, John Williams, and Paul Aarseth. Using Predicate-Argument Structures for Information Extraction. In Proceedings of ACL 2003, Sapporo, Japan, 2003.

Talbot David (2003) "Learning Translations from Comparable Corpora". Thesis.

Talvensaari Tuomas, Ari Pirkola, Kalervo Järvelin, Martti Juhola, Jorma Laurikkala: Focused web crawling in the acquisition of comparable corpora. Inf. Retr. 11(5): 427-445 (2008).

TC-STAR. Technology and Corpora for Speech to Speech Translation. http://www.tc-star.org

Temperley, D. 2007. Minimization of Dependency Length in Written English. Cognition 105, 300-333.

Tiedemann, Jörg (1998), "Extraction of translation equivalents from parallel corpora", in: Proceedings of the 11th Nordic conference on computational linguistics, Copenhagen 28-29 January 1998 (NODALIDA'98), Center for Sprogteknologi, University of Copenhagen. 120-128.

Turian, J.P., Luke Shen, and I. Dan Melamed. Evaluation of Machine Translation and its Evaluation. (MT SUMMIT IX '03).

Venkatapathy, S., & Bangalore, S. (2007). Three models for discriminative machine translation using Global Lexical Selection and Sentence Reconstruction. Proceedings of SSST, NAACL HLT/AMTA Workshop on Syntax and Structure in Statistical Translation (pp. 152-159).

Vickrey, D., L. Biewald, M. Teyssier, and D. Koller. Word-Sense Disambiguation for Machine Translation. In Proceedings of the Human Language Technology Conference & Conference on Empirical Methods in Natural Language Processing (HLT/EMNLP'05), Vancouver, Canada, 2005.

Villarejo, L., Màrquez, L., Agirre, E., Martínez, D., Magnini, B., Strapparava, C., McCarthy, D., Montoyo, A., & Suárez, A. (2004). The "meaning" system on the english all-words task. Proceedings of SENSEVAL-3 Third International Workshop on the Evaluation of Systems for the Semantic Analysis of Text, Task: English All Words.

Way, A. and N. Gough. 2005. Comparing Example-Based and Statistical Machine Translation. Journal of Natural Language Engineering (in press).

Williams B., Sarasola K., Ó'Cróinin D., Petek B. 2001. Speech and Language Technology for Minority Languages. Proceedings of Eurospeech 2001

Wu, D. (1997). Stochastic inversion transduction grammars and bilingual parsing of parallel corpora. Computational Linguistics, 23, 377-404.

Wu, D. (2000). Bracketing and aligning words and constituents in parallel text using Stochastic Inversion Transduction Grammars. Parallel Text Processing: Alignment and Use of Translation Corpora. Dordrecht: Kluwer.

Xue, N. and Martha Palmer. 2004. Calibrating Features for Semantic Role Labelling, in Proceedings of 2004 Conference on Empirical Methods in Natural Language Processing, in conjunction with ACL'04. Barcelona, Spain

Yamada, K. (2002). A Syntax-based Translation Model. Doctoral dissertation, University of Southern California.

Yamada, K., & Knight, K. (2001). A Syntax-based Statistical Translation Model. Proceedings of the 39th Annual Meeting of the Association for Computational Linguistics (ACL).

Yamada, K., & Knight, K. (2002). A Decoder for Syntax-based Statistical MT. Proceedings of the 40th Annual Meeting of the Association for Computational Linguistics (ACL).

Zhang, H., & Gildea, D. (2004). Syntax-based alignment: Supervised or unsupervised? Proceedings of COLING.

Zhang, Ying., Wu, Ke., Gao, Jianfeng, and Vines, P. 2006, Automatic Acquisition of Chinese-English Parallel Corpus from the Web. In Proceedings of ECIR-06, 28th European Conference on Information Retrieval.

Zhou, L., Lin, C.-Y., & Hovy, E. (2006). Re-evaluating Machine Translation Results with Paraphrase Support. Proceedings of the Conference on Empirical Methods in Natural Language Processing (pp. 77-84).
